
Engineering - DXR Engineering - Systems Engineer - Associate - Dallas

The Goldman Sachs Group
United States, Texas, Dallas
Apr 08, 2025

The Procmon Platform delivers a highly scalable and reliable ecosystem for scheduling business-critical jobs across Goldman Sachs.

Our platform is responsible for scheduling tens of millions of daily jobs for Global Banking & Markets, Asset & Wealth Management, Risk and other business and engineering functions.

The ecosystem includes a number of high-availability, very large-scale systems, including:



  • Job scheduling
  • Event streaming
  • Log shipping
  • Data warehouses
  • Security infrastructure



RESPONSIBILITIES



  • Own technical operations for systems that manage hundreds of thousands of compute cores
  • Build observability for new deployments to ensure robustness from day one, as well as mature deployments to identify and implement improvements
  • Troubleshoot and resolve issues with block devices, file descriptors, and packet loss
  • Lead real-time outage investigations and present postmortems to senior management
  • Define SLIs and SLOs and partner with development teams to ensure systems are sufficiently well designed and instrumented
  • Partner with our development team throughout development and operations
  • Plan and manage deployments and migrations (including end-of-life programs)
  • Plan and implement robust business continuity and security programs
  • Provide regional coverage for the Procmon platform and participate in on-call support



REQUIREMENTS



  • Excellent problem-solving and automation skills
  • Strong Linux fundamentals and system administration skills
  • Good networking fundamentals (familiarity with TCP/IP, IP routing, firewalls, secure tunneling protocols)
  • Experience working with distributed computing systems and Cloud computing environments
  • Proficiency in at least one programming language; the team uses a mix of Go, Python and Erlang
  • Able to operate effectively in a mission-critical, highly regulated financial services environment
