New
Software Engineer II
![]() | |
![]() United States, Texas, Irving | |
![]() 7000 State Highway 161 (Show on map) | |
![]() | |
OverviewAzure VMware Solution (AVS) empowers enterprises to run VMware workloads natively on Azure. We are seeking a Site Reliability Engineer (SRE) to join our AVS engineering team. This role is ideal for engineers who thrive in high-scale, hybrid cloud environments and are passionate about improving service reliability, operational excellence, and customer experience. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesOwn the reliability, availability, and performance of AVS infrastructure and services.Lead incident response and root cause analysis (RCA) for complex issuesDesign and implement automation to improve detection, mitigation, and recovery from service-impacting events.Develop and maintain monitoring, alerting, and telemetry systems to proactively identify and resolve issues.You participate in onboarding, code/design reviews, and regular meetings with the engineering teams that develop and manage those products. You independently develop code or scripts that automate the performance of repetitive and easily scalable operations processes. You design, develop, and maintain telemetry pipelines and monitoring tools that detail operations metrics. You develop, test, troubleshoot, and implement changes to optimize code and improve products. You respond to incidents during regular on-call rotations. |