Quality Reliability Engineer
United States, Texas, Austin
Overview

Microsoft's Cloud business is expanding, and we are seeking an experienced Quality Reliability Engineer to join our team! The Cloud Supply Chain (CSCP) organization is responsible for enabling the hardware infrastructure underlying this growth, including AI. CSCP's vision is to empower customers to achieve more by delivering Cloud and AI capabilities at scale. Our mission is to deliver the world's computer with an industry-leading supply chain. The CSCP organization is responsible for traditional supply chain functions such as plan, source, make, and deliver, and also manages supportability (spares), sustainability, and decommissioning of datacenter assets worldwide. We deliver the core infrastructure and foundational technologies for Microsoft's more than 200 online businesses, including Bing, MSN, Office 365, Xbox Live, OneDrive, and the Microsoft Azure platform for external customers. Our infrastructure is supported by more than 300 datacenters around the world that enable services for more than 1 billion customers in over 90 countries.

Within CSCP, the Spares Supply Chain organization is reinventing and transforming the cloud service parts supply chain. Our goal is to ensure the right spare parts are in the right place at the right time to support global capacity requirements, so that our customers have the cloud capacity they need when they need it.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities

What you will do:

- Collaborate closely with Engineering and Supply Chain teams to ensure high Azure Cloud fleet health and achieve "nodes in service" targets supporting our customers
- Perform quality investigations and collaboratively develop solutions to mitigate hardware quality impacts
- Program-manage remediation solutions effectively while maintaining SLA KPIs
- Drive improvements by analyzing telemetry from various quality data sources and implementing RCAs
- Promote continuous improvement by incorporating feedback from internal and external customers
- Identify and drive implementation of mitigation solutions, including playbooks for operational teams
- Participate in serviceability discussions and support hardware quality improvements
- Conduct Rhythm of Business meetings to present topics to stakeholders and leadership