We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Senior Systems Engineer

Microsoft
United States, California, Mountain View
Jul 19, 2025
OverviewMicrosoft Silicon Cloud Hardware Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox Live, Skype, OneDrive and the Microsoft Azure platform globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide and we are looking for passionate, high-energy engineers to help achieve that mission. We are seeking a seasoned High-Performance Computing (HPC) professional to join Microsoft's Silicon Development Compute Solutions (SDCS) team, embedded within the broader silicon engineering organization. As a Senior Systems Engineer, you will play a critical role in designing, deploying, and managing scalable, Linux-based compute infrastructure that supports silicon design workloads. This role is central to ensuring the availability, performance, and efficiency of HPC services that power Microsoft's silicon innovation. You will work closely with CAD, Operations, Engineering, and cross-functional teams to deliver resilient and high-performing infrastructure solutions that meet the demands of a globally distributed design organization. If you are passionate about Linux systems at scale, HPC infrastructure, and enabling cutting-edge silicon design, this is a unique opportunity to make a significant impact.
ResponsibilitiesAdminister and optimize Linux environments (Red Hat, Rocky Linux, CentOS) across on-premises and Azure cloud infrastructure, including installation, configuration, patching, and troubleshooting.Help manage engineering services such as Exceed TurboX, VNC, and authentication platforms (Red Hat IdM, NIS), ensuring high availability, performance, and user satisfaction.Lead HPC infrastructure security compliance planning and remediation, aligning with compliance standards and operational requirements.Collaborate with storage teams to diagnose and resolve Linux-related issues involving Isilon, Pure Storage, and Azure NetApp Files (ANF).Implement and maintain monitoring and alerting systems to ensure system reliability, performance, and proactive incident response.Partner with Engineering and CAD teams to address complex technical challenges and optimize compute and storage performance for design workloads.Develop and maintain comprehensive documentation for system configurations, operational procedures, and support workflows.Mentor team members, providing technical guidance and promoting a culture of continuous improvement and operational excellence.Communicate effectively across global teams, demonstrating strong written and verbal communication skills and a collaborative mindset.
Applied = 0

(web-6886664d94-5gz94)