Microsoft, a leading technology company, is seeking a Senior Software Engineer for its AI Infrastructure team. The role involves designing and developing core AI Infrastructure services, ensuring high performance and reliability, and collaborating with other teams to enhance machine learning capabilities.
Responsibilities:
- Design and develop the core distributed and in-cluster AI Infrastructure services that support large-scale AI training and inference
- Develop, test, and maintain control plane services written in C#, hosted on Service Fabric or Kubernetes (AKS) clusters
- Enhance systems and applications to ensure high stability, efficiency, and maintainability, low latency, and tight cloud security
- Provide operational support and DRI (on-call) responsibilities for the service
- Develop and foster a deep understanding of the machine learning concepts, use cases, and services relevant to our customers
- Collaborate closely with service engineers, product managers, and internal applied research and data science teams within Microsoft to build better solutions together
- Provide vision, expertise, and technical leadership to other team members
- Help grow talent in these areas