General Motors is seeking an experienced Senior Machine Learning Engineer specializing in ML Training Infrastructure. This role involves designing and building scalable AI/ML platform infrastructure to support advanced AI research and model development initiatives.
Responsibilities:
- Design and development of scalable, reliable, high-performance ML framework to support model training at scale
- Model training performance analysis and optimization solutions to scale distributed training workflows and maximize resource utilization across heterogeneous hardware environments, and save cost
- Raise the bar on system observability, debuggability, and operational excellence, and user experience
- Collaborate with cross-functional teams to integrate new features and technologies into the platform