AMD builds products that accelerate next-generation computing experiences, including AI and data centers. The company is seeking a Principal Software Development Engineer to improve how models train and run on its GPUs, shaping the ROCm ecosystem and improving AI system performance.
Responsibilities:
- Own the AI software stack: Establish best practices and drive performance from low-level GPU kernels to large-scale distributed systems. Use modern LLMs and agent-based tooling where they accelerate development and tuning of the ROCm ecosystem
- Accelerate foundation models and agents: Improve training, post-training, and inference for LLMs and autonomous AI workloads so AMD is the default platform for the most demanding use cases
- Co-design hardware and software: Partner on the full lifecycle, from GPU architecture input to software for new accelerators, and engage with the broader AI community to keep AMD at the forefront