OXMIQ Labs is a startup focused on designing GPU and AI silicon for large-scale model inference and training. The Principal Performance Modeling Architect will own the performance analysis of the system-solution, extending the modeling platform and influencing architecture decisions through simulation-backed data.
Responsibilities:
- Own and extend OXMIQ's system-solution performance modeling platform end to end — Speed-of-Light modeling of performance, energy, and cost-per-token across heterogeneous accelerator hardware and cluster topologies
- Extend the platform beyond LLM inference to training workloads and vision-transformer / multimodal models, defining the modeling approach for each new workload class
- Influence silicon and system architecture decisions by turning proposed designs into defensible, simulation-backed numbers ahead of capital commitment
- Make the platform more extensible and agent-driven, including exposing analysis capabilities through the OxCapsule interface
- Validate and reason about simulation results — correlating model outputs against the observed behavior of real workload frameworks (e.g., vLLM, SGLang, training stacks) and continuously tightening model fidelity
- Set and own the platform's technical direction in line with the executive vision, uphold engineering and code-quality standards, and provide technical guidance to interns supporting the work
- Serve as a technical point of contact in selected proposal and partner engagements