Signapse is a fast-paced tech-for-good startup focused on creating accessibility solutions for Deaf communities through AI technology. They are seeking a Machine Learning Engineer to optimize and build infrastructure for real-time sign language video generation, working across the full ML inference stack.
Responsibilities:
- Profile and optimise deep learning models used for sign language video generation
- Reduce inference latency using quantisation, pruning, mixed precision, and kernel optimisation
- Improve GPU utilisation and throughput across inference pipelines
- Work closely with ML researchers to ensure models are production-ready
- Build and maintain scalable model serving systems on GPU clusters
- Design autoscaling infrastructure to meet real-time SLAs
- Contribute to model deployment pipelines, versioning, and rollback strategies
- Develop benchmarking frameworks for tracking inference performance
- Identify bottlenecks across the ML pipeline and eliminate latency hotspots
- Implement performance monitoring and alerting for production systems
- Evaluate new hardware accelerators and inference run times
- Work with the research team to expand sign languages and digital signers
- Architect systems that allow rapid onboarding of new languages
- Build low-latency infrastructure that scales to hundreds of concurrent streams