Scalence L.L.C. is seeking a Python FastAPI Engineer to design, build, and operate scalable API services for machine learning-driven risk models. The role involves developing real-time scoring APIs, batch scoring pipelines, and ensuring production readiness with a focus on monitoring and model governance.
Responsibilities:
- Design and develop REST APIs using Python + FastAPI to serve real-time model scoring for individual loans/customers and support batch scoring workloads
- Integrate APIs with upstream/downstream systems with strong input validation and error handling
- Support model hosting and execution for Python-based models with flexible deployment patterns
- Implement model operational capabilities: versioning/lineage, audit trails, reason codes/explainability hooks, and champion/challenger support where needed
- Build and enable observability: structured logging, metrics, tracing, and monitoring for performance, availability, and usage. Assist in implementing model performance and drift monitoring pipelines
Requirements:
- Strong hands-on experience with Python and FastAPI for building production REST services
- Experience serving or integrating ML models in production (risk scoring, classification/regression, or similar)
- Solid engineering practices: unit/integration testing, Git-based workflows, CI/CD, and API documentation (OpenAPI/Swagger)
- Familiarity with model governance concepts: version control/lineage, monitoring, explainability/reason codes, audit logs
- Experience with AWS and/or model execution environments such as SageMaker
- Advanced understanding of API design for production ML systems
- Experience in real-time scoring and batch processing pipelines