Rex.zone is hiring a remote, US-based STEM Engineer to build and maintain engineering systems that support AI/ML workflows. The role involves designing data workflows and developing tools to enhance training data quality and model performance.
Responsibilities:
- Own end-to-end delivery for systems supporting LLM training pipelines
- Build ETL/ELT jobs, data validation checks, dataset versioning, and automated tests
- Implement QA evaluation workflows (gold sets, consensus methods, review queues)
- Develop guideline compliance checks and content policy enforcement utilities
- Integrate labeling throughput, quality, and cost signals from partners and vendors
Requirements:
- Mid-to-senior-level experience in software engineering or data engineering on production systems
- Strong Python and SQL skills; experience building reliable pipelines and tests
- Familiarity with ML workflows, dataset curation, and evaluation concepts
- Ability to define measurable quality metrics and build operational tooling
- Strong written communication and ability to work asynchronously (remote)
- Experience with data labeling programs, annotation platforms, or vendor integrations
- Exposure to LLM evaluation, prompt evaluation, and rubric-based scoring
- Understanding of RLHF concepts and sampling strategies
- Familiarity with computer vision annotation formats and quality controls
- Background in content safety labeling, policy enforcement, or trust & safety