ML Model Development: Design, develop and implement scalable machine learning models to solve complex business problems
MLOps and Production: Build and maintain robust ML pipelines, ensuring deployment, monitoring and maintenance of models in production
Feature Engineering: Create and optimize features using DBT and PySpark, working with large volumes of data
Workflow Orchestration: Develop and manage data and ML pipelines using Apache Airflow
Data Processing: Perform large-scale distributed data processing with PySpark
Collaboration: Work closely with data scientists, data engineers and product teams to deliver end-to-end solutions
Optimization: Monitor model performance, identify degradation and implement continuous improvements
Documentation: Maintain clear technical documentation on architecture, models and processes
Requirements
Strong experience developing and deploying machine learning models in production
Advanced Python and ML/DL libraries (scikit-learn, TensorFlow, PyTorch, XGBoost, etc.)
Familiarity with GenAI architectures: Amazon Bedrock or similar, RAG pipelines, vector databases (pgvector, OpenSearch, Pinecone), and integration with LLM APIs
API and microservices architecture (FastAPI, API Gateway, ECS/EKS)
PySpark: Proven experience in distributed data processing
Advanced SQL and experience with PostgreSQL
Apache Airflow: Building and managing complex DAGs
AWS Cloud: Experience with services such as SageMaker, S3, EC2, Lambda, ECR/ECS
Experience with Snowflake for data storage and analytical processing
Knowledge of DBT for data transformation and modeling
Code versioning with Git and good development practices
Tech Stack
Airflow
Apache
AWS
Cloud
EC2
Postgres
PySpark
Python
PyTorch
Scikit-Learn
SQL
Tensorflow
Benefits
Health and dental insurance
Meal and food allowance
Childcare assistance
Extended parental leave
Partnerships with gyms and health and wellness professionals via Wellhub (Gympass) TotalPass
Profit Sharing (PLR)
Life insurance
Continuous learning platform (CI&T University)
Discount club
Free online platform dedicated to promoting physical and mental health and wellbeing