Lineate is a US-based international software development company with over two decades of experience. They are seeking a Senior Java + Python Software Engineer to join their established ML team, focusing on building data pipelines and preparing data that powers ML models in production.
Responsibilities:
- Design, develop, and maintain scalable backend services using Java and Python
- Build and optimize data pipelines for large-scale data processing (primarily with PySpark)
- Collaborate closely with ML engineers to prepare and transform data for ML models
- Work with large datasets to support search relevance and optimization systems
- Improve system performance, scalability, and reliability
- Contribute to architecture decisions and technical design
- Write clean, maintainable, and well-documented code
Requirements:
- 6+ years of commercial software development experience
- Strong hands-on experience with both Java and Python (required)
- Solid experience with Spark / PySpark or similar data processing frameworks
- Proven experience building data pipelines and working with large datasets
- Strong understanding of backend development, system design, and distributed systems
- Experience with APIs and microservices
- Good knowledge of data structures, algorithms, and performance optimization
- Ability to work in hours close to the New York timezone (at least 6 hours overlap with NY team)
- Experience with search and relevance optimization, including: Elasticsearch / Solr (lexical search), Semantic search, vector search, or RAG-based systems, Search relevance tuning
- Experience supporting ML workflows, such as: Data preparation or feature engineering, Recommendation systems or user behavior prediction
- Experience with Databricks (preferred) or similar environments: AWS EMR, Hadoop, Spark ecosystem