First American is a leading title insurance company that prioritizes its people and has a strong, inclusive culture. The Data Engineer will design and develop scalable data platforms and pipelines, collaborating with cross-functional teams to deliver efficient data solutions that meet business objectives.
Responsibilities:
- Design, develop, and maintain scalable ETL/ELT pipelines and data processing frameworks
- Build and support big data solutions using Python, PySpark, Spark SQL, and SQL, leveraging platforms such as Databricks
- Develop data integration workflows using modern ETL tools
- Design and manage graph database solutions using Neo4j
- Develop and optimize database objects such as stored procedures, views, and materialized views
- Implement CI/CD pipelines for automated build, testing, deployment, and monitoring
- Support DevOps and release management through version control and automation tools
- Work with cloud-based data platforms in AWS and Azure environments
- Automate workflows and improve data delivery architecture for scalability and performance
- Troubleshoot issues, perform root-cause analysis, and resolve data quality problems
- Ensure data accuracy and consistency through testing and validation processes
- Participate in Agile SDLC processes, including sprint planning, development, and deployment
- Collaborate with business stakeholders to gather requirements and deliver data solutions
- Provide production support, including off-hours support as required
Requirements:
- Bachelor's degree in Computer Science, Information Systems, Engineering, or a related field (or equivalent experience)
- 3–5 years of experience in Data Engineering or a related role
- Strong experience with ETL pipelines and data warehouse solutions
- Experience with distributed data processing and cloud analytics platforms
- Experience with CI/CD implementation and deployment automation
- Experience working with AWS and Azure cloud platforms
- Strong programming skills in Python
- Hands-on experience with PySpark and Spark SQL
- Experience working with Databricks
- Proficiency in SQL and T-SQL, including performance optimization
- Experience developing stored procedures, views, and materialized views
- Experience building ETL/data integration pipelines
- Experience working with Neo4j or graph databases
- Knowledge of CI/CD and DevOps tools such as Git, Azure DevOps, or GitHub Actions
- Strong understanding of data warehousing and big data processing concepts
- Strong communication and collaboration skills
- Self-motivated with strong problem-solving abilities
- Ability to manage multiple priorities in a fast-paced environment
- Experience working in Agile development environments
- Strong focus on data quality, validation, and accuracy
- Familiarity with Talend