GEICO is a leading insurance company dedicated to providing quality coverage and exceptional service to millions of customers. They are seeking a Senior Data Engineer to build and maintain robust data systems that enhance their analytics capabilities while driving innovation and best practices within the team.
Responsibilities:
- Scope, design, and build scalable, resilient distributed systems
- Utilize programming languages like Python, SQL, and NoSQL databases, along with Apache Spark for data processing, dbt for data transformation, container orchestration services such as Docker and Kubernetes, and various Azure tools and services
- Use your technical expertise to shape product definitions and drive towards optimal solutions
- Engage in cross-functional collaboration throughout the entire development lifecycle
- Lead in design sessions and code reviews with peers to elevate the quality of engineering across the organization
- Define, create, and support reusable data components and patterns that align with both business and technology requirements
- Build a world-class analytics platform to satisfy reporting needs
- Mentor other engineers
- Consistently share best practices and improve processes within and across teams
Requirements:
- Advanced programming experience and big data experience within Python, SQL, dbt, Spark, Kafka, Git, Containerization (Docker and Kubernetes)
- Experience with orchestration tools such as Apache Airflow or similar technologies to automate and manage complex data pipelines
- Proven understanding of microservices oriented architecture and REST APIs and GraphQL
- Experience architecting and designing new and current systems
- Advanced understanding of DevOps concepts including Azure DevOps framework and tools
- Experience with CI/CD to ensure smooth and continuous integration and deployment of data solutions
- Advanced PowerShell scripting skills
- Advanced understanding of monitoring concepts and tooling
- Advanced understanding of security protocols and products
- In-depth knowledge of CS data structures and algorithms
- Knowledge of developer tooling across the data development life cycle (task management, source code, building, deployment, operations, real-time communication)
- Strong problem-solving ability
- Ability to excel in a fast-paced environment
- 4+ years of professional experience in data engineering, programming languages and developing with big data technologies
- 3+ years of experience with architecture and design
- 3+ years of experience with AWS, GCP, Azure, or another cloud service
- 2+ years of experience in Big-data tools like Spark and Databricks
- Bachelor's degree in Computer Science, Information Systems, or equivalent education or work experience
- Experience with Apache Iceberg for managing large-scale tabular data in data lakes is a plus
- Experience with business intelligence tools (Power BI or Superset preferred)