Zachary Piper Solutions is a leading federal technology and services company that safeguards vital national healthcare systems. They are seeking a Data Engineer II to provide transformative solutions to clients' big data obstacles and help advance their missions.
Responsibilities:
- Build and maintain PySpark data pipelines within the Databricks environment, optimizing Spark job performance and resource utilization while identifying and resolving bottlenecks and backend inefficiencies
- Design, develop, and support robust backend software components and services, ensuring high levels of functionality, performance, and scalability
- Conduct research and develop proof‑of‑concept solutions across the data domain
- Write clean, well‑organized, and maintainable code in alignment with established coding standards and best practices
- Perform detailed code reviews, offering constructive feedback and identifying risks or areas for improvement
- Troubleshoot and resolve defects, proactively detecting and addressing issues before they affect end users
- Create and maintain clear, thorough, and up‑to‑date technical documentation
Requirements:
- 5+ years of related experience
- Solid understanding of R
- Solid understanding of data modeling, ETL process, and distributed computing
- Strong experience with Python / Apache Spark
- Strong understanding of software design patterns, data structures, and algorithms
- Experience with Agile development methodologies
- Ability to work independently as well as in a team
- Strong problem-solving and analytical skills
- Strong verbal and written communication skills
- Bachelor's degree in computer science, Computer Engineering or related field
- Must be able to obtain a Public Trust clearance