Zachary Piper Solutions is a leading federal technology and services company that safeguards vital national healthcare systems by delivering advanced cloud, payment processing, and machine‑learning solutions. The Data Engineer II will be responsible for providing transformative solutions to clients' big data obstacles and help advance their missions.
Responsibilities:
- Build and maintain PySpark data pipelines within the Databricks environment, optimizing Spark job performance and resource utilization while identifying and resolving bottlenecks and backend inefficiencies
- Design, develop, and support robust backend software components and services, ensuring high levels of functionality, performance, and scalability
- Conduct research and develop proof‑of‑concept solutions across the data domain
- Write clean, well‑organized, and maintainable code in alignment with established coding standards and best practices
- Perform detailed code reviews, offering constructive feedback and identifying risks or areas for improvement
- Troubleshoot and resolve defects, proactively detecting and addressing issues before they affect end users
- Create and maintain clear, thorough, and up‑to‑date technical documentation
Requirements:
- 5+ years of related experience
- Solid understanding of R
- Solid understanding of data modeling, ETL process, and distributed computing
- Strong experience with Python / Apache Spark
- Strong understanding of software design patterns, data structures, and algorithms
- Experience with Agile development methodologies
- Ability to work independently as well as in a team
- Strong problem-solving and analytical skills
- Strong verbal and written communication skills
- Bachelor's degree in computer science, Computer Engineering or related field
- Must be able to obtain a Public Trust clearance