Design, build, and maintain scalable ETL pipelines using Python and PySpark within Palantir Foundry.
Develop and refine "Objects" (e.g., Soldiers, Equipment, Units) and their relationships to ensure a clean, accurate digital representation of Army operations.
Assist with the cleanup of legacy data models and implement automated data quality checks.
Build low-code/no-code applications and dashboards that allow senior leaders to access high-quality data in real time.
Implement Medallion Lakehouse concepts to organize data into Bronze, Silver, and Gold layers for downstream analytics.
Requirements
5–8 years of experience in data engineering, with expert proficiency in Python, SQL, and PySpark.
Demonstrated experience with Army Vantage (Palantir Foundry) and the Advana ecosystem.
Active DoD Secret clearance required.
Familiarity with Army data sources such as IPPS-A, GCSS-Army, or GFEBS.
Minimal travel (approximately 10%).
Three days in Crystal City, VA, or Aberdeen, MD. Open to remote.