Booz Allen Hamilton is a leading consulting firm that focuses on using technology to drive impactful solutions. They are seeking a Data Engineering Intern to assist in creating a secure and scalable enterprise data lake, supporting data aggregation, and enabling advanced analytics for various client missions.
Responsibilities:
- Assist in the design, development, and implementation of an enterprise data lake that serves as a single and authoritative source of truth for a government command
- Support data engineering efforts to aggregate, catalog, and secure multiple data sets, enabling advanced analytics and data-driven decision-making
- Support the identification and integration of authoritative data sources and assist in engineering Extract, Transform, Load (ETL) pipelines to ingest data into the enterprise data lake
- Leverage Python, Apache Spark, AWS Glue, and Amazon S3 to process, transform, and store structured and semi-structured data
- Assist in building and configuring data pipelines and validating data quality, integrity, and accessibility
- Contribute to the development of metadata, cataloging, and documentation to enhance data discovery and usability
- Support the use of AWS Athena for querying data and AWS QuickSight to develop visualizations that demonstrate insights, use cases, and process improvements
- Document ETL processes, data sources, and visualization artifacts
- Prepare and deliver a final presentation and prototype demonstration showcasing the enterprise data lake, pipelines, and visualizations within the Booz Allen development environment
- Ensure all work complies with internal standards, security requirements, and government client expectations
Requirements:
- Experience using Python for data manipulation and analysis
- Experience with big data or cloud-based data processing tools, such as Apache Spark or AWS services
- Knowledge of ETL processes and data ingestion pipelines
- Ability to work within a secure development environment and follow data handling guidelines
- Ability to obtain a Secret clearance
- Scheduled to obtain a Bachelor's degree in Data Science, Computer Science, Information Systems, Engineering, or Analytics by Spring 2027
- Experience working with AWS services, such as Glue, S3, Athena, or QuickSight
- Experience creating or supporting enterprise data lakes or centralized data repositories
- Experience developing data visualizations to support decision-making
- Experience with Microsoft Excel for data organization and analysis
- Ability to document technical processes and communicate results through presentations
- Ability to collaborate effectively with peers, mentors, and cross-functional teams
- Ability to pay strict attention to detail
- Ability to be articulate, organized, and professional when engaging with technical and non-technical stakeholders
- Possession of strong problem-solving skills