Ardent Eagle Solutions (AES) is seeking an experienced Senior Data Engineer to support the Small Business Administration (SBA), Office of Inspector General (OIG), Technology Solutions Division. This position is responsible for designing, implementing, and maintaining modern Azure-based cloud data architecture supporting fraud analytics, machine learning, and investigative operations.
Responsibilities:
- Design, develop, implement, and maintain secure, scalable Azure-based data architecture
- Develop and maintain cloud-native ELT/ETL pipelines utilizing Azure Synapse Analytics and Azure Machine Learning
- Design modern, code-first data engineering solutions utilizing Python SDKs, REST APIs, CLI tools, and Infrastructure-as-Code methodologies
- Build, optimize, and maintain cloud data lakes and warehouse environments
- Implement robust error handling, logging, validation, and monitoring across production data pipelines
- Manage source control and CI/CD practices supporting data engineering assets
- Optimize ingestion, processing, transformation, and storage of structured and unstructured datasets including Parquet and related formats
- Develop reusable Python-based data engineering solutions utilizing Pandas and modern data processing frameworks
- Create self-service capabilities supporting analyst access to enterprise datasets
- Collaborate closely with Data Scientists to optimize infrastructure supporting machine learning workloads
- Develop data dictionaries, entity relationship diagrams, pipeline documentation, and standard operating procedures
- Expand cloud environments through onboarding of new datasets and services
- Evaluate emerging AI-assisted development tools and data engineering technologies
- Recommend architectural improvements that improve scalability, reliability, security, and cost efficiency
Requirements:
- Five (5)+ years administering SQL databases and performing advanced SQL/T-SQL development
- Five (5)+ years designing, implementing, and maintaining cloud-based ELT/ETL solutions
- Three (3)+ years utilizing Microsoft Azure Synapse Analytics and Azure Machine Learning
- Three (3)+ years developing Python-based data engineering solutions utilizing Pandas
- Experience designing modern cloud-based data architectures
- Experience implementing scalable data pipelines supporting machine learning workloads
- Bachelor's degree in Computer Science, Information Systems, Data Engineering, Software Engineering, or a related technical discipline
- U.S. Citizenship required
- Ability to successfully obtain and maintain a Public Trust determination
- Experience utilizing PySpark or Polars
- Experience implementing Infrastructure-as-Code
- Experience developing reusable modular Python code
- Experience with Azure Data Lake Storage
- Experience implementing CI/CD pipelines
- Experience utilizing Git source control
- Experience with REST APIs and Azure SDKs
- Experience with AI coding assistants and Large Language Model integration
- Experience supporting federal agencies or Inspector General organizations