ADG TECH CONSULTING is seeking a Systems Engineer - DevOps to support the operations and maintenance of GSA Data.gov systems. The role involves enhancing cloud-based infrastructure, supporting AWS and Linux environments, and improving automation for deployments and operational tasks.
Responsibilities:
- Support the operations, maintenance, and reliability of GSA Data.gov systems, including production monitoring, incident response, troubleshooting, and root cause analysis
- Maintain and enhance cloud-based infrastructure supporting Data.gov applications, APIs, data catalog services, search capabilities, databases, and data harvesting workflows
- Support AWS and Linux-based environments, including system configuration, application servers, web services, scheduled jobs, access controls, and environment stability
- Develop, maintain, and improve automation for deployments, infrastructure provisioning, configuration management, and repeatable operational tasks
- Support CI/CD pipelines and release processes using tools such as GitHub Actions, Jenkins, Terraform, CloudFormation, Ansible, Docker, or similar technologies
- Work closely with application developers to support CKAN, Data.gov catalog services, Data Harvester components, APIs, search services, and related open data platform capabilities
- Monitor system performance, availability, logs, capacity, and security events using tools such as CloudWatch, Splunk, New Relic, or similar monitoring platforms
- Troubleshoot application, infrastructure, database, network, security, and integration issues across development, test, staging, and production environments
- Support database and search platform operations, including PostgreSQL, MySQL, Solr, Elasticsearch, OpenSearch, or similar technologies
- Support security and compliance activities, including vulnerability remediation, access control reviews, configuration hardening, ATO documentation, and implementation of federal security requirements
- Create and maintain technical documentation, including runbooks, standard operating procedures, system diagrams, deployment guides, and troubleshooting procedures
- Collaborate with developers, architects, security engineers, product owners, Scrum Masters, and government stakeholders in an Agile delivery environment
- Support platform modernization efforts, including cloud optimization, DevSecOps improvements, reliability engineering, performance tuning, and technical debt reduction
- Assist with enhancements to APIs, Python scripts, data pipelines, Data Harvester functionality, CKAN components, and search improvements
- Support production release planning, including deployment readiness, smoke testing, rollback planning, and post-deployment validation
- Provide recommendations to improve system scalability, observability, maintainability, security, and operational efficiency
- Evaluate new tools, technologies, and practices that improve automation, monitoring, cloud operations, security, and overall platform reliability
Requirements:
- US citizenship is required in order to obtain a Secret clearance