Dice is a company focused on providing job opportunities in technology, and they are seeking an Automation Platform Engineer to design and operationalize automation solutions. The role involves delivering production-ready automations, improving operational efficiency, and collaborating with various stakeholders to measure outcomes.
Responsibilities:
- Identify and prioritize automation candidates (manual tasks, recurring incidents, high-MTTR activities/KPI and others assigned) in partnership with operations/SRE
- Deliver automations that are secure, auditable, and supportable (monitoring, alerting, runbooks, and handoff)
- Standardize patterns and reusable components so solutions scale across domains/applications
- Quantify impact (hours saved, number of times run, defect reduction, MTTR improvement) and feed these to a dashboard for reporting
- Identify the run frequency that would indicate an underlying problem to address and setup notification for this condition
- Create workflows/scripts/services to automate operational processes (e.g., ticket triage steps, data validations, batch reruns, retries, reconciliations, reporting)
- Integrate with enterprise tooling (ServiceNow, monitoring/observability, schedulers, CI/CD) as needed to run automations safely in production
- Implement guardrails (idempotency, retries/backoff, rate limits, circuit breakers), robust error handling, and recovery paths
- Follow least-privilege access, secrets management, logging standards, and audit requirements for systems and data
- Build unit/integration tests, perform non-prod validation, and define acceptance criteria aligned to operational readiness
- Add structured logs, metrics, and health checks; define alerts/dashboards to detect failures and measure throughput
- Produce runbooks, support guides, and knowledge transfer for Tier 2/3 ownership
- Participate in intake grooming, technical design reviews, and demos; provide weekly status and risks/issues
Requirements:
- 5+ years building automations in enterprise environments (platform, operations, or reliability automation)
- Strong development skills in at least one automation-friendly language (e.g., Python, Java, JavaScript/TypeScript, PowerShell, Bash, .NET, C++), and comfort with APIs
- Experience building resilient, production-grade workflows (error handling, idempotency, retries, auditing)
- Experience with CI/CD and source control (e.g., Git), including branching and code review practices
- Hands-on experience with Linux/Unix and scripting; ability to troubleshoot across logs, jobs, and infrastructure
- Understanding of security fundamentals: least privilege, secrets management, non-person IDs/service accounts, and data handling
- Ability to work from ambiguous requirements and translate operational pain points into implementable technical solutions
- Experience documenting and training others on the automations created and how they are invoked-automatically when a condition occurs or manually to address an identified condition
- RPA experience (e.g., UiPath, Automation Anywhere, Power Automate) and/or workflow orchestration (e.g., Airflow, Control-M, Ansible, TWS/IWS)
- ServiceNow development/automation (Flow Designer, IntegrationHub, scripting) and ticket lifecycle automation
- Cloud-native automation (containers, Kubernetes, serverless functions) and infrastructure-as-code (e.g., Terraform)
- Observability tooling experience (logs/metrics/traces) and building dashboards/alerts
- Experience in regulated environments (SOX/SOC), including evidence capture and audit-friendly change practices
- Familiarity with data reconciliation, financial operations, or payment/rebates domains
- Experience with Jira, Confluence and ServiceNow