Actively contribute to defining, designing, building, and operating our internal developer platform and tools, so that other engineering teams can focus on the business requirements of their products
Serve as a leader and mentor on internal developer platforms, cloud architecture, infrastructure as code, monitoring and alerting, and other areas of operational excellence, and establish best practices in each.
Ensure the products we build exceed our customers’ expectations:
Actively participate in the on-call rotation, handling incident response for the platform and tools during working hours
Build monitoring that alerts on early symptoms rather than only once an outage has occurred
Use metrics and data to continuously improve our products
Define and measure the right SLIs, and continuously verify that we meet or exceed our SLOs
Identify and implement improvements in the platform architecture and/or tools to reduce toil and/or improve resilience and scalability
Use solid engineering practices to build long-term solutions to prevent issues from repeating
Use chaos engineering practices to stress the platform and validate that our observability, monitoring, resilience, and HA/DR commitments are being met
Requirements
5+ years of experience with CI/CD tools and practices such as GitOps, Ansible, Jenkins, GitHub, and GitLab (GitHub and GitHub Actions/Workflows preferred)
5+ years of hands-on experience in DevOps/DevSecOps/SRE
4+ years writing secure, scalable, and resilient infrastructure as code (preferably in Terraform) and using standard SDLC methodologies for building, testing, deploying, and supporting cloud infrastructure (preferably in AWS)
4+ years of experience with Kubernetes (preferably EKS) and Docker
3+ years of experience writing automation and tools using various scripting or programming languages
3+ years building, running, and operating cloud-native applications using Docker and one or more programming languages (Node.js, PHP, or Go preferred)
3+ years demonstrating leadership in technical best practices, quality, and delivery. Proven examples of driving change and raising the bar across the team.
Very strong background in, and passion for, building secure solutions
Working knowledge of defining and managing SLIs and SLOs, and of using error budgets to ensure SLOs are met
Strong command of English and clear, crisp communication, both verbal and written
Solid emotional intelligence and ability to collaborate with others, especially under pressure.
Tech Stack
Ansible
AWS
Cloud
Docker
Jenkins
Kubernetes
Node.js
PHP
SDLC
Terraform
Go
Benefits
Field Nation LLC Performance Reward – Because every citizen of Field Nation deserves a stake in the win!
Festival Bonus – Celebrate the big festivals with some extra cheer (and cash!).
Referral Bonus – Incentives for successful employee referrals.
Gratuity – Honoring your long-term dedication.
Leave Encashment – Opportunity to encash unused annual leave balance at year-end.
Medical Insurance – Comprehensive health coverage for employees and their immediate family (spouse and children).
Gym Membership – Stay fit, active, and energized.
Complimentary Lunch / Dinner – Because good work needs good food.
Unlimited Tea & Coffee – Keep the energy flowing.
Transportation – Helping you get to work hassle-free.
Mobile Data Allowance – Allowances to ensure connectivity.
Career Development Budget – Dedicated funds for professional learning and growth.
Work Model: Hybrid (2 days in-office, 3 days remote per week) – balance is key.
Summer & Winter Field Weeks – Two annual team retreats to connect, collaborate, and recharge.
Quarterly Team Outing Budget – Enjoy exciting activities and quality time with your team to bond, relax, and celebrate together.
Occasional Gifts – Surprises and gifts to celebrate milestones & welcome new faces.
Leave Benefits:
Maternity Leave
Paternity Leave
Hajj/Umrah Leave
Paid Time Off – Take the time you need! Covers annual, casual, and sick leave so you can recharge and come back ready to shine.