Time Doctor is a workforce analytics platform that provides AI-powered insights into work performance. As a DevOps Engineer, you will architect and manage cloud-native infrastructure primarily on Google Cloud Platform, leveraging modern technologies to ensure high availability and performance for enterprise-grade SaaS solutions.
Responsibilities:
- Design, implement and maintain Terraform-based infrastructure across 40+ environments (production, staging, sandbox and customer-specific deployments)
- Manage GCP services including GKE, Cloud Run, Cloud Functions, Cloud Build, VPC networking, Cloud Armor and Cloud Load Balancers
- Develop and maintain reusable Terraform modules following best practices and DRY principles
- Manage multi-environment infrastructure using GitOps workflows and Terraform Cloud
- Administer MongoDB Atlas clusters including backup strategies, replication, network peering and performance optimization
- Manage Redis instances (Cloud Memorystore) for caching, session management and real-time features
- Configure and maintain BigQuery datasets, scheduled queries and data pipelines
- Implement data lifecycle policies for Cloud Storage and optimize storage costs
- Build and maintain CI/CD pipelines using GitHub Actions, Cloud Build and automated Terraform workflows
- Implement container build pipelines and manage container registries (GCR/Artifact Registry)
- Automate deployment processes with integration to GitHub via OIDC/Workload Identity Federation
- Develop infrastructure automation scripts in Python and Bash
- Configure and manage pre-commit hooks and code validation workflows
- Maintain comprehensive monitoring solutions using Google Cloud Monitoring and Sentry
- Create and manage custom alert policies for application and infrastructure metrics
- Implement distributed tracing and APM using Cloud Trace and OpenTelemetry
- Configure log-based metrics and structured logging with Cloud Logging
- Respond to incidents, perform root cause analysis and implement preventive measures
- Manage uptime checks and SLO/SLI monitoring across services
- Configure and maintain Cloud Armor WAF policies with rate limiting and DDoS protection
- Manage Identity-Aware Proxy (IAP) for secure access control
- Implement SSL/TLS automation using Let's Encrypt and Google-managed certificates
- Maintain secrets management using Google Secret Manager with proper IAM controls
- Configure VPC networking, private service access and Cloud NAT for secure communication
- Enforce security best practices including encryption at rest, network isolation and least-privilege IAM
- Work closely with engineering teams to optimize application performance and scalability
- Participate in architecture discussions and provide infrastructure expertise
- Create and maintain comprehensive documentation for infrastructure configurations, runbooks and procedures
- Mentor team members on DevOps best practices and infrastructure workflows
- Collaborate with cross-functional teams on feature deployments and system improvements
Requirements:
- Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience
- 3-5 years of hands-on DevOps experience with production cloud environments
- Strong expertise in Google Cloud Platform (GCP), including Cloud Run, Cloud Functions, GKE, VPC networking, Cloud Armor, Load Balancers, IAM, Secret Manager and security services
- Advanced Terraform proficiency: infrastructure as code for complex multi-environment setups; module development and state management; Terraform Cloud/Enterprise workflows
- MongoDB Atlas administration: cluster configuration, sharding and replica sets; backup/recovery strategies and performance tuning; network peering and security configuration
- Container technologies: Docker containerization and multi-stage builds; experience with Kubernetes or serverless container platforms; container registry management
- CI/CD expertise: GitHub Actions workflows; Cloud Build pipelines; GitOps practices and automated deployments
- Scripting and automation: strong Python and Bash scripting skills; infrastructure automation and tooling development
- Monitoring and observability: experience with Sentry/Datadog or similar APM/monitoring platforms; cloud-native monitoring tools and practices; log aggregation and analysis