Netic is the AI revenue engine for essential services who are the backbone of the American economy. As an Agent Infrastructure Engineer, you will architect and scale the infrastructure supporting autonomous AI agents, collaborating with a team to tackle real-world challenges using cutting-edge technologies.
Responsibilities:
- Build cloud infrastructure: Design and operate the backbone that hosts our AI agents and supports our platform
- Automate operations: Create infrastructure as code and automated deployment pipelines for reliable releases
- Enable scale: Implement systems that handle usage spikes gracefully through autoscaling and multi-region support
- Create observability: Build monitoring, logging, and dashboards that provide real-time visibility into system health
- Maintain security: Implement security best practices including IAM, network segmentation, and audit trails
Requirements:
- 4+ years running distributed systems at scale with a major cloud platforms (we use GCP but AWS and Azure is great, too)
- Proven record of owning infrastructure-as-code and CI/CD pipelines (Terraform, Git Actions, etc)
- Experience optimizing systems and databases to meet latency and cost targets under multi-modal workloads. For example, experience with pgBouncer, Kubernetes-based autoscaling, and similar tools
- Fluent with modern monitoring and tracing tooling (we use Datadog) and built-in tools in Vercel or GCP
- Understanding of enterprise security requirements and compliance needs like authentication and service proxies
- Treat infrastructure as a product and prioritize ambiguous requirements to see around the corner for 1-2 years ahead of our current systems—measure impact and iterate continuously
- Exposure to AI infrastructure and LLMs. Experience with hosting agents or with LLMs or an interest in experimenting with LLMs - even in your own free time