Oscar Health is the first health insurance company built around a full-stack technology platform focused on serving its members. They are seeking a Senior Software Engineer, Cloud Infrastructure / SRE to build and maintain a resilient ecosystem using modern technology stacks and to empower the engineering organization with automated infrastructure.
Responsibilities:
- Become the expert on your team's business and technical domains such as DevOps, site reliability, and cloud best practices
- Lead the planning, execution and release of complex technical projects across multiple teams outside of Core Technology
- Work with partners, product managers, and designers to solve challenging problems
- Lead and mentor engineers on the team to improve technology and apply best practices
- Independently responsible for large or complex technology capabilities (a set of components or services) within their team's domain or spanning multiple domains
- Facilitates, encourages, and enhances cross-team execution and collaboration; knows when cross-team projects are at risk and actively mitigates risk to deliver on time
- Prolific contributor to the objectives of their functional group, as well as organization-wide projects
- Drives prioritization of technical roadmap and influences prioritization of product roadmap and process enhancements within their team
- Actively identifies and reduces failure domains, designs and builds resilient systems, and strives to reduce adverse effects of an outage
- Builds software to minimize effort and business impact during maintenance and failures
- Guides the development of Service-Level Objectives (SLOs) for systems they are responsible for
- Own medium to large features or infrastructure projects from technical design through completion
- Comply with all applicable laws and regulations
- Other duties as assigned
Requirements:
- 6+ years of professional software engineering experience working with a variety of technologies, with increasingly impactful accomplishments
- Experience as a major contributor to cross-pod or cross-company deliverables
- Experience leading technical contributions and improving the quality of what your teams create, and excitement about building fault-tolerant, scalable software systems
- Demonstrates expertise in the practical application of CS concepts within their team
- Sets and enforces the standard for writing stable, correct, and maintainable code
- Experience mentoring and training more junior engineers
- Cloud Proficiency: Deep expertise in managing production environments within AWS or GCP at scale
- Infrastructure as Code: Advanced experience with Terraform or similar IaC tools to manage complex, multi-account structures
- Orchestration & Delivery: Proven track record with Kubernetes and workflows using ArgoCD
- SRE Discipline: Strong background in Site Reliability Engineering, including Service Level Objectives (SLOs), error budgets, and incident management
- CI/CD & Automation: Experience building robust deployment pipelines via GitHub Actions
- Observability: Proficiency with monitoring using tools like Prometheus, Grafana, or similar
- Security & Networking: Knowledge of cloud-native security (IAM, VPC peering) and service mesh technologies like Istio
- Programming: Understanding of at least one programming language that you can use to develop scripts and software
- Education: B.S. in Computer Science, a related technical field, or equivalent high-level industry experience