Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As a Senior Site Reliability Engineer on the Vault Platform team, you will ensure the scalability and reliability of enterprise applications while tackling complex challenges at a global scale.

Responsibilities:

Build Cloud Infrastructure: Rapidly build new cloud infrastructure from scratch, adhering to software development best practices
Drive Reliability & Scalability: Ensure our platform meets the scalability and reliability needs of our hundreds of global customers (across North America, Europe, and Asia)
Lead Incident Management: During an incident, effectively lead triage and mitigation efforts, potentially performing periodic on-call duty for escalations
Automate & Optimize: Develop tools and automation to eliminate manual work and reduce issue resolution times
Full-Stack Diagnostics: Proactively learn all necessary systems to provide full-stack diagnostics and determine root causes of production problems
Strategic Engineering Partnership: Strategize with engineering teams on complex problems, offering insights on what will work at scale (supporting 2M+ users) and guiding development decisions before features ship
Influence Design: Participate in engineering design reviews of new features and drive initiatives to improve operational efficiency and platform scalability
Cross-functional Collaboration: Partner effectively with Product Management, Design, and QA to deliver cutting-edge solutions and direct customer value
Backend Focus: Work across multiple layers of our technology stack, with a primary focus on backend development, and opportunities in frontend and infrastructure
Effective Communication: Communicate clearly with engineering teams, succinctly describing problems for seamless hand-offs during outages with both technical and non-technical audiences
Mentorship: Actively mentor team members, contributing to a positive and high-performing team environment

Requirements:

5+ years of experience in Java development, with a strong preference for experience within enterprise cloud software companies
Hands-on operational experience in a high-volume or critical production service environment, including incident management and root cause analysis
Proven ability to write clean, testable, readable, and maintainable code within a collaborative team setting
Hands-on experience with a range of open-source technologies, such as Spring, MySQL, Hibernate, Solr, Maven, Git, Tomcat, Linux, AWS, Vagrant, Docker, and Kubernetes
3+ years of experience in relational databases with expert-level SQL skills
Solid scripting proficiency with languages such as Shell, Bash, Ansible, Python, Go, Ruby, etc
Demonstrated history of incident management and leadership ability, with effective communication skills across all levels (individual contributors to executives)
Proven record of making your team better through mentorship
This role requires a working schedule of Monday - Friday, 2 PM - 10 PM PST, and candidates must be located in the HST or PST time zones to be considered
Applicants must have the unrestricted right to work in the United States. Veeva will not provide sponsorship at this time

Senior Software Engineer - SRE

Key skills

About this role

Responsibilities:

Requirements: