Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As a Senior Site Reliability Engineer on the Vault Platform team, you will ensure the scalability and reliability of enterprise applications while tackling complex challenges at a global scale.
Responsibilities:
- Build Cloud Infrastructure: Rapidly build new cloud infrastructure from scratch, adhering to software development best practices
- Drive Reliability & Scalability: Ensure our platform meets the scalability and reliability needs of our hundreds of global customers (across North America, Europe, and Asia)
- Lead Incident Management: Lead triage and mitigation efforts during incidents, which may include periodic on-call duty for escalations
- Automate & Optimize: Develop tools and automation to eliminate manual work and reduce issue resolution times
- Full-Stack Diagnostics: Proactively learn all necessary systems to provide full-stack diagnostics and determine root causes of production problems
- Strategic Engineering Partnership: Strategize with engineering teams on complex problems, offering insights on what will work at scale (supporting 2M+ users) and guiding development decisions before features ship
- Influence Design: Participate in engineering design reviews of new features and drive initiatives to improve operational efficiency and platform scalability
- Cross-functional Collaboration: Partner effectively with Product Management, Design, and QA to deliver cutting-edge solutions and direct customer value
- Backend Focus: Work across multiple layers of our technology stack, with a primary focus on backend development and opportunities to contribute to frontend and infrastructure
- Effective Communication: Communicate clearly with engineering teams and with both technical and non-technical audiences, describing problems succinctly to enable seamless hand-offs during outages
- Mentorship: Actively mentor team members, contributing to a positive and high-performing team environment
Requirements:
- 5+ years of experience in Java development, with a strong preference for experience within enterprise cloud software companies
- Hands-on operational experience in a high-volume or critical production service environment, including incident management and root cause analysis
- Proven ability to write clean, testable, readable, and maintainable code within a collaborative team setting
- Hands-on experience with a range of open-source technologies, such as Spring, MySQL, Hibernate, Solr, Maven, Git, Tomcat, Linux, AWS, Vagrant, Docker, and Kubernetes
- 3+ years of experience in relational databases with expert-level SQL skills
- Solid scripting and automation proficiency with languages and tools such as Shell/Bash, Ansible, Python, Go, or Ruby
- Demonstrated history of incident management and leadership ability, with effective communication skills across all levels (individual contributors to executives)
- Proven record of making your team better through mentorship
- This role requires a working schedule of Monday - Friday, 2 PM - 10 PM PST, and candidates must be located in the HST or PST time zones to be considered
- Applicants must have the unrestricted right to work in the United States. Veeva will not provide sponsorship at this time