Bentley Systems is a leading software company that helps infrastructure professionals design, build, and operate resilient infrastructure. They are seeking a Senior Site Reliability Engineer to automate cloud-based services infrastructure and tooling, focusing on DevOps automation principles, while ensuring the smooth operation of services and addressing urgent issues as needed.
Responsibilities:
- Manage, implement, and improve automation (CI/CD Infrastructure) and tooling through Azure DevOps, scripting, developing tools and proprietary systems
- Automate Azure cloud-based deployments, resource provisioning and other Azure infrastructure related tasks
- Troubleshoot and resolve issues related to application development, deployment, and operations
- Dive deep into availability, performance and outages for infrastructure and systems, and provide technical leadership for proactive resolutions
- Ensure compliance with industry’s best practices and organizational policies
- Continuously improving processes and tools to enhance efficiency and productivity
- Maintain monitoring and alerting and participate as a member of a rotating on-call schedule
- Share on-call responsibilities, including collaborating with other engineers to triage and fix issues that come up in production for our users
Requirements:
- Degree in computer science, software engineering or relevant training and/or experience
- +8 years of experience with Cloud Services development, deployment and/or IT Cloud infrastructure setup and maintenance (Azure Cloud or AWS or GCP)
- Expertise in containerization and orchestration technologies (Docker, Kubernetes)
- Experience with Scripting and automation skills using languages like PowerShell, Bash, Ansible, JavaScript or similar
- Programming experience, preferably in a high-level language like C#, Python, Golang, Ruby, or equivalent
- Knowledge of AD and DNS, IIS, and networking
- Experience with FedRamp background screening
- Experience with Azure DevOps (Pipelines, YAML) or GitHub enterprise (Git, Actions)
- Good knowledge of Microsoft SQL Server/Azure SQL setup, SQL statements/scripts and troubleshooting
- Ability to document architectural designs along with operational processes and procedures to support ongoing administration of cloud systems
- Willingness to learn and become an expert at a wide array of Cloud based technologies in an ever-changing, fast-paced environment
- Ability to work effectively in a global team and cross-organization
- Experience with Azure Resource Manager (ARM) templates for infrastructure deployment into Azure is a plus
- Expertise in defining operations in code, utilizing version control, peer review, and CI/CD for seamless deployment of changes to both applications and infrastructure