Collibra is seeking a highly motivated and experienced DevOps Production Engineer to join their Cloud Operations Team. In this pivotal role, you will ensure the seamless delivery of cloud services while driving continuous improvement and fostering innovation across the platforms.
Responsibilities:
- Proactively research and integrate emerging technologies and cutting-edge software practices into our release processes and infrastructure
- Formulate and champion new ideas and features to continuously enhance our infrastructure, functionality, and overall product effectiveness
- Apply data modeling and predictive analysis techniques to anticipate and mitigate potential issues before they impact production
- Act as a key liaison and facilitator between Production Engineering, other technical functions, external vendors, business partners, and end-users
- Lead and actively participate in cross-team engineering discussions to design and evolve scalable, measurable, fault-tolerant, and cost-effective cloud services
- Collaborate cross-functionally to ensure a holistic approach to identifying, addressing, and resolving platform and functionality issues
- Actively participate in architectural design reviews and discussions, providing valuable input on features, service improvements, and new service implementations
- Utilize visual tools, such as flowcharts and diagrams, effectively during design and problem-solving processes
- Provide high-level support and expertise for team and cross-functional projects related to release automation and infrastructure
- Maintain clear and complete understanding of complex situations through intent listening and incisive questioning to ensure accurate interpretation and resolution
Requirements:
- 7 years of experience in Release Engineering, DevOps, or a similar role focused on software delivery and infrastructure
- Proven track record of driving process improvements and implementing automation in complex environments
- Experience with cloud platforms (e.g., AWS, Azure, GCP) and their respective services
- Strong proficiency in CI/CD pipelines and tools (e.g., Jenkins, GitLab CI, Azure DevOps, ArgoCD)
- Expertise in scripting and automation languages (e.g., Python, Go, Bash)
- Familiarity with containerization (Docker) and orchestration (Kubernetes)
- Understanding of infrastructure-as-Code (IaC) principles and tools (e.g., Terraform, CloudFormation)
- Experience with monitoring, logging, and observability tools
- Professional experience installing, configuring, managing, and/or administering ElasticSearch
- Exceptional with communication and interpersonal skills and have the ability to articulate complex technical concepts clearly to diverse audiences
- Strong in analytical and problem-solving abilities, with a proactive and preventative mindset
- Able to influence and lead discussions across engineering teams
- Self-driven and takes initiative to identify and implement improvements