Cloudera is a company that empowers people to transform complex data into clear and actionable insights. They are seeking a talented and motivated Sr. Staff Systems Engineer to join their Technical Operations team, focusing on the deployment and automated management of internal IT infrastructure, particularly in on-premise Linux environments and cloud services.
Responsibilities:
- Architect, deploy, and provide senior-level operational support for our on-premise and cloud-based Linux infrastructure and core IT services (e.g., virtualization, bare metal, storage, DNS), ensuring high availability and reliability
- Develop, maintain, and champion our Infrastructure-as-Code (IaC) and automation frameworks using tools like Terraform, Ansible, and Foreman/MaaS to manage and deploy platform services
- Implement and automate system-level security best practices, including patching, hardening, and configuration management, ensuring compliance and resilience from the ground up
- Build and automate deployment pipelines for IT infrastructure services (e.g., system images, configuration, platform services) using GitHub/Git, Ansible, and scripting
- Serve as a technical Subject Matter Expert (SME), working with IT Systems, CloudOps, Security, and Engineering teams to design and implement robust, scalable, and optimal solutions
- Participate in a shared on-call rotation to support mission-critical IT services (with clear documentation and runbooks provided)
- Create and maintain accurate documentation for automation, operational audits, and compliance
- Proactively identify and drive improvements in system performance, monitoring, and operational processes through automation and observability
- Mentor junior team members
Requirements:
- Bachelor's degree in Computer Science or 6+ years of equivalent experience in a large-scale enterprise environment
- Deep, expert-level Linux systems administration experience (e.g., Red Hat, Rocky, Ubuntu) and mastery of common Command Line Interface (CLI) tools and services
- Strong hands-on skills with Python and shell scripting, used for systems automation, tooling, and integration
- Proven experience with Infrastructure-as-Code (e.g., Terraform, Ansible) and version control (GitHub/Git)
- Solid experience managing hybrid infrastructure, with deep expertise in on-premise environments (virtualization/Platform9, storage, networking) and a strong understanding of core public cloud services (AWS/Azure/GCP)
- Advanced knowledge of configuring, operating, and integrating authentication systems (e.g., LDAP, AD, Kerberos, SAML, OIDC)
- A security-first mindset and experience designing, building, and operating secure, automated infrastructure
- Strong networking fundamentals (TCP/IP, DNS, DHCP, routing, firewalls), including public cloud equivalents
- Certifications such as Red Hat (RHCE), Terraform, or public cloud (AWS, Azure, GCP)
- Knowledge of enterprise security principles, cryptography, PKI, and operational security practices
- Experience operating in regulated, high-governance, or compliance-driven environments (e.g., FedRAMP, PCI, ISO 27001, SOC 2)
- Familiarity with monitoring and observability tools
- Experience with containerization (Docker) and orchestration (Kubernetes)
- Project management experience
- Previous experience mentoring junior team members