Saransh Inc is seeking a Senior Infrastructure Engineer to own and operate the physical hardware layer of the client’s private cloud. This role is responsible for the lifecycle management, reliability, and continuous improvement of the compute, networking, and storage hardware that underpins private cloud platforms.
Responsibilities:
- Own and operate the physical infrastructure that powers Adobe’s private cloud, including compute, storage, and networking hardware
- Perform day-2 operations for datacenter hardware, including installation, break/fix, firmware upgrades, patching, and lifecycle management
- Manage hardware capacity planning, expansion, refresh cycles, and decommissioning
- Troubleshoot complex production issues across:
- Server hardware (CPU, memory, disks, NICs, HBAs)
- Storage systems and disk subsystems
- Network connectivity and physical switching
- Manage firmware, BIOS, and hardware management controllers (iDRAC, iLO, Redfish)
- Partner with platform, SRE, networking, storage, and security teams during incidents and planned maintenance
- Improve infrastructure reliability through standardization, automation, and proactive monitoring
- Build and maintain automation for hardware provisioning and operations using tools such as Ansible, CI/CD pipelines, and infrastructure-as-code
- Lead incident response, root-cause analysis, and drive corrective and preventative actions
- Create and maintain operational documentation, hardware standards, runbooks, and on-call playbooks
- Mentor junior engineers and raise the operational maturity of the private cloud hardware platform
Requirements:
- 7+ years of experience in infrastructure or datacenter engineering
- Strong hands-on experience managing server, storage, and networking hardware in large-scale production environments
- Deep understanding of x86 server architectures, including CPU topology, memory configurations, NUMA, and I/O subsystems
- Experience operating enterprise storage systems (local and shared) and understanding performance, resiliency, and failure modes
- Familiarity with datacenter networking concepts, including L2/L3, VLANs, bonding, and physical switching
- Strong Linux fundamentals and experience troubleshooting hardware-related OS issues
- Experience with hardware lifecycle management, capacity planning, and vendor coordination
- Proven ability to automate infrastructure operations and reduce manual work
- Experience supporting highly available infrastructure with on-call responsibilities