Administration of Linux machines, Web servers, Application servers, Databases Application and cloud infrastructure support for customer environments.
Provide application support on Java and Ruby applications.
Own end-to-end availability and performance of mission critical services and build automation to prevent problem recurrence.
Tools development and automation to increase availability and performance.
Ensuring the data, services and infrastructures are reliable, fault-tolerant, efficiently scalable and cost-effective.
Collaborate with Product and Release engineering for new product releases and maintenance.
Coordinate incident, problem and change management.
Participate in on-call rotation for after-hours and weekend emergencies.
Requirements
Bachelor's Degree with 8+ years of professional experience handling large scale production systems.
Experience with AWS or comparable cloud providers with certification.
Experience in designing of new services on AWS or comparable cloud provider, migration of services to cloud and deployment of new services on AWS or comparable cloud provider.
Hands on experience with Terraform and configuration management tools like Chef, Ansible or equivalent.
Experience in application support/development on Java or Ruby.
Hands on scripting experience with anyone of these: Python or Bash.
Excellent knowledge of large scale web applications/distributed systems.
Experience in Kubernetes, Docker, and/or cloud deployment technologies.
Experience in observability tools like NewRelic, Datadog etc
Expertise in problem solving and analyzing global scale distributed systems.
Excellent written and verbal communication skills.
Critical thinking, continuously challenging how and why we do things to help us improve