Build and maintain a high performing platform and ensure the platform meets the need of business, providing stability, reliability, security and scalability.
Manage underlying infrastructure, including hardware, software and storage resources.
Manage multiple environments (Prod/Pre-Prod/Lower environments) and ensure the integration and stability of environments.
Streamline code promotion process and providing ways to improve developer productivity.
Ensure the stability of platform and TRW ecosystem.
Find ways to optimize platform and address system bottlenecks.
Ability to analyze system issues and provide a solution to resolve the issue permanently.
Promote the best practices and follow the continuous integration and delivery.
Drive transformation by continuously looking for ways to automate existing processes and workflows.
Explore new capabilities and technologies to drive innovation.
Work closely with development and production operation teams to support their needs, troubleshoot issues and enhance the platform.
Requirements
8+ years in managing and supporting large scale and mission critical platform.
Experience in large scale platform migration initiative.
Strong experience with managing Hadoop platform including HDFS, Spark and Hive.
8+ years' experience in Linux system administration.
Experience in managing web and service/API platform.
Experience working with messaging technologies (e.g. Kafka)
Experience with RDBMS, demonstrable fluency in SQL and any non-relational DB such as Cassandra (is preferred).
6+ years of experience managing Git, CI/CD toolset (Maven), Jenkins, working within an Agile environment.
Experience in implementing and managing system monitoring tools.
Experience developing and managing highly scalable and low latency systems.
A passion for bringing best products to the customer, simplifying and automating work, for making things better, for being efficient and helping others.