Serve as a Solutions Architect on the GPU Private Cloud team, whose platform is used by thousands of NVIDIANs globally for interactive development, centralized CI/CD, and QA testing, with an opportunity to positively impact all software development teams across NVIDIA.
Evaluate, identify, and develop software solutions to optimize critical software development workflows across various organizations within NVIDIA.
Architect, implement, and support an end-to-end CI/CD system using open-source and NVIDIA proprietary software.
Onboard customers (NVIDIA internal development teams) to the private cloud infrastructure, with thorough discovery of each use case and the solutions available within the cloud.
Identify performance bottlenecks and optimize the speed and cost efficiency of AI development and testing systems.
Lead software development projects and technically direct a team of brilliant engineers, guiding them to deliver efficient and impactful solutions.
Find problems within software systems and resolve them.
Craft and implement critical metrics using various analytics methods and dashboards.
Requirements
BS in EE/CS or equivalent experience, with 12+ years of systems software development, including at least 1 year dedicated to developing or exploring AI.
Experience maintaining cloud infrastructure and highly available production environments.
Strong programming and software development skills in Java, Python, and shell scripting, along with a good understanding of distributed systems and REST APIs.
Experience working with SQL/NoSQL database systems such as MySQL, Cassandra, MongoDB, or Elasticsearch.
Excellent knowledge of and working experience with Docker containers and virtual machines.
Good background in cloud technologies such as OpenStack, Docker, Kubernetes, Chef/Puppet, Hadoop/Ceph/SwiftStack, LXC, Git, Perforce, JFrog, and Kafka.
Ability to work across organizational boundaries effectively to improve alignment and productivity between teams in a multi-national, multi-time-zone corporate environment.