Design new features for Cloudera’s data engineering experience, and take them from prototypes to leading a team to deliver the feature in production at scale
Contribute to Apache Spark, Livy
Develop new features in Scala/Java/Python on a modern platforms
Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems.
Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, but focusing on Iceberg and Spark, which you can utilize in your daily tasks
Get to work on large scale distributed systems, from 100s to 1000s of nodes, in production clusters
Debug system level deployment issues, root cause analysis, perform system test analysis and resolve failures
Work on improving internal infrastructure
Collaborate with other team members and stakeholders
Requirements
Bsc/Msc in related field or equivalent experience
6+ years professional software development.
Experience leading and delivering complex product enhancements.
Strong understanding of at least one of the following languages: Java, Scala, Python.
Experience with systems design, development.
Strong oral and written communication skills.
Strong ability to research and solve problems independently without constant supervision.
Open-minded, desire to learn new things and build great products.
Experience with distributed systems
Experience with SQL planners
Experience with using/developing Apache Spark, Livy or other related technologies.
Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling.
Solid experience with at least one cloud services.