Contribute to Scalable Solutions: Implement and test features for Cloudera’s Data Engineering Experience, assisting in the development of systems that operate at a massive scale on thousands of production nodes.
Support Open-Source Innovation: Work with senior engineers to contribute to Apache Spark, learning how to shape the future of distributed data processing in the open-source community.
Build with Modern Stacks: Develop high-performance features using Scala, Java, and Python on modern data platforms.
Deepen Technical Mastery: Gain and apply knowledge in core distributed data processing concepts, including SQL Planners, Apache Parquet, and Iceberg.
Address Large-Scale Challenges: Assist with developing and debugging features in massive distributed systems, learning about scaling from hundreds to thousands of nodes.
Ensure System Integrity: Conduct thorough root cause analysis, debug system-level issues, and resolve failures to maintain high system quality.
Enhance Engineering Velocity: Improve internal infrastructure and tooling to streamline development, testing, and deployment processes.
Collaborate Effectively: Work closely within a distributed team and with stakeholders to drive product delivery.
Requirements
2-3+ years of experience in professional software development
Proven experience implementing features or addressing issues, and delivering high-quality code
Strong proficiency in Java, Scala, or another JVM-based language
Passion for clean coding, attention to detail, and a focus on software quality and maintainability
Strong oral and written communication skills for effective collaboration within a distributed team
Demonstrated ability to research, learn quickly, and solve technical problems with guidance
An open-minded approach with a desire to learn new technologies and an unwavering passion for building exceptional products
Experience with using/developing Apache Spark, Apache Iceberg, or other related technologies (optional)
Experience with large-scale, distributed systems development (optional)
Experience with or interest in SQL Planners and Optimizers (optional)
Prior experience or interest as a contributor to open-source projects (optional)