Reddit is a community of communities, and it is seeking a Software Engineer to join its Data Platform team. The role involves solving large-scale data platform challenges, improving data infrastructure, and collaborating with various teams to enhance data analytics capabilities.
Responsibilities:
- Refine and maintain our data infrastructure technologies to support ML and analytics on data collected from hundreds of millions of users
- Own the Data Warehouse Platform used for long-term storage of this data and the Airflow Platform used to efficiently orchestrate how this data is processed
- Take part in building opinionated guardrails to drive improvements in data quality, cost efficiency, and data governance
- Build automation software which minimizes toilsome work for data users at Reddit and provides a declarative, self-service experience for working with data
- Provide monitoring and alerting for our core systems and the mechanisms built on top of them
- Collaborate effectively with a team of proficient software engineers to develop and maintain the fundamental platform that powers Reddit's cutting-edge data infrastructure
- Engage in the complete data lifecycle at Reddit, participating in the development process and working with one of the world's largest and richest datasets
- Design, build and deliver end-to-end data solutions to improve the reliability, scalability, latency and efficiency of Reddit’s Data Platform
- Implement automation for key elements of the development process, including data quality, managing alerts and handling critical infrastructure operations
- Collaborate and share on-call responsibilities, including incident management
Requirements:
- 2+ years of software engineering experience in a production setting writing clean, maintainable, and well-tested code
- Proficient in object-oriented programming languages like Python, Scala, Go, or Java
- Demonstrated expertise in designing and implementing large-scale systems, diligently monitoring project progress, and showcasing proactive leadership as a self-starter on diverse projects
- Experience with cloud services, Airflow, Kubernetes, CI/CD, Spark, Flink, and/or other modern cloud-based infrastructure
- Excellent communication skills tailored for effective collaboration within both a service-oriented team and the broader organizational context