Design, develop, and maintain a data platform that processes petabytes of data
Participate in technical reviews of our products and help us develop new features and enhance stability
Continually help us improve the efficiency of our services so that we can delight our customers
Help us research and implement new ways for both internal stakeholders as well as customers to query their data efficiently and extract results in the format they desire
Requirements
Degree in Computer Science (or professional experience in data structures/algorithms / distributed systems)
5+ years' experience combined between backend/cloud development and data platform engineering roles
A solid understanding of algorithms, distributed systems design and the software development lifecycle
Solid background in Java/Scala and a scripting language like Python
Experience building large scale data pipelines
Strong familiarity with the Apache Hadoop ecosystem including : Spark, Kafka, Flink, Iceberg/Delta Lake/Hive, Apache Presto/Trino, etc
Experience with relational SQL and NoSQL databases, including Postgres/MySQL, Cassandra, DynamoDB
Good test driven development discipline
Reasonable proficiency with Linux administration tools
Proven ability to work effectively with remote teams
Experience with the following tools is desirable: Go Kubernetes Jenkins Parquet Protocol Buffers/GRPC
Tech Stack
Apache
Cassandra
Cloud
Distributed Systems
DynamoDB
GRPC
Hadoop
Java
Jenkins
Kafka
Kubernetes
Linux
MySQL
NoSQL
Postgres
Python
Scala
Spark
SQL
Go
Benefits
Market leader in compensation and equity awards
Comprehensive physical and mental wellness programs
Competitive vacation and holidays for recharge
Paid parental and adoption leaves
Professional development opportunities for all employees regardless of level or role
Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections