Dropbox is seeking an Infrastructure Software Engineer to join its Data Platform team, which builds and evolves the core infrastructure for customer analytics and data-driven product decisions. The role involves hands-on ownership of critical platform components and contributions to a major platform modernization effort, including enabling AI/ML use cases on the data lake.
Responsibilities:
- Build and maintain platform capabilities that enable reliable ingestion, storage, and processing of customer and product data at scale
- Contribute to petabyte-scale data lake modernization, including migration to new underlying storage/table formats
- Develop platform features that support AI/ML workflows and make it possible to apply AI on top of the data lake
- Partner with engineering teams across Dropbox to integrate with the customer data platform and improve usability and adoption
- Participate in an on-call rotation and help define operational standards for platform services
Requirements:
- 3+ years of software engineering experience building production systems
- Proficiency in at least one general-purpose programming language (e.g., Python, Go, Java, or C#)
- Familiarity with concepts from batch and/or streaming data systems (e.g., scheduling, backfills, schema evolution, late data, idempotency)
- Experience debugging and operating production services using logs/metrics and incident response practices
- Experience with big data tooling such as Spark/SparkSQL, Kafka, Hive, Airflow, or Superset
- Experience with Databricks or other big data platforms (e.g., Snowflake, Redshift, BigQuery)
- Experience with large-scale data lake storage systems and/or table formats (e.g., lakehouse patterns, schema evolution, partitioning)