Mapbox is the leading real-time location platform for a new generation of location-aware businesses. The Software Development Engineer II will focus on data engineering for feature expansion, improving coverage, increasing polygon precision, and building pipelines to ingest new data sources.
Responsibilities:
- Integrate third-party data sources from different geographic areas into our data pipelines
- Inspect and edit geospatial data using open-source tools
- Develop geographic data processing pipelines
- Interface with engineers from other teams to understand their needs for geospatial data and provide solutions
- Implement automated quality metrics to ensure we continuously deliver high-quality data to our customers
- Mentor other software engineers to develop all aspects of their engineering skill sets, including participating in design and code reviews
Requirements:
- 5+ years of industry experience working with data pipelines
- Proficiency in at least one modern programming language suitable for data processing (such as Python or Java)
- Expertise with GIS tools, formats, and processes, preferably open-source ones such as QGIS, Postgres/PostGIS, GDAL/OGR, and GeoJSON
- Experience with AWS technologies such as S3 and Athena
- Experience writing bash scripts and running command line tools
- Working knowledge of SQL databases and, ideally, experience writing custom functions (e.g. with PL/pgSQL)
- Strong debugging and performance-profiling experience, including running QA or test systems at scale
- Strong experience with batch data processing, and well-developed judgment in implementing new data pipelines and the best practices around them
- Comfort working in a Git-based environment (such as GitHub)
- Familiarity with Docker
- Familiarity with CI/CD processes
- Experience integrating data with APIs and querying data through APIs
- Familiarity with Apache Iceberg, AWS Glue, and AWS Step Functions