Inside Real Estate is a fast-growing, independently-owned real estate software firm serving over 500,000 top brokerages, agents, and teams. The Data Engineer II will design and build analytic data stores and integration processes to meet the data needs of various internal and external constituents, while collaborating with cross-functional teams to deliver data solutions.
Responsibilities:
- Be a proactive member of an autonomous, cross-disciplined team with a goal of building best in class data solutions that delight internal and external customers
- Build ETL processes that integrate data from multiple, highly variant sources into analytic models and data stores that feed multiple end user solutions and perspectives
- Continually research and learn the latest approaches to building analytical solutions in highly variant data environments
- Work closely with our operations and other development teams to optimize source data acquisition processes and strategies
- Participate in technology evaluation and selection initiatives that grow the company’s core data and analytics capacity
- Understand product descriptions for new features and how they fit into the greater InsideRealEstate system while learning how our customers use them
- Take initiative on assigned day-to-day tasks and keep pace with the team to get the job done while knowing when to ask for help
Requirements:
- BS degree in Computer Science, Math or related technical field
- 2-4 years of work experience in a data engineering role
- A deep understanding of general ETL processing, data pipelines and data warehousing concepts and workflows
- Relevant work experience leveraging Apache Airflow and AWS data processing and storage solutions
- Strong knowledge of SQL, PySpark and Python as primary languages for data engineering solutions
- Significant experience with large scale data sources, structures and processes
- Experience deploying infrastructure as code with tools such as AWS CloudFormation
- Experience acquiring data from web based APIs
- Experience ingesting data from one of the following: Salesforce, HubSpot, Maxio
- Deep understanding and experience building star and/or snowflake data models