IMO Health is a company focused on ontology-driven data engineering, and they are seeking a Data Engineer specializing in Semantic Web and Ontology Engineering. The role involves working with complex medical terminology to build automated data ingestion processes and improve content relationships for healthcare professionals.
Responsibilities:
- Serve as a technical resource in semantic web development on cross-skilled clinical, content, and data teams as we develop meaningful data relationship experiences
- Design and write quality source code for data pipelines and services, including detailed documentation of design and implementation
- Implement and maintain the deployment of semantic services using containerization technologies (Docker) and cloud orchestration platforms like Amazon ECS. Contribute to and maintain automated deployment, testing, and integration activities utilizing robust CI/CD pipelines
- Work together with the product owner to break down features into actionable user stories and technical tasks
- Utilize existing taxonomies and ontologies applicable to our work. Contribute to the documentation and publishing of taxonomies/ontologies, supporting the core concepts of our domain
- Adhere to and support the established governance process for structured data, ensuring compliance across implementations
Requirements:
- A relevant STEM BA/BS Degree and 3-5 years of relevant coding experience, or five years of relevant professional experience in semantic web development/data engineering using multiple languages (Python, Java)
- Proficient in AWS services (EC2, EMR, RDS, etc.)
- Strong SQL knowledge, with experience in complex query authoring, relational databases (PostgreSQL), and NoSQL databases (DynamoDB, MongoDB, Elasticsearch)
- Hands-on experience with containerization technologies (Docker) and cloud deployment strategies using services like Amazon ECS
- Familiar with agile development and CI/CD processes using tools such as Git and Terraform
- Experience working with Spark and/or PySpark
- Proven ability to communicate complex technical concepts effectively to both technical and non-technical stakeholders, and to lead collaborative efforts across diverse teams
- Demonstrated ability to analyze complex data challenges, identify root causes, and architect strategic, scalable solutions within a semantic context
- Experience in an Agile/Scrum environment, iteratively developing and deploying data solutions
- Familiarity with RDF, OWL, SHACL, and SPARQL
- Understanding healthcare ontologies and standards like SNOMED-CT, LOINC, RxNorm, and ICD-10