AWSCloudDockerFlaskGoogle Cloud PlatformJavaKubernetesLinuxPySparkPythonRuby on RailsSpringSpring BootSpringBootRubyMachine LearningData EngineeringRailsGCPGoogle CloudGitCI/CD
About this role
Role Overview
support Data Engineering and Data Science Teams, including application development support (Java, Python, etc.) for deployment of multi-cloud big-data applications and Machine Learning use cases
focus on automation of deployment and integration tasks (CI/CD)
drive cloud infrastructure management and automation
work closely with Data Scientists to bring models into production
evaluate new tools and approaches to continuously evolve the team's DevOps practice
Requirements
4+ years developing in an Object-Oriented Programming language (Java preferred)
3+ years developing in Python (may be combined with the above requirement, if Python experience is strictly OO coding)
Familiarity with specialized scripting/SDKs related to data management (e.g.: Boto3, PySpark)
Familiarity with key file formats: JSON, YAML, AVRO, Parquet
2+ years working in Public Cloud environments (AWS or GCP required, others desirable)
Strong familiarity with GCP environment
2+ years working in Linux environments
2+ years in Integration Engineering, CI/CD or related field
Practical experience setting up build pipelines
2+ years working with GIT and other SCM
Experience with containerization frameworks (Kubernetes, Docker)
Experience with REST web-services frameworks (e.g.: Spring Boot, Flask, Rails)
Tech Stack
AWS
Cloud
Docker
Flask
Google Cloud Platform
Java
Kubernetes
Linux
PySpark
Python
Ruby on Rails
Spring
Spring Boot
SpringBoot
Benefits
comprehensive medical, dental, vision, and wellbeing benefits
competitive 401(k) contribution
pension plan
annual incentive
9 paid holidays
paid time off program (23 days accrued annually for full-time employees)