Kforce is an enterprise networking and cloud solutions company. They are seeking an experienced Senior Data Engineer to design and scale data pipelines for machine-generated data, working closely with machine learning engineers and researchers.
Responsibilities:
- Senior Data Engineer will build and scale distributed data pipelines for large-scale time series and log data
- Design reliable, high-performance Spark/Python workflows for model training datasets
- Analyze and resolve performance bottlenecks (latency, memory, skew, throughput)
- Improve data quality, validation, and reproducibility for ML workloads
- As a Senior Data Engineer, you will partner with ML engineers and researchers to accelerate foundation model development
- Measure and optimize application and transaction performance in production systems
Requirements:
- 5+ years of software engineering experience
- Hands-on experience with Apache Spark (PySpark or Scala)
- Experience building large-scale data pipelines in distributed environments
- Experience working with time series, logs, or high-volume event data
- Strong proficiency in Python
- Strong debugging and performance optimization skills
- Experience supporting ML or large model training workflows
- Experience with streaming systems (Kafka, Spark Streaming)
- Experience with cloud-native or Kubernetes-based platforms
- Familiarity with sequence modeling or time series data systems