Kforce Inc is seeking an experienced Senior Data Engineer to support their enterprise networking and cloud solutions customer based in San Jose, CA. The role involves designing and scaling data pipelines for model training and production systems, while collaborating with machine learning engineers to enhance foundation model development.
Responsibilities:
- Senior Data Engineer will build and scale distributed data pipelines for large-scale time series and log data
- Design reliable, high-performance Spark/Python workflows for model training datasets
- Analyze and resolve performance bottlenecks (latency, memory, skew, throughput)
- Improve data quality, validation, and reproducibility for ML workloads
- As a Senior Data Engineer, you will partner with ML engineers and researchers to accelerate foundation model development
- Measure and optimize application and transaction performance in production systems
Requirements:
- 5+ years of software engineering experience
- Hands-on experience with Apache Spark (PySpark or Scala)
- Experience building large-scale data pipelines in distributed environments
- Experience working with time series, logs, or high-volume event data
- Strong proficiency in Python
- Strong debugging and performance optimization skills
- Experience supporting ML or large model training workflows
- Experience with streaming systems (Kafka, Spark Streaming)
- Experience with cloud-native or Kubernetes-based platforms
- Familiarity with sequence modeling or time series data systems