Role Overview

Design, implement, and maintain CI/CD pipelines for enterprise data processing and ingestion.
Automate build, test, and deployment workflows for Spark, Hive, Kafka, and real‑time jobs.
Establish standards ensuring reliability, scalability, and repeatability across environments.
Provision and manage Hadoop and distributed compute clusters using Ansible, Mesos, and Marathon.
Lead lifecycle management, including upgrades, expansions, and decommissioning.
Support modernization initiatives across thousands of nodes and multi‑tenant workloads.
Implement observability solutions: Metrics dashboards with Grafana/Prometheus Centralized logs via Elasticsearch Job and platform monitoring with ITRS, Dynatrace, and similar tools
Ensure stability, SLA adherence, and rapid incident response for critical pipelines.
Dockerize ingestion and processing components, including C3 workloads.
Enable and operate Spark workloads on Kubernetes.
Promote container execution models for scalability and operational efficiency.
Design and manage complex workflows and DAGs using Oozie, Autosys, and Marathon.
Ensure fault‑tolerant, auditable, and reliable pipeline execution.
Define orchestration standards for onboarding new applications and business lines.
Tune Impala, YARN, and Kudu for multi‑tenant performance and fairness.
Optimize Spark executor memory, shuffle behavior, and resource allocation.
Configure storage and compute parameters for high‑throughput processing.
Support and optimize data formats on HDFS.
Maintain efficient partitioning strategies for Hive, Kudu, and HBase.
Enable scalable, governed access for analytics and operational workloads.
Apply and manage fine‑grained access controls using Apache Ranger.
Integrate Kerberos and ensure encryption in transit and at rest.
Maintain compliance with regulatory, audit, and enterprise standards.
Partner with senior executives across Markets, Risk, and Banking to align platform strategy.
Lead modernization efforts, including Hadoop upgrades and end‑of‑life remediation for 2,000+ nodes.
Coordinate resiliency and disaster‑recovery testing for 100+ tenant applications.
Drive cross‑business enablement through unified governance, cataloging, and data‑management services.
Onboard and integrate AI, data‑modeling, and GenAI platforms (C3, Talend, AskGPS).
Provide capacity planning, infrastructure forecasting, and financial guidance for annual investment planning.

Requirements

Extensive experience with large‑scale data platforms and distributed systems, including Hadoop, Spark, Hive, Kafka, Impala, YARN, Kudu, and HBase.
Strong background in DevOps and automation, including CI/CD pipelines, Ansible, Mesos, and Marathon.
Hands‑on experience with containerization and orchestration technologies such as Docker and Kubernetes.
Experience implementing enterprise monitoring, observability, and logging solutions.
Deep understanding of data security, access controls, and compliance in regulated environments.
Proven ability to operate independently and influence technology strategy at an enterprise level.
Experience partnering with senior leadership and managing cross‑functional stakeholders.
Strong financial acumen, including capacity forecasting and cost optimization.
Demonstrated success leading complex modernization and transformation initiatives.
Bachelor’s degree in Computer Science, Engineering, or a related field (advanced degree preferred).

Tech Stack

Ansible
Apache
Distributed Systems
Docker
ElasticSearch
Grafana
Hadoop
HBase
HDFS
Kafka
Kubernetes
Prometheus
Spark
Yarn

Benefits

affordable, competitive and flexible benefits
health insurance
wellness programs

Senior Tech Services Manager

Key skills

About this role

Role Overview

Requirements

Tech Stack

Benefits