Partner with Product Management to develop and maintain roadmaps across shipment lifecycle domains (tracking, ETA, dispatch, carrier management)
Drive your pods to deliver roadmap features with agility and high quality — from concept to production at scale
Own Customer Portal 2.0 and similar customer-facing platform initiatives
Represent your teams and product lines in cross-functional discussions with Product, QA, SRE, Data Engineering, and Customer Success teams
Own and drive the reliability, performance, and scalability of 75+ OTR workers/services across 4 programming languages (Ruby, Java, Node.js, Python)
Maintain a zero P0/P1 incident streak through proactive monitoring (Chronosphere, PagerDuty), RCA-driven improvements, and structured on-call rotations
Drive a zero-bug-backlog culture with PMO-driven process excellence and AI-powered acceleration
Lead architecture modernization efforts including Ruby-to-Java/Spring Boot migrations (Shipments Platform)
Own Redis, Kafka, PostgreSQL, and ClickHouse health for OTR services; drive capacity planning and cost optimization
Champion AI-driven development lifecycle (AI-DLC) adoption across all pods — including Claude Code, MCP integrations, and agentic AI tooling
Own and scale Dispatch MCP — MCP endpoints co-located in OTR microservices enabling AI agents (Tracy, Sam, Foresight, Cassie) to interact with shipment data
Drive developer productivity through CI/CD pipeline optimization, DORA metrics tracking, code quality automation, and PR analytics
Build and maintain RCA bots using structured decision trees integrated with Cassie for L1 support automation
Establish and evangelize engineering best practices: code reviews, test coverage, release sign-off processes, and documentation standards
Hire, nurture, and mentor top engineering talent across multiple pods
Conduct structured performance appraisals, promotion nominations, and career development conversations
Manage performance improvement plans with empathy and accountability
Scale the org from its current state to 9 pods by Q4 FY27, per the staffing plan
Requirements
Overall 10+ years of experience in software development with strong computer science fundamentals
At least 3 years of experience managing groups of 10+ software engineers across multiple pods/teams
Strong design and architecture exposure in building large-scale enterprise SaaS solutions using cloud technologies (AWS/Azure)
Hands-on experience with Java/Spring Boot, Ruby, Node.js, or Python in production environments
Deep experience with Kafka, Redis, PostgreSQL, and distributed message-driven architectures
Proven track record of maintaining high-availability (24x7) systems with zero/near-zero downtime
Extensive experience with CI/CD pipelines, release management, and production deployment practices
Strong communication skills and ability to convey complex product requirements or technical concepts to stakeholders at all levels
Excellent command of SDLC activities including analysis, design, development, testing, deployment, and post-production support
Preferred:
Experience with AI/ML integration in production systems: LLM-based agents, Model Context Protocol (MCP), agentic workflows
Exposure to big data technologies: Spark, ClickHouse, Elasticsearch
Experience with container orchestration (Kubernetes/Docker)
Familiarity with monitoring/observability platforms: Chronosphere, PagerDuty, Datadog
Experience in the supply chain, logistics, or freight visibility domain
Tech Stack
AWS
Azure
Cloud
Docker
Elasticsearch
Java
JavaScript
Kafka
Kubernetes
Microservices
Node.js
Postgres
Python
Redis
Ruby
SDLC
Spark
Spring
Spring Boot
Benefits
Medical benefits start on the first day of employment
36 PTO days (Sick, Casual, and Earned), 5 recharge days, 2 volunteer days
Home office setup and technology reimbursement
Lifestyle & Family benefits
Annual and festive swag
Ongoing learning & development opportunities (professional development program, Toastmasters club, etc.)