Support the day-to-day operations of a mobile point-of-sale system
Provide first-line operational support, monitor systems, and resolve production incidents
Troubleshoot cloud systems and integrations, applying corrective actions
Manage escalations and collaborate on bug fixes and hotfixes
Administer MDM solutions and support remote software deployments
Implement automated monitoring and alerting to improve incident response
Document processes, maintain knowledge bases, and create incident runbooks
Participate in on-call rotation to ensure 24/7 critical incident coverage
Contribute to post-incident reviews to improve monitoring, response, and resolution
Build Node/TypeScript utilities to automate workflows, parse logs/JSON, and validate API payloads
Troubleshoot REST/GraphQL integrations and analyze request/response traces
Manage third-party API integrations and work with teams to improve error handling
Analyze system and application logging and telemetry to resolve issues
Manage and administrate system access
Requirements
Bachelor’s degree in Computer Science, Engineering, or a related field
3+ years supporting production systems, focused on incident response and resolution
Strong experience in operational support or SRE roles in cloud environments
Proficiency in Node.js, including debugging, error handling, and performance troubleshooting
Experience with AWS, Azure, or GCP, including monitoring and troubleshooting cloud-native applications
Experience working with APIs and integrations
Familiarity with logging and monitoring tools (Winston, Bunyan, Datadog, ELK Stack, CloudWatch)
Strong problem-solving skills in high-pressure, time-sensitive situations
Experience with CI/CD pipelines and automated deployments (Jenkins, GitLab CI, AWS CodePipeline)
Strong communication skills, with clear and structured incident reporting and documentation
Effective cross-functional collaboration across development, DevOps, and product teams
Upper-Intermediate+ English level
Desirable:
Experience with containerization (Docker, Kubernetes)
Knowledge of REST APIs, WebSockets, and microservices architecture
Familiarity with incident management frameworks (ITIL, SRE practices)
Understanding of cloud security best practices
Experience with mobile POS platforms or mobile application environments
Familiarity with mobile device management (MDM) solutions
Tech Stack
AWS
Azure
Cloud
Docker
Google Cloud Platform
GraphQL
JavaScript
Jenkins
Kubernetes
Microservices
Node.js
TypeScript
Benefits
Get 30 paid days off per year to use however you like — vacations, holidays, or personal time
5 paid sick days, up to 60 days of medical leave, and up to 6 paid days off per year for major family events like weddings, funerals, or the birth of a child
Partially covered health insurance after the probation, plus a wellness bonus for gym memberships, sports nutrition, and similar needs after 6 months
We pay in U.S. dollars and cover all approved overtime
Join English lessons and Dev.Pro University programs, and take part in fun online activities and team-building events