Long-Term Stock Exchange (LTSE) is an innovative national securities exchange focused on promoting sustainable, long-term growth for companies. The Systems Reliability Engineer will ensure the reliability of LTSE’s cloud-native infrastructure and regulatory reporting workflows, while managing billing, security, and data systems.

Responsibilities:

Assist the finance team with monthly billing workflow execution, ensuring accuracy, auditability, and adherence to regulatory/SEC requirements
Help design and build automated reporting and billing pipelines using AWS services and Terraform/Terragrunt-managed infrastructure
Create tickets for required changes, improvements, or patches, drive these items through completion
Manage secure compute environments (e.g., restricted-access EC2 instances) used for billing and regulatory batch workflows
Maintain documentation, runbooks, checklists, and archival processes for billing artifacts and audit requirements
Proactively initiate conversations and escalate when anomalies, risks, or high-importance issues are detected
Address cloud security findings and coordinate remediation, with a focus on daily Tenable findings and high/critical vulnerabilities
Disposition, categorize, and document findings to maintain compliance and audit readiness
Oversee vendor patching operations for data-center or platform-related vulnerabilities
Track SLA expectations for remediation and ensure timely closure
Proactively initiate discussions if severe risks or unusual patterns emerge
Collaborate with engineering and BI/DS functions on infrastructure needs, data source integration, and pipeline enhancements
Support proof-of-concept efforts, prototype new tooling, and assist with vendor evaluations
Help manage AWS infrastructure, permissions, and resource provisioning needed for BI and analytical workloads
Proactively raise observations or concerns related to data quality, pipeline stability, or analytical tooling
Assist the team in addressing ongoing technical debt, including cleanup, patching, refactoring, and modernization
Take ownership of repetitive tasks, deployments, documentation, and operational workflows to reduce team overhead
Ensure all changes follow proper audit trails, checklists, and change-management processes
Maintain high-quality documentation, runbooks, diagrams, engineering design specs, and operational logs
Proactively highlight high-priority issues, gaps, or improvement opportunities
Participate in on-call rotations, respond to incidents, perform first-level triage, and escalate when necessary
Communicate clearly, promptly, and respectfully - especially in public Slack channels
Coordinate with vendors (platform operators, network providers, tools vendors) to ensure timely resolution and SLA compliance

Requirements:

Strong AWS experience, especially with data pipeline architectures
Proficiency with Terraform/Terragrunt and GitHub workflows
Expert scripting skills
Familiarity with vulnerability management tools (Tenable or similar)
Strong Linux fundamentals
Ability to manage compliance-oriented, audit-driven workflows
Strong communication and documentation habits
Ability to triage issues independently and escalate appropriately
Experience in data-driven systems, billing or financial reporting pipelines, or BI/DS infrastructure support
Experience in regulated or compliance-heavy environments (finance, fintech, health, etc.)
Prior vendor coordination or SLA-driven operational experience is a plus
Demonstrated history of direct responsibility for, background or strong interest in financial markets is highly desirable
Experience with Go (preferred) and Python (legacy)

Systems Reliability Engineer (SRE)

Key skills

About this role

Responsibilities:

Requirements: