Long-Term Stock Exchange (LTSE) is an innovative national securities exchange focused on promoting sustainable, long-term growth for companies. The Systems Reliability Engineer will ensure the reliability of LTSE’s cloud-native infrastructure and regulatory reporting workflows, while managing billing, security, and data systems.
Responsibilities:
- Assist the finance team with monthly billing workflow execution, ensuring accuracy, auditability, and adherence to regulatory/SEC requirements
- Help design and build automated reporting and billing pipelines using AWS services and Terraform/Terragrunt-managed infrastructure
- Create tickets for required changes, improvements, or patches, drive these items through completion
- Manage secure compute environments (e.g., restricted-access EC2 instances) used for billing and regulatory batch workflows
- Maintain documentation, runbooks, checklists, and archival processes for billing artifacts and audit requirements
- Proactively initiate conversations and escalate when anomalies, risks, or high-importance issues are detected
- Address cloud security findings and coordinate remediation, with a focus on daily Tenable findings and high/critical vulnerabilities
- Disposition, categorize, and document findings to maintain compliance and audit readiness
- Oversee vendor patching operations for data-center or platform-related vulnerabilities
- Track SLA expectations for remediation and ensure timely closure
- Proactively initiate discussions if severe risks or unusual patterns emerge
- Collaborate with engineering and BI/DS functions on infrastructure needs, data source integration, and pipeline enhancements
- Support proof-of-concept efforts, prototype new tooling, and assist with vendor evaluations
- Help manage AWS infrastructure, permissions, and resource provisioning needed for BI and analytical workloads
- Proactively raise observations or concerns related to data quality, pipeline stability, or analytical tooling
- Assist the team in addressing ongoing technical debt, including cleanup, patching, refactoring, and modernization
- Take ownership of repetitive tasks, deployments, documentation, and operational workflows to reduce team overhead
- Ensure all changes follow proper audit trails, checklists, and change-management processes
- Maintain high-quality documentation, runbooks, diagrams, engineering design specs, and operational logs
- Proactively highlight high-priority issues, gaps, or improvement opportunities
- Participate in on-call rotations, respond to incidents, perform first-level triage, and escalate when necessary
- Communicate clearly, promptly, and respectfully - especially in public Slack channels
- Coordinate with vendors (platform operators, network providers, tools vendors) to ensure timely resolution and SLA compliance
Requirements:
- Strong AWS experience, especially with data pipeline architectures
- Proficiency with Terraform/Terragrunt and GitHub workflows
- Expert scripting skills
- Familiarity with vulnerability management tools (Tenable or similar)
- Strong Linux fundamentals
- Ability to manage compliance-oriented, audit-driven workflows
- Strong communication and documentation habits
- Ability to triage issues independently and escalate appropriately
- Experience in data-driven systems, billing or financial reporting pipelines, or BI/DS infrastructure support
- Experience in regulated or compliance-heavy environments (finance, fintech, health, etc.)
- Prior vendor coordination or SLA-driven operational experience is a plus
- Demonstrated history of direct responsibility for, background or strong interest in financial markets is highly desirable
- Experience with Go (preferred) and Python (legacy)