Home
Jobs
Saved
Resumes
SRE – Site Reliability Engineering at Stefanini Brasil | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
SRE – Site Reliability Engineering
Stefanini Brasil
Website
LinkedIn
SRE – Site Reliability Engineering
São Paulo, São Paulo, Brazil
Full Time
2 hours ago
No Sponsorship
Apply Now
Key skills
AWS
Azure
BigQuery
Cloud
DNS
EC2
ElasticSearch
Google Cloud Platform
Grafana
JavaScript
Kubernetes
Next.js
NGINX
Node.js
Postgres
Prometheus
RabbitMQ
React
Redis
Terraform
TypeScript
Express
NestJS
GCP
Google Cloud
Serverless
Lambda
S3
RDS
CloudFront
CloudWatch
SQS
API Gateway
PostgreSQL
Prisma
Sequelize
Elasticsearch
Caching
CI/CD
Mentoring
Communication
Cloud Security
WAF
About this role
Role Overview
Focus on raising the level of reliability, observability and resilience of systems operated by the team
Structure, standardize, measure risk and transform operations into engineering
Ensure that production systems are reliable, available, observable, scalable and financially sustainable
Evolve current monitoring toward business visibility and continuity
Identify recurring production failures and perform incident analysis and resolution
Create and maintain operational runbooks
Lead and document post-mortems, identify root causes and propose structural improvements
Support production database management, analyze bottlenecks and operational risks
Work on cloud security with a focus on availability and evaluate configuration and architectural risks
Relate cost x reliability x capacity and suggest improvements for efficient resource usage
Work with the DevOps team on critical releases and assist in pipeline failure remediation.
Requirements
Deep technical mastery of the platforms currently used by the team
Deep / Advanced Knowledge (Required)
Proficiency in Cloud Providers: AWS, GCP; basic knowledge of Huawei Cloud is desirable
Containers and Orchestration: Kubernetes
Compute & Serverless: AWS Lambda, EC2, AWS RDS
Databases & Caching: PostgreSQL, Redis
Networking, Edge and Security: CloudFront, WAF, ELB / ALB / NLB, VPC, Subnets, Security Groups, DNS and routing
CI/CD: CI/CD pipelines, Terraform
Storage: AWS S3
Proxy and Web Server: Nginx
Monitoring and Observability: Grafana / Prometheus, AWS CloudWatch
Messaging and Events: SQS, RabbitMQ
AWS Communication and Services: SES, API Gateway, ECR
Languages and Ecosystem: JavaScript / TypeScript, Node.js, NestJS, ReactJS, Next.js, Sequelize / Prisma / Express
Desired Knowledge: ElasticSearch / OpenSearch, Huawei Cloud, pentest project experience, GCP BigQuery, Microsoft CodePush on Azure.
Tech Stack
AWS
Azure
BigQuery
Cloud
DNS
EC2
ElasticSearch
Google Cloud Platform
Grafana
JavaScript
Kubernetes
Next.js
NGINX
Node.js
Postgres
Prometheus
RabbitMQ
React
Redis
Terraform
TypeScript
Benefits
Meal allowance or food voucher
Discounts on courses, universities and language schools
Stefanini Academy — platform with free, up-to-date online courses and certificates
Mentoring
Benefits club for consultations and exams
Health insurance
Dental insurance
Discounts and perks at top establishments
Travel club
Pet insurance/partnership
Apply Now
Home
Jobs
Saved
Resumes