PerfectServe is a leader in clinical communication and physician scheduling solutions, seeking a Senior Platform Engineer to enhance their cloud platform capabilities. The role involves designing and optimizing cloud-native systems, integrating AI and AIOps, and mentoring fellow engineers to improve engineering quality across the organization.
Responsibilities:
- Lead the design and planning of cloud-native systems, ensuring they meet business objectives, scalability needs, and security standards
- Drive the deployment and evolution of cloud-based solutions with a strong focus on AWS services, leveraging modern Infrastructure as Code and automation strategies
- Identify and implement opportunities to apply AI and machine learning to platform operations, including intelligent monitoring, anomaly detection, predictive autoscaling, automated incident triage, and root-cause analysis
- Continuously tune and improve cloud environments to deliver exceptional performance and reliability for internal teams and external customers
- Develop, document, and maintain robust support, monitoring, and recovery processes for corporate, private, and public cloud systems
- Contribute to incident response and drive improvements through blameless post-mortems
- Elevate the team’s capabilities by mentoring fellow engineers, contributing to code reviews, and sharing expertise through documentation and internal tech talks
- Proactively stay current with tools, technologies, and best practices, including advances in AI, ML, and AIOps, bringing fresh ideas and innovations to the team and championing initiatives that advance PerfectServe’s platform
Requirements:
- Extensive hands-on Kubernetes administration experience at scale, including ArgoCD, Helm, and cluster lifecycle management
- Expert-level proficiency with Terraform and Infrastructure as Code frameworks; experience defining reusable IaC patterns and modules
- Advanced understanding of modern SaaS platforms and proven experience architecting and supporting complex, distributed systems
- Deep expertise with AWS, including networking, compute, storage, and security; AWS certifications are a strong plus
- Experience with AIOps practices and tooling, such as ML-driven observability, intelligent alerting, automated remediation, or predictive scaling, is highly valued
- Ability to evaluate, customize, integrate, and secure cloud platforms to improve overall efficiency and developer experience
- Strong experience with CI/CD pipelines, GitOps workflows, DevOps practices, Internal Developer Platforms (IDP), and security/IAM principles
- Track record of mentoring peers and contributing to a culture of engineering excellence
- Strong communication skills with the ability to convey complex technical concepts to diverse audiences
- Solution Architect Professional and/or Kubernetes (CKA/CKAD) certifications are highly desirable