Job Title: Senior Site Reliability Engineer

Reports to: Lead Cloud Platform & Reliability Engineer

About Us

Welcome to Pinnacle, the ultimate destination for sports enthusiasts seeking an exhilarating sportsbook and gaming experience! Established in 1998, we have solidified our position as one of the globe's foremost licensed online gaming companies. With our cutting-edge offerings, we guarantee an electrifying experience that will keep you on the edge of your seat.

Pinnacle invites you to join our team and become an instrumental figure in the exciting realm of sports betting. Our vibrant team is fueled by passion and driven by innovation, working together to redefine the landscape of sports betting and gaming. Together, we constantly strive to surpass limitations and deliver unparalleled experiences to sports enthusiasts worldwide. Prepare yourself for a thrilling journey and discover sports in an entirely new dimension with Pinnacle!

Role Overview

We are seeking a highly experienced and passionate Senior Site Reliability Engineer (SRE) to join our Engineering and Platform Delivery team. This role will focus on ensuring the high availability, performance, scalability, and reliability of our platform and supporting systems. You will lead efforts in automation, observability, incident response, and infrastructure optimization, collaborating across cross-functional teams including Developers, System Engineers, Security, and Networking.

This position requires strong expertise in Kubernetes (EKS/ECS), AWS infrastructure, container orchestration, and observability solutions including Elastic Cloud. If you thrive in a high-performance environment and are excited to shape and support critical systems used by millions globally, we encourage you to apply.

KEY RESPONSIBILITIES

Infrastructure & Platform Engineering

Design, build, and maintain scalable, reliable, and secure infrastructure solutions using Kubernetes (EKS), ECS and EC2.
Develop and implement CI/CD pipelines, automating deployments and monitoring of distributed systems.
Use Terraform and infrastructure-as-code best practices for provisioning, scaling, and maintaining environments.

Observability & Monitoring

Implement and maintain observability solutions using Elastic Cloud, CloudWatch, and other tools to monitor application health, availability, and performance.
Analyze metrics and logs to identify trends, optimize performance, and prevent incidents.

Operational Excellence

Improve system resiliency through automation and incident reduction strategies.
Establish and manage second-level incident response processes for business-critical applications.
Collaborate with development teams to integrate SRE principles throughout the software delivery lifecycle.

Collaboration & Continuous Improvement

Advocate for SRE and DevOps best practices, mentoring junior engineers and Tier 1 support staff.
Contribute to system design discussions and operational readiness reviews.
Work closely with infrastructure, DBA, and Dev teams to ensure seamless system integrations and stability.

REQUIRED QUALIFICATIONS

Technical Skills & Experience

5+ years of solid experience in a Site Reliability Engineering or DevOps role in AWS environment.
3+ Years of hands-on experience with AWS EKS, ECS or Kubernetes.
5+ Years of experience with AWS services including EC2, CloudWatch, Route53, and AWS Code* toolsets.
Proficiency in infrastructure-as-code using Terraform Cloud and CloudFormation.
3+ Years of experience with observability platforms such as Elastic Cloud, and CloudWatch.
Experience with CI/CD tooling (Octopus, Bitbucket, Git, TeamCity).
Strong scripting skills in Python, PowerShell, or Bash.
Experience working in Agile environments.

Certifications

Mandatory:
AWS Certified DevOps Engineer – Professional
Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)
CompTIA Network+ or equivalent
Preferred:
Certified Kubernetes Security Specialist (CKS)
Elastic Observability or Elastic Certified Engineer
LPIC-1: Linux Administrator
AWS Certified Advanced Networking - Specialty

Nice-to-Have Skills

Exposure to RDS PostgreSQL and MS SQL database administration.

Work Environment & Expectations

The role is embedded within the Engineering and Platform Delivery team, collaborating closely with infrastructure, and software development teams.

This role requires candidates to have their core working hours aligned with business operations in either Europe or North America, depending on assignment. Successful candidates must be able to work within these time zones to support regional offices, ensure effective collaboration, and provide timely operational support. Flexibility may be required for occasional meetings or critical incidents outside of core hours.

We are an equal opportunity employer dedicated to fostering an inclusive and diverse workplace. We prioritize hiring the best candidates based on their skills and qualifications, irrespective of race, gender, age, religion, or any other characteristic. Our strength lies in our diverse teams, and we proudly celebrate and empower everyone to embrace and promote diversity throughout their time with us.

Job Type: Full-time

Save Apply

Report job

Senior Site Reliability Engineer

Lead/Senior Site Reliability Engineer

Data Privacy Engineering

F&b Admin

Kế Toán Nội Bộ

Nhân Viên It - Cổng Trời Đông Giang, Quảng Nam - Thu Nhập Hấp Dẫn

Product Marketing Specialist

Graphic Design Collaborator

Partnership Collaborator

Operations Manager

Chuyên Viên Bán Hàng Dự Án Quà Tặng