Job Title: Senior Site Reliability Engineer
Reports to: Lead Cloud Platform & Reliability Engineer
About Us
Welcome to Pinnacle, the ultimate destination for sports enthusiasts seeking an exhilarating sportsbook and gaming experience! Established in 1998, we have solidified our position as one of the globe's foremost licensed online gaming companies. With our cutting-edge offerings, we guarantee an electrifying experience that will keep you on the edge of your seat.
Pinnacle invites you to join our team and become an instrumental figure in the exciting realm of sports betting. Our vibrant team is fueled by passion and driven by innovation, working together to redefine the landscape of sports betting and gaming. Together, we constantly strive to surpass limitations and deliver unparalleled experiences to sports enthusiasts worldwide. Prepare yourself for a thrilling journey and discover sports in an entirely new dimension with Pinnacle!
Role Overview
We are seeking a highly experienced and passionate Senior Site Reliability Engineer (SRE) to join our Engineering and Platform Delivery team. This role will focus on ensuring the high availability, performance, scalability, and reliability of our platform and supporting systems. You will lead efforts in automation, observability, incident response, and infrastructure optimization, collaborating with cross-functional teams including Developers, System Engineers, Security, and Networking.
This position requires strong expertise in Kubernetes (EKS), ECS, AWS infrastructure, container orchestration, and observability solutions including Elastic Cloud. If you thrive in a high-performance environment and are excited to shape and support critical systems used by millions globally, we encourage you to apply.
KEY RESPONSIBILITIES
Infrastructure & Platform Engineering
- Design, build, and maintain scalable, reliable, and secure infrastructure solutions using Kubernetes (EKS), ECS, and EC2.
- Develop and implement CI/CD pipelines, automating the deployment and monitoring of distributed systems.
- Use Terraform and infrastructure-as-code best practices for provisioning, scaling, and maintaining environments.
Observability & Monitoring
- Implement and maintain observability solutions using Elastic Cloud, CloudWatch, and other tools to monitor application health, availability, and performance.
- Analyze metrics and logs to identify trends, optimize performance, and prevent incidents.
Operational Excellence
- Improve system resiliency through automation and incident reduction strategies.
- Establish and manage second-level incident response processes for business-critical applications.
- Collaborate with development teams to integrate SRE principles throughout the software delivery lifecycle.
Collaboration & Continuous Improvement
- Advocate for SRE and DevOps best practices, mentoring junior engineers and Tier 1 support staff.
- Contribute to system design discussions and operational readiness reviews.
- Work closely with infrastructure, DBA, and Dev teams to ensure seamless system integrations and stability.
REQUIRED QUALIFICATIONS
Technical Skills & Experience
- 5+ years of solid experience in a Site Reliability Engineering or DevOps role in an AWS environment.
- 3+ years of hands-on experience with AWS EKS, ECS, or Kubernetes.
- 5+ years of experience with AWS services including EC2, CloudWatch, Route53, and AWS Code* toolsets.
- Proficiency in infrastructure-as-code using Terraform Cloud and CloudFormation.
- 3+ years of experience with observability platforms such as Elastic Cloud and CloudWatch.
- Experience with CI/CD tooling (Octopus, Bitbucket, Git, TeamCity).
- Strong scripting skills in Python, PowerShell, or Bash.
- Experience working in Agile environments.
Certifications
- Mandatory:
  - AWS Certified DevOps Engineer – Professional
  - Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)
  - CompTIA Network+ or equivalent
- Preferred:
  - Certified Kubernetes Security Specialist (CKS)
  - Elastic Observability or Elastic Certified Engineer
  - LPIC-1: Linux Administrator
  - AWS Certified Advanced Networking – Specialty
Nice-to-Have Skills
- Exposure to RDS PostgreSQL and MS SQL database administration.
Work Environment & Expectations
- The role is embedded within the Engineering and Platform Delivery team, collaborating closely with infrastructure and software development teams.
This role requires candidates to have their core working hours aligned with business operations in either Europe or North America, depending on assignment. Successful candidates must be able to work within these time zones to support regional offices, ensure effective collaboration, and provide timely operational support. Flexibility may be required for occasional meetings or critical incidents outside of core hours.
We are an equal opportunity employer dedicated to fostering an inclusive and diverse workplace. We prioritize hiring the best candidates based on their skills and qualifications, irrespective of race, gender, age, religion, or any other characteristic. Our strength lies in our diverse teams, and we proudly celebrate and empower everyone to embrace and promote diversity throughout their time with us.
Job Type: Full-time