Senior Site Reliability Engineer Infrastructure

Role Overview

This senior-level role involves defining and implementing reliability, scalability, and operational excellence strategies as a founding SRE at Underdog. Day-to-day responsibilities include owning incident response processes, guiding teams on Service Level Objectives (SLOs), and leading capacity planning initiatives. The hire will have high impact and real ownership, partnering with platform, infrastructure, and product teams to ensure system reliability and developer experience.

Perks & Benefits

The role offers a rare opportunity to shape SRE practices from the ground up in a fast-growing company, with high impact and real ownership from day one. It likely includes a remote work setup typical for tech jobs, with expectations aligned with U.S. time zones, and a culture that values care, performance, and pushing boundaries for sports fans.

Full Job Description

At Underdog, we make sports more fun.

Our thesis is simple: build the best products and we'll build the biggest company in the space, because there's so much more to be built for sports fans. We're just over five years in, and we're one of the fastest-growing sports companies ever, most recently valued at $1.3B. And it's still the early days.

We've built and scaled multiple games and products across fantasy sports, sports betting, and prediction markets, all united in one seamless, simple, easy to use, intuitive and fun app.

Underdog isn't for everyone. One of our core values is give a sh*t. The people who win here are the ones who care, push, and perform. If that's you, come join us.

Winning as an Underdog is more fun.

This is a rare opportunity to be a founding SRE at Underdog, helping define how reliability, scalability, and operational excellence work as the company continues to grow. You'll operate in exploration mode early on, identifying the highest-leverage reliability challenges and shaping our approach to incident response, observability, and SLOs. This is a high-impact role with real ownership from day one, partnering closely with platform, infrastructure, and product teams to ensure Underdog scales through peak traffic, game-day spikes, and rapid iteration while improving both system reliability and developer experience.

About the role

  • Own and maintain the incident response process, including defining procedures, tools, and best practices
  • Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
  • Lead capacity planning initiatives, focusiPlease mention the word **KINDLINESS** and tag RODguMTk4Ljk5LjE0Mw== when applying to show you read the job post completely (#RODguMTk4Ljk5LjE0Mw==). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Similar jobs

Found 6 similar jobs