Senior Site Reliability Engineer Infrastructure

Role Overview

This senior-level role involves defining and implementing reliability, scalability, and operational excellence strategies as a founding SRE at Underdog. Day-to-day responsibilities include owning incident response processes, guiding teams on Service Level Objectives (SLOs), and leading capacity planning initiatives. The hire will have high impact and real ownership, partnering with platform, infrastructure, and product teams to ensure system reliability and developer experience.

Perks & Benefits

The role offers a rare opportunity to shape SRE practices from the ground up in a fast-growing company, with high impact and real ownership from day one. It likely includes a remote work setup typical for tech jobs, with expectations aligned with U.S. time zones, and a culture that values care, performance, and pushing boundaries for sports fans.

Full Job Description

At Underdog, we make sports more fun.

Our thesis is simple: build the best products and we'll build the biggest company in the space, because there's so much more to be built for sports fans. We're just over five years in, and we're one of the fastest-growing sports companies ever, most recently valued at $1.3B. And it's still the early days.

We've built and scaled multiple games and products across fantasy sports, sports betting, and prediction markets, all united in one seamless, simple, easy to use, intuitive and fun app.

Underdog isn't for everyone. One of our core values is give a sh*t. The people who win here are the ones who care, push, and perform. If that's you, come join us.

Winning as an Underdog is more fun.

This is a rare opportunity to be a founding SRE at Underdog, helping define how reliability, scalability, and operational excellence work as the company continues to grow. You'll operate in exploration mode early on, identifying the highest-leverage reliability challenges and shaping our approach to incident response, observability, and SLOs. This is a high-impact role with real ownership from day one, partnering closely with platform, infrastructure, and product teams to ensure Underdog scales through peak traffic, game-day spikes, and rapid iteration while improving both system reliability and developer experience.

About the role

Own and maintain the incident response process, including defining procedures, tools, and best practices
Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
Lead capacity planning initiatives, focusi[…]

Please mention the word **KINDLINESS** and tag RODguMTk4Ljk5LjE0Mw== when applying to show you read the job post completely (#RODguMTk4Ljk5LjE0Mw==). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Underdog Sports

underdogsports.com

Underdog Sports is a platform that specializes in fantasy sports and sports betting. Their typical customers are sports enthusiasts looking for engaging ways to participate in fantasy leagues and betting activities. The company offers various tools and resources for users to maximize their gaming experience while providing a competitive edge. Underdog Sports embraces a fully remote work culture, allowing employees to work from anywhere while promoting flexibility and work-life balance.

Industry

Sports Technology

Fully Remote

6 open positions

About this company (remote-wise)

Headquarters: Distributed / remote-first
Hires in: US / North America
Team style: Async-ish, remote-first

About the job

Posted on: Jan 20, 2026

Location: United States

Skills

Incident Response, Service Level Objectives, Capacity Planning, Observability, Scalability, Infrastructure, Platform Engineering, Developer Experience
