Senior Site Reliability Engineer

Role Overview

This senior-level role involves leading efforts to improve system reliability, scalability, and performance across critical services. As a technical leader, you will design observability systems, define SLIs/SLOs, lead incident response, and automate workflows to reduce operational toil. You will influence engineering decisions and build tooling to enhance production safety and predictability.

Perks & Benefits

The job is based in Austin, TX, US, implying a hybrid or on-site setup with potential flexibility. As a senior role, it offers opportunities for technical leadership, career growth in SRE functions, and influence across engineering teams. The company culture emphasizes innovation, AI-driven solutions, and a focus on security and scalability in a cloud-native environment.

Full Job Description

About Us

UJET leads the way in AI-powered contact center innovation, delivering a future-proof cloud platform that redefines the customer experience with cutting-edge AI, true multimodality, and a mobile-first approach. We infuse AI across every aspect of your customer journey and contact center operations to drive automation and efficiency. UJET's AI solutions empower agents, optimize customer journeys, and transform contact center operations for elevated experiences and actionable insights. Built on a cloud-native architecture with a unique CRM-first approach, UJET ensures unmatched security, scalability, and prioritized data insights (without storing PII). Designed for effortless use, UJET partners with businesses to deliver exceptional interactions, smarter decision-making, and accelerated growth in the AI-driven world. Learn more at www.ujet.cx.

Position Overview

We’re looking for a Senior Site Reliability Engineer to help build and scale a high-impact SRE function. You’ll be a technical leader on a team responsible for improving system reliability, reducing operational toil, and establishing best practices across engineering.

In this position, you’ll design how reliability works at UJET, influence engineering decisions, and build the tooling and processes that make production safer and more predictable.

Responsibilities

Lead efforts to improve system reliability, scalability, and performance across critical services
Define and implement SLIs/SLOs and error budgets, and use them to guide engineering priorities
Design and develop observability systems (metrics, logging, tracing, alerting) that produce actionable alerts and data with minimal noise
Lead complex incident response, acting as incident commander when needed
Conduct postmortems focused on systemic causes rather than individual fault, and ensure corrective actions from those reviews are completed
Identify and eliminate toil through automation, tooling, and improved workflows
Partner with product and platform teams on architecture decisions, production readiness, and de...

Please mention the word **UNQUESTIONABLY** and tag RMjE3LjE0NC4xODguMTQ2 when applying to show you read the job post completely (#RMjE3LjE0NC4xODguMTQ2). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Ujet

ujet.cx

Ujet is a technology company that develops customer service software solutions. Their platform helps businesses manage customer interactions across multiple channels including messaging, email, and social media. Their typical customers are mid-sized to large enterprises looking to streamline their customer support operations. The company appears to operate with a distributed or remote-friendly work culture, allowing team members to collaborate from various locations.

Industry

Technology/Software

Remote-friendly or distributed team

2 open positions

About this company (remote-wise)

Headquarters: Distributed / remote-first

About the job

Posted on: Apr 18, 2026

Location: Austin, TX, US

Skills

Site Reliability Engineering, SLI/SLO Implementation, Observability Systems, Incident Response, Automation, Cloud-Native Architecture, System Scalability, Postmortem Analysis
