Senior Site Reliability Engineer
Role Overview
This senior-level role involves leading efforts to improve system reliability, scalability, and performance across critical services. As a technical leader, you will design observability systems, define SLIs/SLOs, lead incident response, and automate workflows to reduce operational toil. You will influence engineering decisions and build tooling to enhance production safety and predictability.
Perks & Benefits
The role is based in Austin, TX, US, which suggests a hybrid or on-site arrangement, possibly with some flexibility. As a senior role, it offers opportunities for technical leadership, career growth within the SRE function, and influence across engineering teams. The company culture emphasizes innovation, AI-driven solutions, and a focus on security and scalability in a cloud-native environment.
Full Job Description
About Us
UJET leads the way in AI-powered contact center innovation, delivering a future-proof cloud platform that redefines the customer experience with cutting-edge AI, true multimodality, and a mobile-first approach. We infuse AI across every aspect of your customer journey and contact center operations to drive automation and efficiency. UJET's AI solutions empower agents, optimize customer journeys, and transform contact center operations for elevated experiences and actionable insights. Built on a cloud-native architecture with a unique CRM-first approach, UJET ensures unmatched security, scalability, and prioritized data insights (without storing PII). Designed for effortless use, UJET partners with businesses to deliver exceptional interactions, smarter decision-making, and accelerated growth in the AI-driven world. Learn more at www.ujet.cx.
Position Overview
We’re looking for a Senior Site Reliability Engineer to help build and scale a high-impact SRE function. You’ll be a technical leader on a team responsible for improving system reliability, reducing operational toil, and establishing best practices across engineering. In this position, you’ll design how reliability works in UJET, influence engineering decisions, and build the tooling and processes that make production safer and more predictable.
Responsibilities
Lead efforts to improve system reliability, scalability, and performance across critical services
Define and implement SLIs/SLOs and error budgets, and use them to guide engineering priorities
Design and develop observability systems (metrics, logging, tracing, alerting) that produce actionable alerts and data with minimal noise
Lead complex incident response, acting as incident commander when needed
Conduct postmortems focused on systemic causes rather than individual fault, and ensure corrective actions from those reviews are completed
Identify and eliminate toil through automation, tooling, and improved workflows
Partner with product and platform teams on architecture decisions, production readiness, and de