Site Reliability Engineer

This listing is synced directly from the company ATS.

Role Overview

This is a senior-level Site Reliability Engineering role focused on developing and operating Kong's Managed Gateways Platform, ensuring 99.99% uptime SLA and high reliability. The engineer will architect and operate software systems, collaborate with product leadership on strategy, and drive innovation for the platform's technical direction. They will have end-to-end ownership of the platform's features, quality, and performance, working in a team environment to maintain scalable distributed systems.

Perks & Benefits

The job is fully remote, with no specific time zone mentioned, implying flexibility for global collaboration. It offers opportunities for career growth through innovation and shaping technical direction, with a culture that values strong communication and collaboration across teams. Benefits likely include typical tech perks such as professional development and a focus on work-life balance, given the remote setup and emphasis on high-pressure incident management.

Full Job Description

Are you ready to power the World's connections?

If you don’t think you meet all of the criteria below but are still interested in the job, please apply. Nobody checks every box - we’re looking for candidates that are particularly strong in a few areas, and have some interest and capabilities in others.

About The Role

Kong’s API Management suite, Konnect, offers customers the ability to create managed API gateways in any cloud across the world. We offer two flavors of managed gateways Dedicated Cloud Gateways and Serverless Gateways.

Dedicated Cloud Gateways provide organizations with a fully managed, private, and isolated gateway environment that eliminates the complexity of managing infrastructure while ensuring security and compliance. Designed for high performance and scalability, these gateways allow businesses to control traffic flow across their services without sharing resources with other customers. By offering dedicated gateways in the cloud, Kong empowers organizations to maintain the highest levels of reliability and performance while reducing operational overhead. See more at Announcing the GA of Kong Konnect Dedicated Cloud Gateways

Serverless Gateways offer a fully managed, elastic API gateway solution that meets traffic demands without the need for provisioning or managing any underlying infrastructure. Built to simplify operations and reduce costs, these gateways empower organizations to deploy APIs seamlessly in serverless environments, ensuring efficient resource utilization. Konnect Serverless Gateways: Lightweight, Cost-Effective, and Fully Managed Kong Gateways

This role is a Site Reliability Engineering (SRE) position to both develop and operate the Managed Gateways offerings at high nines of reliability.

What You'll Do

  • Own the end-to-end technical success of the Managed Gateways Platform, ensuring it delivers industry-leading features, quality, performance, and reliability.

  • Architect and operate software systems to maintain 99.99% uptime SLA.

  • Play a pivotal role in shaping the technical direction for the Managed Gateways product by driving innovation.

  • Collaborate closely with product leadership to define and refine strategy, roadmap, and objectives for managed gateways, ensuring alignment between engineering efforts and business goals.

What You'll Bring

  • Bachelor's or Master's degree in Computer Science or a related field.

  • 3+ years of experience in building and operating highly reliable SaaS/PaaS systems.

  • Hands-on experience with at least one of the major cloud platforms (AWS, Azure, or GCP)

  • Strong experience with Kubernetes.

  • Familiarity with observability tools such as Datadog, Prometheus, Grafana, Victoria Metrics, Loki, or similar technologies.

  • Expertise in designing and developing highly scalable distributed systems

  • Strong expertise in networking concepts, particularly OSI Layer 4 (Transport) and Layer 7 (Application) protocols including DNS, TLS/SSL, and HTTP, and public cloud network architectures.

  • Experience managing incidents and communicating effectively under high-pressure situations

  • Backend development experience (preferably with GoLang)

  • 3+ years of experience in SaaS development, specifically operating software with 99.99% reliability or higher.

  • Strong verbal and written communication skills to effectively collaborate across teams.

Bonus Points

  • Experience with PostgreSQL

  • Experience with building and maintaining Kubernetes Controllers

  • Experience with working or developing L4/L7 proxies such as Nginx, HA-proxy, Envoy, or others

  • Contributions at technical conferences or meetups as a speaker

#LI-KK1

About Kong:

Kong Inc., a leading developer of API and AI connectivity technologies, is building the infrastructure that powers the agentic era. trusted by the Fortune 500 and startups alike, Kong's unified API and AI platform, Kong Konnect, enables organizations to secure, manage, accelerate, govern, and monetize the flow of intelligence across APIs and AI models. For more information, visit www.konghq.com.

Similar jobs

Found 6 similar jobs

Browse more jobs in: