Site Reliability Engineer
Role Overview
This mid-level Site Reliability Engineer role involves building and maintaining production infrastructure using Terraform, Python, and TypeScript, while managing CI/CD pipelines and participating in on-call rotations. The engineer will work closely with a small SRE team and cross-functional members to enhance system availability, security, and performance, focusing on eliminating toil and improving delivery speed. This position impacts the reliability and scalability of digital health systems that support early detection of cognitive disorders.
Perks & Benefits
The role is fully remote but requires the candidate to be based in the US, with no sponsorship provided. It offers a collaborative team environment with opportunities for continuous improvement and advocacy of SRE best practices, likely including flexible hours typical of remote tech jobs. Career growth is implied through involvement in accelerated company growth and exposure to cutting-edge technologies in AI and neuroscience.
Full Job Description
Linus Health is a Boston-based digital health company transforming brain health worldwide. We combine cutting-edge neuroscience, clinical expertise, and AI to advance early detection and intervention for cognitive and brain disorders—empowering people to live longer, healthier lives. With 100+ team members and growing, we're entering a phase of accelerated growth and looking for top talent to help shape our future.
Currently, we are looking for a Mid-level SRE to join our small but mighty team. This role will report to our Director of IT, Cloud & Security and work closely with our Staff SRE as well as other Engineering team members and cross functional team members. Please note that while this role is remote, you must be based in the US to be considered for this position. Unfortunately, we are not able to provide sponsorship at this time.
What You'll Do:
- Leverage infrastructure as code (Terraform) to build and maintain complex production and analytics workflows including networking and containerized services.
- Rapidly diagnose and resolve faults in system services as part of a 24/7 on-call rotation focused on actionable alerting and eliminating toil.
- Improve speed of delivery by developing and maintaining CI/CD pipelines.
- Develop infrastructure automation leveraging Terraform, Python and Typescript.
- Improve system availability, security, compliance, cost effectiveness and performance.
- Estimate work, prioritize tasks, track dependencies, report progress, highlight blockers
- Participate in continuous improvement initiatives, advocate for SRE best practices, and stay current with emerging technologies and trends.
- Be part of a team where your focus will be on building, measuring, and refining the systems infrastructure that runs ouPlease mention the word **REALISTIC** and tag RODguMTk4Ljk5LjE0Mw== when applying to show you read the job post completely (#RODguMTk4Ljk5LjE0Mw==). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.
Similar jobs
Found 1 similar job