About Knowtex
Knowtex is building the future of voice AI operating systems for clinicians, transforming how healthcare documentation happens at the point of care. Founded by Stanford AI scientists with deep clinical experience, we're experiencing explosive growth across both commercial health systems and federal healthcare, with our ambient documentation platform scaling rapidly to thousands of clinicians across hundreds of specialties. We're at an inflection point where cutting-edge AI meets real clinical impact, giving clinicians hours back each day to focus on what matters most - their patients.
Position Overview
We're seeking a Senior DevOps Engineer to architect and scale our infrastructure as we grow 50x in health systems across the United States. You'll be instrumental in building resilient, compliant, and high-performance systems that process millions of clinical encounters while maintaining sub-millisecond latency for real-time voice AI.
Key Responsibilities
Design and implement AWS GovCloud infrastructure supporting FISMA and FedRAMP compliance requirements
Scale ECS container orchestration and Lambda functions for explosive user growth
Build and optimize CI/CD pipelines for rapid, safe deployments across development, staging, and production environments
Implement infrastructure as code using Terraform for reproducible, auditable deployments
Architect high-availability systems with 99.99% uptime SLAs for clinical operations
Optimize Triton Inference Server deployment for ML model serving at scale
Design HIPAA-compliant data pipelines and storage solutions for PHI
Implement comprehensive monitoring, alerting, and observability systems
Manage VistA/CPRS integration infrastructure and API gateway configurations
Lead incident response and post-mortem processes for production issues
Implement zero-downtime deployment strategies for continuous delivery
Design disaster recovery and business continuity plans for healthcare operations
Collaborate with security team on ATO documentation and compliance audits
Optimize costs while maintaining performance across multi-region deployments
Required Qualifications
5+ years of DevOps/SRE experience with at least 2 years in senior/lead capacity
Expert-level knowledge of AWS services (ECS, Lambda, RDS, API Gateway, CloudFormation, VPC)
Strong experience with containerization (Docker) and orchestration (ECS/Kubernetes)
Proficiency in infrastructure as code (Terraform, CloudFormation)
Experience with CI/CD tools (GitLab CI, GitHub Actions, Jenkins)
Strong scripting skills (Python, Bash, Go)
Experience with monitoring and observability tools (Datadog, CloudWatch, Prometheus)
Understanding of security best practices and compliance frameworks
Experience with high-availability and disaster recovery planning
Bachelor's degree in Computer Science, Engineering, or equivalent experience
Preferred Qualifications
AWS GovCloud and FedRAMP compliance experience
Healthcare IT infrastructure experience (HIPAA, HITRUST, SOC 2)
Experience with ML model deployment infrastructure (Triton, SageMaker)
Knowledge of VistA or other healthcare system integrations
Experience scaling from startup to enterprise (10x+ growth)
Familiarity with FHIR R4 and healthcare interoperability standards
Experience with real-time systems and low-latency optimization
Federal healthcare or government contracting experience
AWS Solutions Architect or DevOps Engineer certification
Technical Environment
AWS GovCloud infrastructure with multi-region deployment
ECS containers and Lambda serverless functions
Triton Inference Server for ML model deployment
FHIR R4 integration with major EHR systems
Real-time voice processing with <200ms latency requirements
Processing hundreds of thousands of clinical encounters daily
Multi-tenant architecture supporting 200+ medical specialties
Benefits
Meaningful equity compensation
Unlimited PTO
Premium health, dental, and vision coverage
401(k) plan
Hybrid work model: 3 days/week in our San Francisco office