Member of Technical Staff, Post-Training

Role Overview

This senior-level role involves designing and implementing high-performance software for training AI models, focusing on post-training to achieve state-of-the-art performance. You will work in a collaborative team environment, bridging research and production by experimenting with training techniques and coordinating with specialist teams to enhance model capabilities. The impact includes advancing AI model performance and contributing directly to Cohere's cutting-edge AI research and deployment.

Perks & Benefits

The role is remote-flexible with offices in multiple cities and a co-working stipend, offering flexibility in location and work setup. Benefits include full health and dental coverage, mental health support, 6 weeks of vacation, parental leave top-up, and personal enrichment perks for arts, fitness, and well-being. The culture is open and inclusive, emphasizing collaboration with top researchers and a focus on innovation in AI, with opportunities for career growth through hands-on experience with large-scale distributed training.

⚠️ This job was posted over 13 months ago and may no longer be open. We recommend checking the company's site for the latest status.

Full Job Description

Who are we?

Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems.

We’re training and deploying frontier models for enterprises who are building AI systems. We believe that our work is instrumental to the widespread adoption of AI and we are looking for folks that want to be part of that.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate about their craft.

We are a global technology company co-headquartered in Toronto and San Francisco, with key offices in London, New York City, Montreal, Seoul, Germany and Paris. Join us!

Why this role?

Advance the state of the art for model post-training, ship state-of-the-art models to production, and bridge the gap between research and production. We have one of the highest ratios of compute to engineers in the world. We do not delineate strongly between engineering and research. Everyone will contribute to writing production code and supporting our research effort depending on individual interest and organisational needs. We have all the compute, data, and talent available for you to do your best work.

Please Note: We have offices in London, Paris, Toronto, San Francisco and New York but also embrace being remote-friendly!

As a Member of Technical Staff, you will:

Design and write high-performance, scalable software for training models.
Consistently post-train the models to reach SOTA-level performance.
Coordinate with other specialist teams (Agentic, Code…) to produce models that have strong all-encompassing performance.
Craft and implement techniques to improve the performance and results of our training cycles in both the SFT and RL regimes.
Research, implement, and experiment with ideas on our supercompute and data infrastructure.
Learn from and work with the best researchers in the field.

You may be a good fit if you have:

Extremely strong software engineering skills.
Proficiency in Python and related ML frameworks such as JAX, PyTorch and XLA/MLIR.
Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray).
Experience using large-scale distributed training strategies.
Hands-on experience training large models at scale.
Hands-on experience with the post-training phase of model training, with a strong emphasis on performance optimisation.
Bonus: papers at top-tier venues (such as NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).

* This is neither an exhaustive nor necessary set of attributes. Even if none of these apply to you, but you believe you will contribute to Cohere, please reach out. We have a wide variety of backgrounds at Cohere.

Full-Time Employees at Cohere enjoy these Perks:

A weekly lunch stipend of $75/£75 or equivalent in your local currency.
Full health and dental benefits, including a separate budget for mental health.
RRSP matching, 401K, Pension Scheme.
100% Parental Leave top-up for up to 6 months, for either parent.
Annual enrichment benefits:
Arts & culture, fitness/wellness, quality time, and a workspace improvement credit.
Education & learning stipend for conferences, courses, and coaching.
6 weeks of paid vacation (30 working days!)
Budget for traveling to other offices if you are remote, plus an annual company offsite.

How and Where We Work:

Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon.
For those in the office: a daily lunch program, plenty of snacks, and regular community and social events.
For those not near an office: a co-working benefit so you can work alongside others in your city.
Everyone receives a $500 home office stipend to set up your workspace properly.

If any of the above doesn’t line up exactly with your experience, we still encourage you to apply.

We strive to create an inclusive work environment for all; we welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider.

cohere.com

Cohere is an AI company that specializes in natural language processing and understanding. Their primary offering is a suite of language models designed to help businesses and developers integrate advanced AI capabilities into their applications. Typical customers include tech companies, developers, and enterprises looking to leverage AI for tasks such as text generation, sentiment analysis, and more. Cohere fosters a remote-first work culture, allowing employees to collaborate seamlessly from various locations, which enhances flexibility and work-life balance.

Industry

Artificial Intelligence

Fully remote

247 open positions

About this company (remote-wise)

Headquarters: Distributed / remote-first
Team style: Async-ish, remote-first

About the job

Posted on: Jun 13, 2025

Location: Remote

Skills

Python
JAX
PyTorch
XLA/MLIR
Kubernetes
Slurm
Ray
Distributed Training
Large Model Training
Performance Optimization
