Software Engineer, AI Data & Evaluation
Role Overview
As a Senior Software Engineer on the AI Data & Evaluation team, you will design and build synthetic data pipelines, evaluation methodologies, and operational automation systems to improve frontier AI models. You'll work cross-functionally with research, operations, and product teams, owning end-to-end delivery from prototyping to production. This senior role focuses on impact and shipping, requiring a product-oriented mindset and deep interest in AI/ML data systems.
Perks & Benefits
The role is remote, with benefits including a $1.5K monthly meal stipend, free Equinox membership, $200 monthly laundry and wellness reimbursements, health/dental/vision insurance, and a bi-annual performance bonus. Equity is granted over 4 years, and relocation bonuses up to $15K are available. While time zone expectations aren't specified, the remote setup suggests flexibility, and the proximity bonus indicates a preference for in-office presence in SF, NYC, or London.
Full Job Description
About Mercor
Mercor's mission is to organize human intelligence to power the AI economy. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development. Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $2 million a day.
Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society. Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our San Francisco, NYC, or London offices.
About the Role
As a Senior Software Engineer (AI Data & Evaluation) at Mercor, you will be at the core of building the data infrastructure and evaluation systems that power the next generation of frontier AI models. Our team's mission is to develop high-quality data types that push frontier models forward and drive the AI industry ahead.
Software Engineers on this team are builders and innovators first. You will design and develop the evaluation methods and flywheels that drive continuous model improvement, engineer synthetic data pipelines and environments that generate high-signal training data at scale, and build the operational automation that keeps it all running with precision and efficiency. This role demands a product- and impact-oriented mindset, a bias toward shipping, and the ability to thrive at the intersection of data engineering, systems design, and applied AI research.
You Will
Innovate and develop evaluation methodologies and flywheels that continuously improve data quality and model performance at scale.
Design and build synthetic data generation systems and simulation environments that produce high-signal, high-diversity training data for frontier AI models.
Architect and ship operational automation systems that maximize throughput, efficiency, and quality across the end-to-end data pipeline.
Collaborate cross-functionally with Operations, Research, and Product to translate evolving model needs into robust, scalable engineering solutions.
Own end-to-end delivery of critical systems — from prototyping novel ideas to scaling production infrastructure.
What We're Looking For
Strong software engineering skills with a proven track record shipping production systems end-to-end.
Deep interest in and experience with AI/ML data pipelines, evaluation frameworks, or training data systems.
Systems thinking: ability to design for scalability, quality, and operational reliability simultaneously.
Comfort operating with ownership and pragmatism in fast-moving, ambiguous environments.
Effective communication and collaboration with engineering, research, and operations teams.
Experience with synthetic data generation, reinforcement learning environments, or large-scale data quality systems is highly valued.
Why Mercor
Impact: Your work directly shapes the quality of data powering the world's leading AI labs' frontier models.
Learning: Get early, first-hand exposure to cutting-edge model capabilities months before they reach the market.
Growth: Work at the intersection of data engineering and AI research with fast paths to ownership and leadership.
Benefits
Bi-annual performance bonus structure
Generous equity grant vested over 4 years
Up to $15k Relocation bonus
$10K proximity bonus (if you live within 0.5 miles of our office)
$1.5K monthly stipend for meals
Free Equinox membership
$200 monthly laundry reimbursement
$200 monthly personal wellness reimbursement
Health, Dental, Vision insurance
Similar jobs
Found 6 similar jobs