Generalist - Labeling

Role Overview

In this role, you will evaluate and rank AI-generated responses based on quality and coherence, primarily working with preference ranking models. This position is suitable for detail-oriented generalists with a focus on independent work in a remote setting, impacting the effectiveness of conversational AI systems through nuanced judgment and consistent evaluation.

Perks & Benefits

This freelance position offers flexible hours, allowing you to set your own schedule while committing 10 to 20 hours per week. Compensation ranges from $25 to $35 per hour, with weekly payments issued via Stripe Connect. The role promotes a remote and asynchronous work environment, supporting a healthy work-life balance.

⚠️ This job was posted over 8 months ago and may no longer be open. We recommend checking the company's site for the latest status.

Full Job Description

This description is a summary of our understanding of the job description. Click the 'Apply' button to find out more.

Role Description

Mercor is collaborating with a leading AI lab on a short-term project focused on improving preference ranking models for conversational AI systems. We're seeking detail-oriented generalists, ideally with prior experience in data labeling or content evaluation, to assess and rank model outputs across a variety of domains. This opportunity is well-suited for professionals comfortable with nuanced judgment tasks and working independently in a remote setup.

Key Responsibilities

Evaluate and compare AI-generated responses based on quality, coherence, and helpfulness

Assign preference rankings to pairs or sets of model outputs

Follow detailed labeling guidelines and adjust based on evolving criteria

Provide brief written explanations for ranking decisions when required

Flag edge cases or inconsistencies in task design or model output

Qualifications

Prior experience in data labeling, content moderation, or preference ranking tasks

Excellent critical thinking and reading comprehension skills

Comfort working with evolving guidelines and ambiguity

Strong attention to detail and consistency across repetitive tasks

Availability for regular part-time work on a weekly basis

Requirements

Remote and asynchronous: set your own hours

Expected commitment: 10 to 20 hours/week

Flexible workload depending on your availability and performance

Benefits

$25 to $35/hour depending on experience and location

Payments issued weekly via Stripe Connect

This is a freelance engagement; you'll be classified as an independent contractor

Application Process

Submit your resume to get started

Complete a short form to highlight your relevant experience

You may be asked to complete a brief assessment to evaluate task fit

Expect a response within 3 to 5 business days


Moonlight

moonlightrollerway.com

Moonlight is a platform designed to connect freelance developers with companies looking for remote tech talent. Their typical users include startups and established tech firms seeking to scale their engineering teams without the constraints of geographical limitations. Moonlight's main service revolves around facilitating project-based work, allowing developers to find gigs that match their skills and availability. The company promotes a remote-first culture, where collaboration is achieved through digital tools and communication platforms, enabling flexibility and work-life balance for their team members.

Industry

Technology

Fully remote

1 open position

About this company (remote-wise)

Headquarters: Distributed / remote-first
Hires in: US / North America
Team style: Async-ish, remote-first


About the job

Posted on: Nov 9, 2025

Job type: freelance

Category: All others

Location: USA

Skills

Data labeling, Content evaluation, Critical thinking, Attention to detail, Reading comprehension, Flexibility, Independent work
