Role Description
Mercor is collaborating with a leading AI lab on a short-term project focused on improving preference ranking models for conversational AI systems. We're seeking detail-oriented generalists, ideally with prior experience in data labeling or content evaluation, to assess and rank model outputs across a variety of domains. This opportunity is well-suited for professionals comfortable with nuanced judgment tasks and working independently in a remote setup.
Key Responsibilities
Evaluate and compare AI-generated responses based on quality, coherence, and helpfulness
Assign preference rankings to pairs or sets of model outputs
Follow detailed labeling guidelines and adjust based on evolving criteria
Provide brief written explanations for ranking decisions when required
Flag edge cases or inconsistencies in task design or model output
Qualifications
Prior experience in data labeling, content moderation, or preference ranking tasks
Excellent critical thinking and reading comprehension skills
Comfort working with evolving guidelines and ambiguity
Strong attention to detail and consistency across repetitive tasks
Availability for regular part-time work on a weekly basis
Requirements
Remote and asynchronous: set your own hours
Expected commitment: 10–20 hours/week
Flexible workload depending on your availability and performance
Benefits
$25–35/hour depending on experience and location
Payments issued weekly via Stripe Connect
This is a freelance engagement; you'll be classified as an independent contractor
Application Process
Submit your resume to get started
Complete a short form to highlight your relevant experience
You may be asked to complete a brief assessment to evaluate task fit
Expect a response within 3–5 business days
Generalist - Labeling
USA
freelance
$25–35/hour depending on experience and location
Role Summary
In this role, you will evaluate and rank AI-generated responses for quality and coherence, supporting a project to improve preference ranking models. The position suits detail-oriented generalists who work well independently in a remote setting. Your nuanced, consistent judgments contribute to the effectiveness of conversational AI systems.
Benefits & Culture
This freelance position offers flexible hours, allowing you to set your own schedule while committing 10 to 20 hours per week. Compensation ranges from $25 to $35 per hour, with weekly payments issued via Stripe Connect. The role promotes a remote and asynchronous work environment, supporting a healthy work-life balance.