About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Model Evaluation Contractor

Type: Contract

Compensation: $25–$35/hour

Commitment: 20 hours/week

Write realistic prompts reflecting professional and consumer domain-specific guidance.
Evaluate AI-generated responses for factual accuracy, regulatory correctness, and practical usefulness.
Identify fabricated claims, incorrect references, or misleading reasoning in model outputs.
Score and rank multiple model responses using structured rubrics across dimensions.
Provide written justifications with specific evidence for each evaluation.

Qualifications

Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.

Submit your resume to begin.
Complete the Model Response Evaluation assessment.

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Professional Evaluator - Fully Remote | Upto $35/hr Hourly

Similar Jobs

Recent Jobs

You May Also Like