Mercor
This role involves writing and refining prompts to guide model behavior in healthcare scenarios and evaluating large language model (LLM)-generated responses for accuracy, reasoning, clarity, and completeness. The evaluator will fact-check medical claims, annotate model responses for strengths and inaccuracies, and assess tone and appropriateness for real-world healthcare use while applying consistent evaluation standards.
Key Responsibilities
- Write and refine prompts to guide model behavior in healthcare scenarios.
- Evaluate LLM-generated responses to healthcare queries for accuracy, reasoning, clarity, and completeness.
- Conduct fact-checking of all medical and healthcare claims using trusted public sources and authoritative references.
- Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
- Assess tone, completeness, and appropriateness of responses for real-world healthcare use.
- Apply consistent evaluation standards by following clear taxonomies, benchmarks, and detailed evaluation guidelines.
Required
- 5+ years of real-world professional experience in healthcare, supported by an associated expert degree (e.g., MD, DO, RN, NP, PA, PharmD, MPH, or equivalent).
- Experience in one or more of the following sub-domains: General Clinical Care, Specialty Medicine or Surgery, Diagnostics, Imaging & Laboratory Medicine, Public Health, Healthcare Systems & Administration.
- Significant experience using large language models (LLMs) and an understanding of their use.
- Excellent written communication skills for complex medical topics.
- Strong attention to detail and comfort in evaluating clinical reasoning and medical explanations.
Preferred
- Prior experience with RLHF, model evaluation, or data annotation work.
- Experience writing or editing high-quality medical or healthcare-related content.
- Experience in clinical documentation, charting, or patient communication, including explaining medical information to non-clinical audiences.
- Familiarity with evaluation rubrics, benchmarks, or quality scoring systems.
Company Overview
Industry: Healthcare Technology
Company Size: 500-1,000 employees
Founded: 2015
Headquarters: San Francisco, CA
About the Company
Leading healthcare technology company focused on improving patient outcomes through innovative digital solutions. We're transforming the way healthcare is delivered with cutting-edge technology and data-driven insights. Our platform serves over 10,000 healthcare professionals and has processed millions of patient interactions.
Recent News & Updates
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: Healthcare AI Evaluator
Type: Contract
Compensation: $189/hour
Location: Remote
Application Process
- Upload resume
- AI interview based on your resume
- Submit form
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
- For any help or support, reach out to: [email protected]