Apply

Mercor

Healthcare Professional

Warm Intro

United States +1

Remote

Contract

Mid-Senior

January 07, 2026

$189/hr

This role involves writing and refining prompts to guide model behavior in healthcare scenarios and evaluating large language model (LLM)-generated responses for accuracy, reasoning, clarity, and completeness. The evaluator will fact-check medical claims, annotate model responses for strengths and inaccuracies, and assess tone and appropriateness for real-world healthcare use while applying consistent evaluation standards.

Key Responsibilities

• Write and refine prompts to guide model behavior in healthcare scenarios.
• Evaluate LLM-generated responses to healthcare queries for accuracy, reasoning, clarity, and completeness.
• Conduct fact-checking of all medical and healthcare claims using trusted public sources and authoritative references.
• Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
• Assess tone, completeness, and appropriateness of responses for real-world healthcare use.
• Apply consistent evaluation standards by following clear taxonomies, benchmarks, and detailed evaluation guidelines.

Required

• 5+ years of real-world professional experience in Healthcare, supported by an associated expert degree (e.g., MD, DO, RN, NP, PA, PharmD, MPH, or equivalent).
• Experience in one or more of the following sub-domains: General Clinical Care, Specialty Medicine or Surgery, Diagnostics, Imaging & Laboratory Medicine, Public Health, Healthcare Systems & Administration.
• Significant experience using large language models (LLMs) and understanding of their use.
• Excellent writing communication skills for complex medical topics.
• Strong attention to detail and comfort in evaluating clinical reasoning and medical explanations.

Preferred

• Prior experience with RLHF, model evaluation, or data annotation work.
• Experience writing or editing high-quality medical or healthcare-related content.
• Experience in clinical documentation, charting, or patient communication, including explaining medical information to non-clinical audiences.
• Familiarity with evaluation rubrics, benchmarks, or quality scoring systems.

Company Overview

Industry: Healthcare Technology

Company Size: 500-1,000 employees

Founded: 2015

Headquarters: San Francisco, CA

Company Links

LinkedIn Profile Crunchbase Glassdoor Reviews

Key Contacts

Contact information not available

About the Company

Leading healthcare technology company focused on improving patient outcomes through innovative digital solutions. We're transforming the way healthcare is delivered with cutting-edge technology and data-driven insights. Our platform serves over 10,000 healthcare professionals and has processed millions of patient interactions.

Recent News & Updates

Series B Raised $50M Series B funding - Jan 2024

Award Named "Best Healthcare Startup" by TechCrunch - Dec 2023

Growth Expanded to 5 new states - Nov 2023

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Healthcare AI Evaluator

Type: Contract

Compensation: $189/hour

Location: Remote

Role Responsibilities

Write and refine prompts to guide model behavior in healthcare scenarios.
Evaluate LLM-generated responses to healthcare queries for accuracy, reasoning, clarity, and completeness.
Conduct fact-checking of all medical and healthcare claims using trusted public sources and authoritative references.
Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
Assess tone, completeness, and appropriateness of responses for real-world healthcare use.
Apply consistent evaluation standards by following clear taxonomies, benchmarks, and detailed evaluation guidelines.

Qualifications

Must-Have

5+ years of real-world professional experience in Healthcare, supported by an associated expert degree (e.g., MD, DO, RN, NP, PA, PharmD, MPH, or equivalent).
Experience in one or more of the following sub-domains: General Clinical Care, Specialty Medicine or Surgery, Diagnostics, Imaging & Laboratory Medicine, Public Health, Healthcare Systems & Administration.
Significant experience using large language models (LLMs) and understanding of their use.
Excellent writing communication skills for complex medical topics.
Strong attention to detail and comfort in evaluating clinical reasoning and medical explanations.

Preferred

Prior experience with RLHF, model evaluation, or data annotation work.
Experience writing or editing high-quality medical or healthcare-related content.
Experience in clinical documentation, charting, or patient communication, including explaining medical information to non-clinical audiences.
Familiarity with evaluation rubrics, benchmarks, or quality scoring systems.

Application Process (Takes 20–30 mins to complete)

Upload resume
AI interview based on your resume
Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

,

Keep track of your job search

Save personal notes for each job to track your thoughts, application status, and follow-ups.

Try for free

Mercor

Key Responsibilities

Required

Preferred

Company Overview

Company Links

Key Contacts

About the Company

Recent News & Updates

Keep track of your job search

Upload your resume

Sign up required

Join or sign in

Report issue

Job Actions