Get AI-powered advice on this job and more exclusive features.
This range is provided by Call For Referral. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.
Base pay range
$20.00/hr - $20.00/hr
Direct message the job poster from Call For Referral
Helping AI Startups Hire 0→1 Founding Engineers & Product Talent | Backend & Infra | US & Europe
Hourly Contract | Part-Time | Remote $20 per hour
About the Role
A
leading AI research initiative , in partnership with
Mercor , is hiring
Audio Model Trainers
to support the development of next-generation multimodal AI systems.
In this role, you will record short, high-quality audio clips that describe visual content — helping train AI models to better understand and connect
spoken language with visual information . This project plays a key part in shaping the next generation of AI that listens, sees, and speaks naturally.
Key Responsibilities
View a series of images and record
clear, natural-sounding spoken descriptions .
Create
short audio clips (2–3 minutes each)
using provided tools or platforms.
Maintain
high audio quality , ensuring recordings are free from noise or distortion.
Follow detailed
linguistic, timing, and stylistic guidelines
provided by the research team.
Collaborate asynchronously with AI researchers and QA specialists to ensure data precision and quality.
Ideal Qualifications
Excellent
verbal clarity, diction, and pronunciation .
Native or near-native fluency in English
(additional languages are a plus).
High attention to detail and ability to follow structured annotation guidelines.
Prior experience in
voice recording, data labeling, or content moderation
is a plus (not required).
Comfortable working
independently and asynchronously .
What You’ll Gain
Opportunity to contribute to
foundational AI research
with a global leader.
Hands-on experience in
multimodal AI systems
(audio, vision, and language).
Flexible,
remote-friendly schedule
— work on your own time.
Compensation & Contract
Rate:
$20 USD/hour
Payments:
Weekly via
Stripe Connect
Application & Onboarding Process
Complete a
15-minute AI-led interview
to assess communication clarity and recording quality.
Fill out a
brief availability form
for scheduling and workload preferences.
Expect follow-up from the
Mercor team within 3–5 business days .
⚡
PS: Mercor team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
⚡
Seniority level
Entry level
Employment type
Part-time
Job function
Customer Service, Administrative, and Writing/Editing
Industries
Movies, Videos, and Sound and Artists and Writers
#J-18808-Ljbffr

Call For Referral is hiring: Voice Actor & Voice Over Artist (Part-Time | $20/hr
Call For Referral, New York, NY, USA
Pay: $20.00/hr
Job type: Contract