Academic Research, Artificial Intelligence (AI), Benchmarking, Biology, Chemistry, Civil Engineering, Communication Skills, Computer Science, Data Analysis, Data Modeling, Data Science, Electricity, Machine Learning, Material Science, Mathematics, Mechanical Engineering, Physics, Research Laboratory, Software Engineering, Statistical Modeling, Statistics, Training Data Sets, Writing Skills
LOCATION
New York, New York
POSTED
10 days ago
About the job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: Expert Professionals — AI & Data Science Type:Contract Compensation:$70–$100/hour Location:Remote Commitment:40 hours/week
Role Responsibilities
Guide research and engineering teams to close knowledge gaps in AI and data science domains. Surface nuances that distinguish expert-level work from surface-level reasoning.
Design challenging agentic tasks rooted in real-world ML, data science, data engineering, and software workflows. Write accurate, well-documented solutions that serve as ground truth.
Evaluate AI agent outputs against your solutions. Provide detailed written feedback capturing correctness, efficiency, and reasoning quality.
Develop and refine evaluation frameworks and rubrics for assessing agentic behavior on AI and data science tasks.
Collaborate with other subject matter experts to ensure consistency and accuracy in training data.
Qualifications
Must-Have
3+ years of research, academic, or industry experience in Machine Learning, Data Science, Software Engineering, Computer Science, Statistics, Biology, Electrical/Mechanical/Civil Engineering, Physics, Chemistry, Mathematics, Materials Science, or other STEM background.
Demonstrated technical expertise in programming, data analysis, ML modeling, statistical methods, or computational methods.
Ability to commit to 40 hours per week during weekdays for the duration of the engagement.
Strong written communication skills and the ability to explain technical decisions clearly.
Preferred
Prior experience with data annotation, labeling, evaluation, or human feedback collection.
Experience with LLMs, AI systems, or agentic workflows; familiarity with agentic frameworks.
Application Process (Takes 20–30 mins to complete)
Upload resume
AI interview based on your resume
Submit form
Resources & Support
For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
For any help or support, reach out to: support@mercor.com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.