AI Prompt & Agent Developer

Recruiting From Scratch

San Francisco, California

JOB DETAILS
SKILLS
Analysis Skills, Artificial Intelligence (AI), Artificial Intelligence (AI) Agents, Automation, Best Practices, Call Centers, Cognitive Science, Communication Skills, Continuous Improvement, Cross-Functional, Customer Experience, Customer Relations, Customer/Consumer Behavior, Data Sets, Design Evaluation, Detail Oriented, Healthcare, Legal, Linguistics, Machine Tool, Metrics, Onboarding, Organizational Skills, Performance Management, Philosophy, Production Control, Python Programming/Scripting Language, Quality Management, Reporting Dashboards, Seed Funding, Software Engineering, Speech Recognition, Speech Technology, Startup, Systems Administration/Management, Telephony, Test Harness, Voice Applications, Writing Skills
LOCATION
San Francisco, California
POSTED
8 days ago

AI Prompt & Agent Developer

Location: San Francisco, CA
Company Stage of Funding: Seed Stage AI Startup ($6M Raised)
Office Type: Onsite (5 Days Per Week)
Salary: $90,000–$130,000 + Competitive Equity

Company Description

We're representing a fast-growing AI startup building enterprise-grade voice AI agents for the healthcare industry. Rather than relying on third-party AI infrastructure, the company has developed a proprietary stack spanning speech models, agent orchestration, and real-time voice systems to deliver highly reliable AI call center solutions for medical practices.

With enterprise customers already deploying the platform at scale, the team is focused on continuously improving agent behavior through prompt engineering, evaluation systems, and production experimentation. As one of the earliest members of the AI team, you'll directly influence how production AI agents interact with thousands of real users every day.

What You Will Do

  • Design, write, and maintain production prompts that power enterprise voice AI agents.
  • Own key components of agent behavior, including intent classification, information extraction, scheduling workflows, objection handling, and edge-case recovery.
  • Analyze production conversations daily to identify failure modes and continuously improve agent performance.
  • Build evaluation datasets, automated testing frameworks, and prompt optimization pipelines to validate behavioral improvements before deployment.
  • Develop and refine human-in-the-loop onboarding workflows that configure AI agents for new healthcare customers.
  • Design evaluation metrics that measure booking rates, automation success, and overall customer experience.
  • Collaborate closely with engineering teams to improve internal tooling, dashboards, and agent development workflows.
  • Conduct structured experiments to validate prompt changes using production data and measurable outcomes.
  • Help scale AI systems supporting millions of healthcare conversations while maintaining reliability and quality.
  • Continuously improve AI behavior through data-driven iteration and production monitoring.

Ideal Background

  • 2+ years of professional experience in prompt engineering, conversational AI, or AI agent development.
  • Experience owning production prompt behavior for customer-facing AI systems.
  • Strong analytical skills with the ability to evaluate large datasets and identify behavioral improvements.
  • Excellent written communication and language intuition with strong attention to detail.
  • Familiarity with voice AI technologies, speech recognition (ASR), text-to-speech (TTS), or conversational AI platforms.
  • Comfortable reading Python and TypeScript code while collaborating closely with engineering teams.
  • Strong organizational skills with the ability to manage multiple experiments and deployments simultaneously.
  • Experience working in fast-paced startup environments with rapid iteration cycles.
  • Excellent cross-functional communication skills and customer-focused thinking.

Preferred

  • Experience building production AI agents serving thousands of users.
  • Background in linguistics, philosophy, law, cognitive science, or other language- and logic-intensive disciplines.
  • Experience with prompt evaluation frameworks, automated testing, or LLM optimization techniques.
  • Familiarity with telephony platforms such as Twilio or other voice infrastructure.
  • Experience working with healthcare workflows or enterprise AI products.
  • Strong understanding of LLM behavior, prompt engineering best practices, and AI evaluation methodologies.
  • Comfortable leveraging AI coding tools to accelerate development and experimentation.
  • Passion for building reliable, production-quality AI systems through continuous empirical improvement.

Compensation and Benefits

  • Base salary: $90,000–$130,000.
  • Competitive equity package.
  • Five-day onsite collaboration in San Francisco.
  • Opportunity to join a rapidly growing AI startup building proprietary voice AI infrastructure.
  • Significant ownership over production AI agent behavior and customer experience.
  • Direct collaboration with founders and a small, highly technical engineering team.
  • Paid take-home exercise during the interview process.
  • Opportunity to help shape next-generation conversational AI systems deployed across enterprise healthcare customers.
 
 
 

About the Company

R

Recruiting From Scratch