Applied AI Software EngineerCanvas Medical • San Francisco, CA, US

Applied AI Software Engineer

Canvas Medical • San Francisco, CA, US

16 days ago

Job type

Full-time

Job description

Job Description

Canvas Medical is the electronic medical records (EMR) and payments development platform for healthcare. We build modern, elegant front- and back-end tooling to enable new ways for developers and clinicians to collaborate to solve healthcare’s toughest challenges. Canvas is institutionally backed by some of the greatest technology investors in the world (funded notable health tech companies such as GoodRx, Oscar Health, and Hims & Hers Health).

The Role

We’re hiring an Applied AI Software Engineer to lead evaluations for agents in development and the post-deployment fleet of agents operating in Canvas to automate work for our customers. You will help develop agents in Canvas using state of the art foundation model inference and fine-tuning APIs along with our server-side SDK. The server-side SDK provides extensive tools and virtually all the context necessary for excellent agent performance. You’ll be responsible for designing and running rigorous evaluation experiments that measure performance, safety, and reliability across a wide variety of clinical, operational, and financial use cases.

This role is ideal for someone with deep experience evaluating LLM-based agents at scale. You’ll create high-fidelity unit evals and end-to-end evaluations, define expert-determined ground truth outcomes, and manage iterations across model variants, prompts, tool use, and context window configurations. Your work will directly inform model selection, fine-tuning, and go / no-go decisions for AI features used in production settings.

You’ll collaborate with product, ML engineering, and clinical informatics teams to ensure that Canvas's AI agents are not only capable, but trustworthy and robust under real-world healthcare constraints. You will also work with technical product marketers and developer advocates to help our broader developer community and the broader market understand the uniquely differentiated value of agents in Canvas.

Who You Are

You have extensive hands-on experience evaluating LLM-based systems, including multi-agent architectures and prompt-based pipelines.
You are deeply familiar with foundation model APIs (OpenAI, Claude, Gemini, etc.) and how to systematically benchmark agent performance using those models in applied settings.
You care about correctness and reproducibility and have built or contributed to frameworks for automated evals, annotation pipelines, and experiment tracking.
You bring structure to ambiguity and know how to define “correctness” in complex, nuanced domains.
You are comfortable collaborating across engineering, product, and clinical subject matter experts.
You are not afraid of complexity and are energized by the rigor required in healthcare deployments.

What You’ll Do

Design and execute large-scale evaluation plans for LLM-based agents performing clinical documentation, scheduling, billing, communications, and general workflow automation tasks.

Build end-to-end test harnesses that validate model behavior under different configurations (prompt templates, context sources, tool availability, etc.).

Partner with clinicians to define accurate expected outcomes (gold standard) for performance comparisons in domains of clinical consequence, and partner with other subject matter experts in other non-clinical domains.

Run and replicate experiments across multiple models, parameters, and interaction types to determine optimal configurations.

Deploy and maintain ongoing sampling for post-deployment governance of agent fleets.

Analyze results and summarize tradeoffs in clarity for product and engineering stakeholders, as well as for technical stakeholders among our customers and the broader market.

Take ownership over internal eval tooling and infrastructure, ensuring speed, rigor, and reproducibility.

Identify and recommend candidates for reinforcement fine-tuning or retrieval augmentation based on gaps identified in evals.

What Success Looks Like at 90 Days

An expanded set of robust evaluation suites exists for all major AI features currently in development and in production.

We have well-defined correctness criteria for each workflow and a reliable source of expert-determined outcome objects.

Product and engineering teams have integrated your evaluation tools into their daily workflows.

Evaluation results are clearly documented and reproducible, enabling trust in the performance trajectory.

Your have effectively engaged your marketing counterparts to translate your work into key messages to the market and to Canvas customers.

Qualifications

5+ years of experience in applied machine learning or AI engineering, with a focus on evaluation and benchmarking.

Proficiency with foundation model APIs and experience orchestrating complex agent behaviors via prompts or tools.

Experience designing and running high-throughput evaluation pipelines, ideally including human-in-the-loop or expert-labeled benchmarks.

Superlative Python engineering skills and familiarity with experiment management tools and data engineering toolsets in general including, yes, SQL and database management.

Familiarity with clinical or healthcare data is a strong plus.

Experience with reinforcement fine-tuning, model monitoring, or RLHF is a plus.

Research shows that women and other minority groups might avoid applying if they don’t meet 100% of the qualifications. We encourage you to apply even if you don’t meet everything listed in the job posting.

We are a mostly remote, distributed team. We encourage people to do their work when and where they perform at their best. Because of this structure, strong written communication skills, time management skills, and personal accountability are very important to us.

Employee Benefits :

Competitive Salary & Equity Package

Health Insurance

Home Office Stipend

401k

Paid Maternity / Paternity Leave (12 weeks)

Flexible / unlimited PTO

Canvas Medical provides equal employment opportunities to all employees and applicants for employment without regard to race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Create a job alert for this search

Applied Ai Engineer • San Francisco, CA, US

Related jobs

Applied AI Engineer

VirtualVocations • Santa Clara, California, United States

Full-time

A company is looking for an Applied AI Engineer to develop and deploy AI models and hybrid AI frameworks.Key Responsibilities Develop and deploy AI models and hybrid AI frameworks Conduct resear...Show more

Last updated: 30+ days ago • Promoted

Software Engineer in AI

VirtualVocations • Santa Clara, California, United States

Full-time

A company is looking for a Laureate Software Engineer, Gen AI.Key Responsibilities Assist in driving the long-term architectural strategy for AI and LLM adoption Design and build foundational co...Show more

Last updated: 5 days ago • Promoted

AI Application Engineer

VirtualVocations • Santa Clara, California, United States

Full-time

A company is looking for an AI Application Engineer.Key Responsibilities : Design and build AI-powered user features for sports insights Prototype and iterate AI behaviors using prompting and too...Show more

Last updated: 30+ days ago • Promoted

AI / ML Software Engineer

VirtualVocations • Fremont, California, United States

Full-time

A company is looking for an AI / ML Software Engineer.Key Responsibilities Rapidly prototype product features to explore new AI capabilities while writing clean, understandable code Deliver genera...Show more

Last updated: 30+ days ago • Promoted

High Performance AI Engineer

VirtualVocations • Fremont, California, United States

Full-time

A company is looking for a High Performance AI Engineer to build groundbreaking multi-agent systems for the CUDA ecosystem. Key Responsibilities Design, build, and optimize agentic AI systems for ...Show more

Last updated: 4 days ago • Promoted

AI Integration Engineer

VirtualVocations • Concord, California, United States

Full-time

A company is looking for an AI Integration Engineer - ServiceNow / Workday.Key Responsibilities Lead deployment and configuration of ServiceNow / Workday connectors Translate HR / IT / Facilities use ca...Show more

Last updated: 30+ days ago • Promoted

Lead AI Engineer

VirtualVocations • Concord, California, United States

Full-time

A company is looking for a Lead AI Engineer (Remote).Key Responsibilities : Define the AI-first vision and technical roadmap, owning the design and architecture of AI features Leverage expertise ...Show more

Last updated: 30+ days ago • Promoted

Staff AI Engineer

VirtualVocations • Fremont, California, United States

Full-time

A company is looking for a Staff AI Engineer.Key Responsibilities Build 0-1 AI systems to transform core brokering workflows and manage client renewals Embed with brokering teams to identify aut...Show more

Last updated: 30+ days ago • Promoted

Applied AI Engineer, Digital Employee Experience

Planet Labs PBC • San Francisco, CA, United States

Full-time

We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...Show more

Last updated: 25 days ago • Promoted

Prompt Engineer for AI Systems

VirtualVocations • Concord, California, United States

Full-time

A company is looking for a Prompt Engineer specializing in Data Science and Quality Analysis.Key Responsibilities Develop, test, and refine prompts for various AI tasks to optimize performance D...Show more

Last updated: 30+ days ago • Promoted

Applied AI Engineer, Enterprise GenAI

Scale AI, Inc. • San Francisco, CA, United States

Full-time

AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, ...Show more

Last updated: 30+ days ago • Promoted

AI Agent Developer

VirtualVocations • Fremont, California, United States

Full-time

A company is looking for an AI Agent Developer.Key Responsibilities Architect and implement AI-powered agents using Large Language Models (LLMs) Connect agents to internal systems and databases,...Show more

Last updated: 30+ days ago • Promoted

Full Stack AI Engineer

VirtualVocations • Concord, California, United States

Full-time

A company is looking for a Full Stack AI Engineer (Python, AWS) to join their AI Operations team.Key Responsibilities Build production deployment pipelines for AI models in AWS Bedrock and SageMa...Show more

Last updated: 30+ days ago • Promoted

AI Software Engineer

VirtualVocations • San Francisco, California, United States

Full-time

A company is looking for an AI Software Engineer to develop generative AI applications for digital learning platforms.Key Responsibilities Develop and maintain reliable, scalable, and secure AI-p...Show more

Last updated: 30+ days ago • Promoted

Applied AI Engineer Consultant

VirtualVocations • Oakland, California, United States

Full-time

A company is looking for an Applied AI Engineer Consultant.Key Responsibilities Build production-ready Intelligent Automations and Agentic AI solutions for enterprise clients Guide clients in de...Show more

Last updated: 30+ days ago • Promoted

AI Engineer

VirtualVocations • Santa Clara, California, United States

Full-time

A company is looking for an AI Engineer to join a global medical technology company.Key Responsibilities Building production-ready agentic AI systems Designing and implementing CI / CD pipelines f...Show more

Last updated: 30+ days ago • Promoted

AI Tools Engineering Expert

VirtualVocations • Concord, California, United States

Part-time

A company is looking for a Part-Time Industry Expert in AI Tools for Engineering.Key Responsibilities Consult on curriculum structure and project portfolio to align with industry best practices ...Show more

Last updated: 3 days ago • Promoted

Senior AI Software Engineer

VirtualVocations • Santa Clara, California, United States

Full-time

A company is looking for a Senior / Staff AI Software Engineer.Key Responsibilities Design and develop robust, scalable, event-driven services using Python, FastAPI, Apache Kafka, and GraphQL Bu...Show more

Last updated: 30+ days ago • Promoted