Apply: Stanford AI in Healthcare Leadership & Strategy Program (May–June 2026)

Research

Explore our publications and preprints advancing healthcare through rigorous AI evaluation.

Nature Medicine
Feb 3, 2026

Scaling medical AI across clinical contexts

Medical artificial intelligence (AI) tools, including clinical language models, vision–language models and multimodal health record models, are used to summarize […]

NPJ Health Systems
Feb 2, 2026

Uses of generative AI by non-clinician staff at an academic medical center

Large language model (LLM) chat tools have the potential to transform healthcare workflows by improving efficiency and reducing administrative burdens. […]

Journal of General Internal Medicine
Jan 21, 2026

“I Double Checked It with My Own Knowledge:” Physician Perspectives on the Use of AI Chatbots for Clinical Decision-Making

AI chatbots are proliferating in healthcare systems. It is essential to explore how physicians use these tools in order to […]

Journal of General Internal Medicine
Jan 20, 2026

Elevating Management Reasoning to Preserve Professional Identity in the AI Era

AI systems are rapidly approaching expert-level diagnostic reasoning; however, management reasoning —the art of translating diagnoses into personalized care—remains distinctly […]

Nature Medicine
Jan 20, 2026

Holistic evaluation of large language models for medical tasks with MedHELM

While large language models (LLMs) achieve near-perfect scores on medical licensing exams, these evaluations inadequately reflect the complexity and diversity […]

The BMJ
Dec 12, 2025

Parallel pressures: the common roots of doctor bullshit and large language model hallucinations

The phenomenon of doctors presenting unfounded statements with unwavering arrogance—colloquially known as “bullshit”—has long been recognised in medical practice. In […]

NPJ Digital Medicine
Dec 5, 2025

A typology of physician input approaches to using AI chatbots for clinical decision-making

This study explores how physicians approach using LLM chatbots during clinical reasoning tasks and whether the amount of clinical case […]

Preprint
Dec 1, 2025

First, do NOHARM: towards clinically safe large language models

Large language models (LLMs) are routinely used by physicians and patients for medical advice, yet their clinical safety profiles remain […]

Preprint
Oct 28, 2025

BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text

Large language models (LLMs) hold great promise for medical applications and are evolving rapidly, with new models being released at […]

NPJ Digital Medicine
Oct 24, 2025

Artificial intelligence for autoimmune diseases

Emerging evidence suggests generative artificial intelligence (AI) may offer potential for autoimmune and rheumatic disease care, moving beyond traditional narrow […]

Latest News

View all

Get the latest on our studies, grant awards, and media coverage.