Performance of 5 AI Models on United States Medical Licensing Examination Step 1 Questions: Comparative Observational Study

Performance of 5 AI Models on United States Medical Licensing Examination Step 1 Questions: Comparative Observational Study

Artificial intelligence (AI) models are increasingly being used in medical education. Although models like ChatGPT have previously demonstrated strong performance on United States Medical Licensing Examination (USMLE)–style questions, newer AI tools with enhanced capabilities are now available, necessitating comparative evaluations of their accuracy and reliability across different medical domains and question formats.

Medigy Insights

Large language models show strong performance on USMLE Step 1–style medical questions, but accuracy varies across models and question types, highlighting both their potential and current limitations in clinical knowledge.



Next Article

Did you find this useful?

Medigy Innovation Network

Connecting innovation decision makers to authoritative information, institutions, people and insights.

Medigy Logo

The latest News, Insights & Events

Medigy accurately delivers healthcare and technology information, news and insight from around the world.

The best products, services & solutions

Medigy surfaces the world's best crowdsourced health tech offerings with social interactions and peer reviews.


© 2026 Netspective Foundation, Inc. All Rights Reserved.

Built on Mar 27, 2026 at 12:44pm