Hi!

Today is LLM day: a day when I will share only research, tools, and news on large language models.

But what are LLMs? Well, they are deep learning algorithms that can recognize, summarize, translate, predict, and generate content.

LLMs are trained on immense amounts of data, making them capable of understanding and generating natural language, images, code, and videos.

The best known is ChatGPT, but I’ll share some other LLMs with you, along with what I think each one does best.

Are you ready? Here we go.

LLMs

🔬 Methods

Data: 2,133 vignettes from the Human Diagnosis Project.

Participants: 

  • Licensed Physicians

  • 5 large language models (LLMs): Claude 3 Opus, GPT‑4, Gemini Pro, Mistral, and Llama 2.

Comparison groups:

  • Physician alone

  • Physician teams

  • Individual LLMs

  • LLM ensembles

  • Physicians + LLMs
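For readers curious how an "ensemble" or a physician + LLM "collective" can arrive at a single answer, here is a minimal Python sketch. It assumes a simple plurality vote over free-text diagnoses; this is an illustration only, not the aggregation method used in the study, and the function name `plurality_vote` and the example answers are hypothetical.

```python
from collections import Counter


def plurality_vote(diagnoses):
    """Return the most frequent diagnosis (case-insensitive).

    Ties are broken in favor of the diagnosis that appears first in the list.
    """
    if not diagnoses:
        raise ValueError("at least one diagnosis is required")
    normalized = [d.strip().lower() for d in diagnoses]
    counts = Counter(normalized)
    top = max(counts.values())
    for d in normalized:
        if counts[d] == top:
            return d


# Hypothetical answers for one vignette (made up, not data from the study).
llm_answers = ["Pulmonary embolism", "pneumonia", "pulmonary embolism"]
physician_answer = "pulmonary embolism"

# LLM-only ensemble: vote across the models.
llm_ensemble = plurality_vote(llm_answers)

# Hybrid collective: the physician's answer is one more vote.
hybrid = plurality_vote([physician_answer] + llm_answers)

print(llm_ensemble, "|", hybrid)
```

Listing the physician first means that, under this toy rule, a tie goes to the physician's answer. A real system would also need to match synonyms and weigh ranked differential diagnoses, which this sketch does not attempt.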

📊 Results

Accuracy of diagnosis:

  • Physicians alone: 68.3%

  • Physician teams: 75%

  • LLM alone:

    • Claude 3 Opus: 72.1%

    • GPT-4: 71.6%

    • Gemini Pro: 69.2%

    • Mistral/Llama 2: <65%

  • LLM ensembles: 74.8%

  • Physician teams + LLM ensemble: 79.8%

  • Physicians alone + LLMs: 80.4% (p<0.001)

Error correction:

  • Physicians corrected 58.7% of LLM mistakes

  • LLMs corrected 61.3% of physician errors

🔑 Key Takeaways

  • Physician + LLM collectives outperformed all other groups.

  • Best-performing LLMs: Claude 3 Opus and GPT-4.

  • LLMs covered gaps where physicians make mistakes, and vice versa.

  • This research supports collaborative workflows in clinical settings.

🔗Zöller N, Berger J, Lin I, et al. Human–AI collectives most accurately diagnose clinical vignettes. Proc Natl Acad Sci U S A. 2025;122(24):e2426153122. doi:10.1073/pnas.2426153122

🦾TechTools

There are several LLMs; some are general-purpose, and others are designed for clinical use.

Today, I’ll start with three general-purpose ones: ChatGPT, Claude, and Manus (👉 click the name to test them).

  • The most well-known LLM is great for everyday tasks.

  • Versatile and conversational.

  • Good for creative writing and generating images.

  • It’s fast, which makes your work easier.

  • Great at summarizing and analysing long documents (upload a paper and ask questions about it).

  • Great for long, more professional writing, deep thinking, and clarity.

  • It spots ethical risks, and it’s fast too.

  • Good for complex medical cases.

  • Best for researching medical information that requires accurate references.

  • Can manage complex tasks and workflows (without having to explain every step).

  • Good at deep reasoning.

  • Great for automation and integration with other apps.

That’s all for now.

Would you like me to continue writing this email every week? You can hit reply with your answer.

If you know people in healthcare who would like to get updates on LLM news, feel free to share it by:

↪️ Forwarding this email, or 📲 copying this link and sending it from your phone.

Thank you!

Until next Wednesday.

Itzel Fer, MD PM&R

Follow me on LinkedIn | Substack | X | Instagram
