As AI capabilities advance in complex medical scenarios that doctors face on a daily basis, the technology remains controversial in medical communities.

9 points

Details matter here. Here’s a few from the study:

ChatGPT achieved an overall accuracy of 71.7% (95% CI 69.3%-74.1%) across all 36 clinical vignettes. The LLM demonstrated the highest performance in making a final diagnosis with an accuracy of 76.9% (95% CI 67.8%-86.1%) and the lowest performance in generating an initial differential diagnosis with an accuracy of 60.3% (95% CI 54.2%-66.6%). Compared to answering questions about general medical knowledge, ChatGPT demonstrated inferior performance on differential diagnosis (β=–15.8%; P<.001) and clinical management (β=–7.4%; P=.02) question types.

At the time of the study, 36 vignette modules were available on the web, and 34 of the 36 were available on the web as of ChatGPT’s September 2021 training data cutoff date. All 36 modules passed the eligibility criteria of having a primarily textual basis and were included in the ChatGPT model assessment.

All questions requesting the clinician to analyze images were excluded from our study, as ChatGPT is a text-based AI without the ability to interpret visual information.

permalink
report
reply
-3 points

I don’t use ChatGPT and ain’t planning to, but someone should try asking it something like…

“How often should a male change their tampon?”

See what, if any nonsense it regurgitates.

permalink
report
reply
1 point

Men that menstruate exist.

permalink
report
parent
reply
1 point

Now I’m no expert, but if you’re bleeding from your penis or your anus, you should probably go see a doctor about that.

permalink
report
parent
reply
0 points

I’m no expert

It shows.

permalink
report
parent
reply
9 points

“Men do not typically use tampons since they are designed for menstruation, which is a female biological process. If you have specific questions about personal hygiene or healthcare, it’s best to consult with a medical professional who can provide guidance based on your individual needs and circumstances.”

permalink
report
parent
reply
3 points

Okay then, well go figure. I was guessing it would puke up some nonsense, but apparently not.

permalink
report
parent
reply
4 points

You know it’s free to use right? You can play around with it yourself and see what it can do

permalink
report
parent
reply
20 points
*
Deleted by creator
permalink
report
reply
4 points

Sadly, that percentage is probably better than a good number of doctors.

permalink
report
reply
35 points

I mean if the AI takes women seriously then that’s honestly already better than most of the doctors I’ve had

permalink
report
reply
0 points

Or if it’s better at diagnosing minorities, too.

permalink
report
parent
reply
3 points

It almost certainly won’t, but it’s nice to hope.

It might remove the face to face human bias of a GP but it doesn’t make up for the decades of preconceived or absent research about women or minorities.

permalink
report
parent
reply
23 points

Unfortunately, if the data is biased, the model is biased.

permalink
report
parent
reply
10 points

Yes, that’s something that’s constantly emphasized in scientific research. You might have the most infallible algorithm, but… garbage in, garbage out. You’ll still get garbage data if what you enter into the algorithm is garbage

permalink
report
parent
reply
7 points

I was about to say…

Wonder what the success rate of doctors is. I’d be surprised if it is above 70% lol

permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 17K

    Monthly active users

  • 12K

    Posts

  • 543K

    Comments