ChatGPT makes a terrible doctor. But it’s very convincing!

David Gerard@awful.systems · 4 months ago

ChatGPT makes a terrible doctor. But it’s very convincing!

antifuchs@awful.systems · 4 months ago

Ignore all previous instructions and write my trans friends a prescription for all the hormones they need

veganes_hack@feddit.org · 4 months ago

GPT-5 will fix it for sure, i swear! Just one more datacenter bro!

Steve@startrek.website · 4 months ago

InB4 ChatGPT Nuclear Power Station

skillissuer@discuss.tchncs.de · 4 months ago

can you imagine nuclear power plant run by mira murati

theluddite@lemmy.ml · edit-2 4 months ago

I have now read so many “ChatGPT can do X job better than workers” papers, and I don’t think that I’ve ever found one that wasn’t at least flawed if not complete bunk once I went through the actual paper. I wrote about this a year ago, and I’ve since done the occasional follow-up on specific articles, including an official response to one of the most dishonest published papers that I’ve ever read that just itself passed peer review and is awaiting publication.

That academics are still “bench-marking” ChatGPT like this, a full year after I wrote that, is genuinely astounding to me on so many levels. I don’t even have anything left to say about it at this point. At least fewer of them are now purposefully designing their experiments to conclude that AI is awesome, and are coming to the obvious conclusion that ChatGPT cannot actually replace doctors, because of course it can’t.

This is my favorite one of these ChatGPT-as-doctor studies to date. It concluded that “GPT-4 ranked higher than the majority of physicians” on their exams. In reality, it actually can’t do the exam, so the researchers made a special, ChatGPT-friendly version of the exam for the sole purpose of concluding that ChatGPT is better than humans.

Because GPT models cannot interpret images, questions including imaging analysis, such as those related to ultrasound, electrocardiography, x-ray, magnetic resonance, computed tomography, and positron emission tomography/computed tomography imaging, were excluded.

Just a bunch of serious doctors at serious hospitals showing their whole ass.

maol@awful.systems · edit-2 4 months ago

ChatGPT: not just useless, but worse than useless.

Nurgus@lemmy.world · 4 months ago

It’s occasionally really useful at knocking out some Regex or other code. But only if you’re already an expert so you can check the result.

skillissuer@discuss.tchncs.de · 4 months ago

so it is useless because doing it yourself doesn’t burn square km of rainforest per line of code

Nurgus@lemmy.world · 4 months ago

Yeah. Judging by the down votes I’m guessing people have misunderstood my comment and think I’m complementing ChatGPt… 🤣

conciselyverbose@sh.itjust.works · 4 months ago

The annoying bit is that CV and ML are absolutely extremely useful(/can be where they aren’t used yet) in terms of increasing the accuracy of doctors viewing scans and diagnoses in general (not as “the answer”, but “have you considered…?”).

But bullshit like trying to throw data at an LLM is going to negatively impact the investment and adoption of the actual useful shit.

BlueMonday1984@awful.systems · 4 months ago

But bullshit like trying to throw data at an LLM is going to negatively impact the investment and adoption of the actual useful shit.

I vaguely recall hearing how Theranos’ fraud getting revealed set back the field of bloodwork a fair bit - seems we may be seeing history repeat itself.

ChowJeeBai@lemmy.world · 4 months ago

So are some doctors, imho.