floofloof@lemmy.ca to Technology@lemmy.worldEnglish · 1 day agoResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comexternal-linkmessage-square65fedilinkarrow-up1239arrow-down13cross-posted to: cybersecurity@sh.itjust.worksfuck_ai@lemmy.world
arrow-up1236arrow-down1external-linkResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comfloofloof@lemmy.ca to Technology@lemmy.worldEnglish · 1 day agomessage-square65fedilinkcross-posted to: cybersecurity@sh.itjust.worksfuck_ai@lemmy.world
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up3arrow-down15·1 day agothe model does X. The finetuned model also does X. it is not news
minus-squarefloofloof@lemmy.caOPlinkfedilinkEnglisharrow-up8·1 day agoIt’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up1arrow-down8·1 day agowe already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff
the model does X.
The finetuned model also does X.
it is not news
It’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
we already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff