- cross-posted to:
- futurism@lemmy.ca
The Mistral 7B Instruct model is a quick demonstration that the base model can be easily fine-tuned to achieve compelling performance. It does not have any moderation mechanism. We’re looking forward to engaging with the community on ways to make the model finely respect guardrails, allowing for deployment in environments requiring moderated outputs.
“Whoops, it’s done now, oh well, guess we’ll have to do it later”
Go fucking directly to jail
The HN crowd are very excited to have a model that is not “woke”:
https://news.ycombinator.com/item?id=37714703
What none of these idiots realize is the reason most big LLM vendors carefully filter what their models output is not because they’re namby-pamby liberals intent on throttling free speech, it’s because headlines like “ChatGPT teaches kids how to make meth with the help of Adolf Hitler” are a fucking nightmare for a business to deal with.
ayup
and, infuriatingly, that’s what makes this mistral play “good” - it gives them free distance, free protection from causal culpability.
research and solutions exist for things like poison pills and traceability… and I’d bet it’s more likely than not that they used none of it.
there are so many gating points where they could’ve gone “hmm, wait”, and they just … didn’t. I am not inclined to believe any of this was done in good faith (whether towards their stated goals or towards societally good outcomes)
(and, given the circles and actions involved, it probably wasn’t really either of those two as target goals anyway)
Ah shit I missed your reply earlier, muh bad
Edit: holy shit at when both the other comment and this one went through. Yay for bad packets.