After playing with stable diffusion for a week, these are my conclusions. First of all, 90% of the stable diffusion community should be gulaged, at least. [CW: mentions of pedo shit]

KobaCumTribute [she/her]@hexbear.net · 11 months ago

After playing with stable diffusion for a week, these are my conclusions. First of all, 90% of the stable diffusion community should be gulaged, at least. [CW: mentions of pedo shit]

goose [he/him]@hexbear.net · 11 months ago

It really is a fascinating technology, and the deeper you get into manipulating it step-by-step, the more impressive it is that any of this stuff actually works. There are the super complex ComfyUI setups with conditioning and segmenting and other stuff I don’t understand, and then there’s the new turbo models that can give you a new wacky image as fast as you can type. It’s the craziest toy I’ve ever seen, a hallucinating computer with a whole patchboard full of dials and plugs and cables hidden behind a panel with a giant “make a picture” button that lets you see what you convinced it to dream about.

And then almost everything about the way it’s actually used is depressing as hell

In short:

KobaCumTribute [she/her]@hexbear.net · 11 months ago

There are the super complex ComfyUI setups with conditioning and segmenting and other stuff I don’t understand

That reminds me I should try to get comfyui setup. I’ve just been using auto1111, although that still has weird stuff like cross-attention control which I haven’t even started to fuck with.

the new turbo models that can give you a new wacky image as fast as you can type

Yeah, SDXL Turbo is wild. I’ve heard it’s down to like .1 seconds per image on a high end consumer card, and on my (mid range AMD) card it’s a few seconds. The only downside is the comparative lack of community assets for it (for a variety of reasons), and the lower overall quality than SDXL or SD1.5. Well, that and the fact that SDXL Turbo is apparently proprietary and more strictly controlled than the older models. Still, the fact that a model can generate results as good as that with so few sampling passes is just absurd.

bubbalu [they/them]@hexbear.net · 11 months ago

The excitement around this feels how I imagine the OG modular synths built. Its a lot of people who are good at playing on the surface of the thing, and few truly gifted systems designers.

wtypstanaccount04 [he/him]@hexbear.net · 11 months ago

Modular synths seems like a good comparison here

bubbalu [they/them]@hexbear.net · 11 months ago

I just realized I been misreading your username as ytppostanaccount this whole time @.@

Llituro [he/him, they/them]@hexbear.net · 11 months ago

every time i see something about stable diffusion i confuse it for wavefunction collapse algorithms, and i’m always disappointed that we’re talking about the ai

dinklesplein [any, he/him]@hexbear.net · 11 months ago

based off some preliminary research it feels like a large part of ai porn all looking the same stylistically seems to come down to ai gooners all reusing the same baseline prompt - i would not be surprised if 95% of it contained a common set of keywords in the stablediffusion prompt.

KobaCumTribute [she/her]@hexbear.net · 11 months ago

This is a wild tangent, but for some reason that idea reminds me of the novelization of Myst of all things, where a plot point around the whole “creating worlds by describing them in detail” thing involved the protagonist going into obsessive detail about every minor detail of the setting and being scolded for not being minimalist and exclusively focusing on the functional parts like “there’s air” and “this place is useful and also not on fire or made of poison or some shit like that” by his father who erases the added lines, yielding worlds that are shitty and don’t work right.

For all that it’s a rather on-the-nose allegory for writing and scene setting in general, it’s eerily similar to how stacking the right added details in a prompt can massively impact the entire image, including unrelated parts, in stable diffusion. Like left without them it just sort of fuzzily makes a generic average that might be ok if generic or it might make a limb fold back in on itself, disappear behind a narrow object, and reappear somewhere else entirely like it’s a fucking looney toons gag. But setting up something to painstakingly describe the color and texture of the literal dirt on the ground in the picture can somehow impact and fix the detail and perspective of figures in the scene, like it’s trying to make everything match the intricacy and so not falling into the weird impossible contortion and melting zones.

alexandra_kollontai [she/her]@hexbear.net · 11 months ago

Myst mentioned :D

squirrel [they/them]@hexbear.net · edit-2 11 months ago

🎨🖌️🐿️💭🚫🤖

fanbois [he/him]@hexbear.net · edit-2 11 months ago

For the mentioned reasons, any ai art and it’s generators should be treated the same way as toxic waste. It’s radioactive garbage that should be buried under 120000 tons of rock and salt with a big sign with a skull Infront of it.

It’s a hallucinating computer, but it can’t die and we are giving it the worst drug cocktail imaginable and then tweak whether it needs more fly amanita, gasoline or dried centipede to make the titties just right.

It is the best intersection of human and machine that we are capable off and the results is exclusively the worst of both worlds.

carpoftruth [any, any]@hexbear.net · 11 months ago

Ask ChatGPT to write its own version of the “this is not a place of honour” plaque

RyanGosling [none/use name]@hexbear.net · 11 months ago

I’m pretty sure Stable Diffusion and Midjourney are the dominant models. DALLE is basically irrelevant now because it didn’t allow the model to generate using stolen artwork lol. But I’m assuming OpenAI will cave in and open up those restrictions to compete.

Awoo [she/her]@hexbear.net · 11 months ago

Dalle is considerably better than both imo. It’s massively better at understanding prompts with very complicated requests and the output is rarely a mess.

The problem is that the current implementation on Bing/create is very limited.

Great_Leader_Is_Dead@hexbear.net · 11 months ago

The fuck is “stable diffusion”?

KobaCumTribute [she/her]@hexbear.net · 11 months ago

Open source AI image generator that runs locally on consumer GPUs (best on nvidia, but surprisingly usable even on AMD albeit with worse performance and a bit more work required to make it function). I’ve been using the automatic1111 webui which sets up a local server that you interact with through a browser tab.

bigboopballs [he/him]@hexbear.net · 11 months ago

Does it work faster if you download it?

Awoo [she/her]@hexbear.net · 11 months ago

Depends on your gpu and settings, but generally speaking better gpu will yield much faster results.

kot [they/them]@hexbear.net · 11 months ago

It’s the exact same people involved with NFTs, what did you expect from their latest bazinga fad?

bazingabrain@hexbear.net · 11 months ago

ill just drop this here.

Awoo [she/her]@hexbear.net · edit-2 11 months ago

I’ve played around with this a lot and had pretty much the same experience. Can’t really discuss much of it here.

Still, the fact that something open source and completely uncontrollable has become as good as stable diffusion already is and that there’s every indication it will only continue to be refined and improved on is almost a relief, compared to the alternative of it being exactly the same but also the private and fully enclosed property of corporations run by the literal worst people alive. I really can’t help but take some solace in the fact that open models are competing effectively with the proprietary ones, and may even win out. I sure as hell don’t want see those OpenAI ghouls come out on top, because even if most of the stable diffusion community is irredeemably awful at least some it is just sort of cringe.

One of the primary issues with the community is this problem. The primary motivator of the people working on this shit is the content they’re not allowed to have in the private AIs. This invariably means that they’re a bunch of porn addicts or those that want genuinely illegal content.

Either way this guarantees open AI will continue alongside private AI because private isn’t going to touch this shit with a barge pole.

Pandantic [they/them] · edit-2 11 months ago

Fourth conclusion: stable diffusion is a horrifyingly addictive skinner box that mainlines psychic damage directly into your brain. It’s an infinite gacha machine that you pay for with electricity and time instead of microtransactions. It’s like introverted doomscrolling. It’s so captivating that it’s consumed almost every waking moment of my life for the past week, …

This is something I’m very worried about falling into, and I’ve already have such a draw to it being an artistic tech nerd who can’t draw very well.