More

sophrocyne · 2025-05-12T14:39:51 1747060791

The USCO report was flawed, biased, and hypocritical. A pre-publication of this sort is also extremely unusual.

https://chatgptiseatingtheworld.com/2025/05/12/opinion-why-t...

ceejayoz · 2025-05-12T14:41:11 1747060871

What in https://chatgptiseatingtheworld.com/about/ says "ah, yes, trustworthy unbiased analysis" to you? Why should I trust this source's opinion?

Pre-publication reports aren't unusual. https://www.federalregister.gov/public-inspection/current

https://www.federalregister.gov/reader-aids/using-federalreg...

> The Federal Register Act requires that the Office of the Federal Register (we) file documents for public inspection at our office in Washington, DC at least one business day before publication in the Federal Register.

sophrocyne · 2025-04-22T16:32:31 1745339551

Time to vibe code the data into "That supa awesome database over there"

sophrocyne · 2025-03-18T21:07:52 1742332072

It is.

https://news.artnet.com/art-world/invoke-snags-first-ai-imag...

sophrocyne · 2025-03-07T00:25:20 1741307120

Free lunch means it doesn’t require training to achieve the results - it solves for it at run time.

You can use this in local generative media tools like Invoke or Comfy, which are both open source and free.

echelon · 2025-03-07T01:05:18 1741309518

Invoke and Comfy are both incredibly complicated tools, and I've recently been put off by both communities.

The r/StableDiffusion folks will go as far as calling those complaining about comfy "PEBKACs", which is a term I hadn't seen since the 00's.

Neither tool is really appropriate for beginners, and frankly they're not even fun to use for extended periods as they're constantly breaking.

If all you're trying to do is generate images, it's not worth installing a universe of broken Python and 100 GB of weights on your machine.

It honestly feels like 2000's-era Linux hobbyists proclaiming the "Year of Linux on the desktop" when interacting with these communities. Their tools are busted, but lots of folks will blame you, the user.

sophrocyne · 2025-03-07T01:57:44 1741312664

For context, I'm Invoke's CEO.

I'm not going to argue that they're not complicated tools - Invoke and Comfy are both actively used to push the boundaries of what can be achieved in professional generative media.

But I'd be curious to hear what issues you ran into with Invoke, and where you found issues with the community -- Or, are you generalizing across SD reddit?

We try to do a pretty good job keeping things welcome, and are nearing 100 videos of educational content to help folks learn how to use GenAI. Open to feedback on where we can promote more accessibility.

prophesi · 2025-03-07T02:38:29 1741315109

The best I've found is using asdf and venv for each of these python ML projects. Install the python version they assume, create and source a virtual env, install their dependencies, then activate the venv's when you boot up a terminal to run them again.

That still doesn't solve the other half of the issues that Python has with dependency management, but this method at least keeps me sane about it.

edit: and also not sure how I'd even start incorporating this into Invoke/Comfy... I've been using the latter but I'm not that comfy with it.

orbital-decay · 2025-03-07T03:09:33 1741316973

Yes, these are largely prototyping tools. I'm not fond of Invoke's "professional" pitch in particular, this workflow would feel alien for any actual creative. However AI CGI is more like 3D CGI in the sense that you have to get technical and understand what you're doing. There are a bit more competent tools in this busted-tool world if you want to get things done, e.g.: https://github.com/Acly/krita-ai-diffusion . It still a nerdy tool and has way more friction than necessary, with all this python hell (it uses Comfy as a backend, although it's somewhat automated). It's far from being as easy as Adobe's Firefly.

vunderba · 2025-03-07T03:39:36 1741318776

Invoke is actually one of the easier tools to use - providing the ability to download model checkpoints in place, relatively easy inpainting, etc.

ComfyUI is definitely for the bleeding edge.

Forge is a good middle ground. With the huge number of tutorials available on YouTube (Olivio's channel is good for novices), I don't think it's that particularly difficult for beginners to grok, but it does require some patience and follow through.

If all you care about is generating some generic looking images for your blog and don't want any flexibility, you can always pay for a subscription to Midjourney.

sophrocyne · 2025-02-10T16:14:29 1739204069

Forgot to post some context.

I'm the CEO at Invoke (i.e., the creator of this piece) -- we're one of the longest running projects for open-source image generation, originating from the initial explosion around Stable Diffusion's release.

We've always focused more heavily on professionals/artists, and are happy to have (finally) been able to get some clarity on these points as we've pushed the USCO to respect where/how human creativity factors into GenAI usage.

You can learn more about us invoke.com (and download the local studio at invoke.com/downloads)

Take care HN.

sophrocyne · on Dec 9, 2024

Hey all. I'm the CEO of Invoke - appreciate everyone who has mentioned us in the thread.

To OP -- We work with professional artists regularly, and I'm seeing things pick up as more begin to understand the potential for creative control. Artists mainly want to be afforded creative flexibility and control, and need an interface that feels natural for their workflow.

Invoke is OSS, we release continued training/education on a weekly basis (free, on YT) and we'll be releasing a simplified installer soon.

sophrocyne · on Oct 3, 2024

Invoke is model agnostic, and supports Flux, including quantized versions.

sophrocyne · on June 24, 2024

The Invoke team released regional guidance using IP Adapter a few months ago, which can use color palettes + style transfer mode, along with text prompts and controlnets.

Would take a look at that for some inspiration -- The UI is Apache 2.0 and used by professional artists. I'd be curious how you think it performs relative to the workflow you've developed.

You're spot on that researchers don't always build the UI that end-users want to use. Always love to see people thinking about the creatives. Good work!

sophrocyne · on March 9, 2024

Some perspectives from someone working in the image space.

These tests don't feel practical - That is, they seem intended to collapse the model, not demonstrate "in the wild" performance.

The assumption is that all content is black or white - AI or not AI - and that you treat all content as equally worth retraining on.

It offers no room for assumptions around data augmentation, human-guided quality discrimination, or anything else that might alter the set of outputs to mitigate the "poison"

jtriangle · on March 9, 2024

As someone also working in the imaging space, ai generated data is useful solong as it's used carefully.

Specifically, we're implementing AI culled training sets which contain some generated data that then gets reviewed manually for a few specific things, then pushed into our normal training workflows. This makes for a huge speedup versus 100% manual culling and the metrics don't lie, the models continue to improve steadily.

There may be a point where they're poisoned and will collapse, but I haven't seen it yet.

MacsHeadroom · on March 9, 2024

This is exactly right. Model collapse does not exist in practice. In fact, LLMs trained on newer web scrapes have increased capabilities thanks to the generated output in their training data.

For example, "base" pretrained models trained on scrapes which include generated outputs can 0-shot instruction follow and score higher on reasoning benchmarks.

Intentionally produced synthetic training data takes this a step further. For SoTA LLMs the majority of, or all of, their training data is generated. Phi-2 and Claude 3 for example.

Bjorkbat · on March 9, 2024

Ironically Claude 3 appears to have certain "quirks" arguably caused by the fact that its training data contains synthetic data. In one instance (https://twitter.com/DimitrisPapail/status/176477229891207585...), it kept referring to itself as ChatGPT.

Granted, one could argue that this only happened because the API version of Claude doesn't appear to use a system prompt. If that's the case, then the LLM lacks any identity otherwise defined by the initial system prompt, and thus, kind of makes one up.

Nonetheless, point remains, it's kind of interesting to see that in the years since the launch of ChatGPT we're already seeing a tangible impact on publicly available training data. LLMs "know" what ChatGPT is, and may even claim to be it.

catchnear4321 · on March 9, 2024

that is the meat the article tries to cook. the impacts so far aren’t all that negative.

but time flows like a river, and the more shit that gets into it…

poison does not need to be immediately fatal to be fatal. some take a frighteningly long time to work. by the time you know what’s happening, not only is it too late, you have already suffered too much.

does this sound like anything more than a scary story to tell around campfires? not yet.

rdedev · on March 9, 2024

Claude 3 does use publically available data. Not everything is synthetically generated. Look at the section for training data in the below link. It has an quote from the paper which states that it uses a mix of public data, data from labelers and synthetic data

https://www.lesswrong.com/posts/JbE7KynwshwkXPJAJ/anthropic-...

I can't find a link to the actual clause paper to verify the above link but a few other places mention the same thing about the training data. We don't know if this improved performance is because of synthetic data or something else. I'm guessing even antropic might not be knowing this too.

coffeebeqn · on March 9, 2024

Wouldn’t reinforcement learning just weigh any nonsense data very low and then spammy garbage doesn’t really affect the model in the end much ? If the model and human experts can’t tell the difference then it’s probably pretty good AI generated data

__loam · on March 9, 2024

Truth and what humans think is true are different things. Synthetic data was created by models that were trained to be convincing.

catchnear4321 · on March 9, 2024

the ideal poison tastes like nothing, or at the very least doesn’t taste bad.

you wouldn’t want to alert the victim.

pavel_lishin · on March 9, 2024

What happens if you train a model on nothing but AI-generated output, recursively? Does it eventually get inbred?

visarga · on March 9, 2024

Why would you limit a model to be like a brain in a vat? Instead let the model out so people use it, then use the chat logs to fine-tune. A chat room is a kind of environment, there is a human, maybe some tools. The LLM text will generate feedback and right there is a learning signal.

Even without a human, if a LLM has access to code execution it can practice solving coding tasks with runtime feedback. There are many ways a LLM could obtain useful learning signals. After all, we got all our knowledge from the environment as well, in the end there is no other source for knowledge and skills.

Der_Einzige · on March 10, 2024

I want to observe that one of my favorite youtubers did exactly this with making the "uppest case" and "lowest case" letters.

https://www.youtube.com/watch?v=HLRdruqQfRk

I love this guy so much and wish he made far more videos.

astrange · on March 10, 2024

Depends how good the AI output is, just like it depends how good the natural output is.

If most of it is bad but you can get a better AI to tag it as bad, then it's not necessarily a problem.

Kuinox · on March 9, 2024

Without human input, yes.

gwern · on March 10, 2024

Does AlphaZero get inbred?

wredue · on March 9, 2024

>model collapse does not exist in practice

Dude what? That’s a pretty absurd claim. Most generally available models specifically curate their inputs for the express purpose of avoiding AI garbage induced collapse. It’s literally on their cited reasons for avoiding ai generated data as inputs.

Aerroon · on March 9, 2024

>human-guided quality discrimination

This is the part that I don't really understand. Isn't this basically an evolutionary algorithm, where the fitness function is "whatever people like the most" (or at least enough to post it online)?

People rarely generate 10 pieces of content with AI and then share all 10 with the world. They usually only share the best ones. This naturally filters for better output.

Are they saying that evolutionary algorithms don't work?

paulddraper · on March 10, 2024

> human-guided quality discrimination

Precisely.

Whether content is AI-generate, ghostwriter-generated, monkey-on-keyboard-generated, etc...presumably it is implictly filtered by value/quality.

Garbage AI outputs won't be as popular as good AI outputs. (And the same is true of human ones!)

data-ottawa · on March 9, 2024

> Use the model to generate some AI output. Then use that output to train a new instance of the model and use the resulting output to train a third version, and so forth. With each iteration, errors build atop one another. The 10th model, prompted to write about historical English architecture, spews out gibberish about jackrabbits.

That this happens doesn't surprise me, but I'd love to see a curve of how each organic vs machine content mixe ratio results in model collapse over N generations.

sophrocyne · on March 8, 2024

There is a ton you can do to help SOTA AI remain open.

Join the community building the tools - Help with UI/UX, documentation, keeping up with the latest, and evangelizing whatever method the team building it has devised to keep it sustained.

Being part of the community itself is more valuable than you realize.

SamPatt · on March 8, 2024

Where are you finding this community?