> For instance, I had to rename a collection of files almost following a pattern. I know that there are apps that do this and normally I’d reach for the Perl-based rename script. But I do it so irregularly that I have to install it every time, figure out how I can do a dry run first, etc. Meanwhile, with the Raycast AI integration that also supports Finder, I did it in the 10-15 seconds that it took to type the prompt.
> On the other hand LLMs constantly mess up some algorithms and data structures, so I simply do not let LLMs touch certain code.
See, these two things seem at odds to me. I suppose it is, to a degree, knowledge you can build over time: that an LLM is suitable for renaming files but not for certain other tasks. But for me, I'd be really cautious about letting an AI rename a collection of files, cautious enough that the same restrictions would apply as to a script: I'd need to write the prompt, verify the output via a dry run or test run, modify as necessary, and ultimately let the AI loose and hope for the best.
Meanwhile, I probably have a script kicking around somewhere that will rename a batch of files, and I can modify it pretty quickly to match a new pattern, test it out, and be confident that it will do exactly what I expect it to do.
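Such a script doesn't need to be long. A minimal sketch of what it might look like, assuming the rename pattern can be expressed as a regex (the function name and parameters here are invented for illustration, not from any particular script):

```python
"""Batch-rename files matching a regex; dry run by default."""
import re
from pathlib import Path

def batch_rename(directory, pattern, replacement, dry_run=True):
    """Plan (and optionally apply) regex renames for files in `directory`.

    Returns the list of (old_path, new_path) pairs so the plan can be
    inspected before running again with dry_run=False.
    """
    renames = []
    for path in sorted(Path(directory).iterdir()):
        if not path.is_file():
            continue
        new_name = re.sub(pattern, replacement, path.name)
        if new_name != path.name:
            renames.append((path, path.with_name(new_name)))
    for src, dst in renames:
        # Always print the plan; only touch the filesystem when asked.
        print(f"{src.name} -> {dst.name}" + ("  (dry run)" if dry_run else ""))
        if not dry_run:
            src.rename(dst)
    return renames
```

Defaulting to `dry_run=True` and returning the planned pairs is exactly the verify-first workflow described above: inspect the output, then rerun with `dry_run=False` once it matches expectations.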
Is one of these paths faster than the other? I'm not sure; it's probably a wash. The AI would definitely be faster if I were confident I could trust it. But I'm not sure how I can cross that threshold in my mind and become confident that I can trust it.
> See, these two things seem at odds to me. I suppose it is, to a degree, knowledge you can build over time: that an LLM is suitable for renaming files but not for certain other tasks. But for me, I'd be really cautious about letting an AI rename a collection of files, cautious enough that the same restrictions would apply as to a script: I'd need to write the prompt, verify the output via a dry run or test run, modify as necessary, and ultimately let the AI loose and hope for the best.
Why? I never understand this level of caution. Don't we all use version control? Just feed it the prompt, and if it messes up, undo the changes.
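That safety net is cheap to set up. A sketch, using a scratch repo with invented filenames: commit a snapshot before letting the tool loose, then roll everything back in two commands if the result is wrong.

```shell
set -e
# Work in a throwaway repo so nothing real is at risk.
dir=$(mktemp -d)
cd "$dir"
git init -q
touch IMG_001.jpeg IMG_002.jpeg
git add -A
git -c user.email=you@example.com -c user.name=you commit -qm "before AI rename"

# Simulate the AI getting the rename wrong:
mv IMG_001.jpeg 001_wrong.jpg

# Undo everything back to the snapshot: restore tracked files,
# then delete the untracked leftovers the bad rename created.
git reset --hard -q
git clean -fdq
```

The `git clean` step matters: a bad rename leaves the new name as an untracked file, which `git reset --hard` alone won't remove.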
As another commenter suggested, this only works for some workflows. I'd also argue it kind of undermines the idea that an LLM can do this work better than a script.
Sure, but that is less work. You can also have separate LLM QA prompts that compare test-suite behavior to production behavior.
Ultimately you are right, the buck needs to stop somewhere, but at least in my experience, the more quality/test checks you add to LLM workflows, the higher the rate of success.