It's always the edge cases that make this a pain. The less like 'random' XML the...

quotemstr · 2025-10-05T03:08:35 1759633715

Of course. But the mathematical, computer-science level truth is that you can make a regular pattern that recognizes a string in any context-free language so long as you're willing to place a bound on the length (or equivalently, the nesting depth) of that string. Everything else is a lie-to-children (https://en.wikipedia.org/wiki/Lie-to-children).
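
To make that concrete, here is a minimal sketch in Python, assuming balanced parentheses as a stand-in for the context-free language. The names `balanced_parens_regex` and `max_depth` are made up for illustration. The idea is just to unroll the recursion a fixed number of times, so the result is an ordinary regular pattern with no recursion or backreferences.

```python
import re


def balanced_parens_regex(max_depth: int) -> str:
    """Build a regex source for balanced parentheses nested at most max_depth deep.

    Hypothetical helper for illustration: each loop iteration wraps the
    previous pattern in one more level of "( ... )" repeated zero or more times.
    """
    pattern = ""  # depth 0: only the empty string is accepted
    for _ in range(max_depth):
        pattern = rf"(?:\({pattern}\))*"
    return pattern


# A matcher that accepts nesting up to depth 3 and nothing deeper.
matcher = re.compile(balanced_parens_regex(3))

print(bool(matcher.fullmatch("(()())()")))  # depth 2, within the bound
print(bool(matcher.fullmatch("(((())))")))  # depth 4, exceeds the bound
print(bool(matcher.fullmatch("(()")))       # unbalanced
```

With the bound set to 3, the first string matches and the other two do not. Raising `max_depth` only makes the pattern longer; it never needs anything beyond plain regular-expression operators.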

rcxdude · 2025-10-05T10:18:40 1759659520

You can, but you probably shouldn't, since said regex is likely to be very hard to work with due to the number of redundant states involved.

quotemstr · 2025-10-05T15:08:21 1759676901

Our discourse does a terrible job of distinguishing impossible things from things that are merely ill-advised. Intellectual honesty requires us to be up front about the difference.

Yeah, I'd almost certainly reject a code review using, say, Python's re module to extract stuff from XML, but while doing so, I would give every reason except "you can't do that".