Respectfully, I disagree - the situation is far more complex in science than in software engineering disciplines.
I agree that different tests require different amounts of effort (obviously), but even the simplest "unit tests" you could conceive of for scientific domains are very complex, as there's no standard (or even unique) way to translate a scientific problem into a formally checkable system. Theories are frameworks within which experiments can be judged, but this is rarely unambiguous, and often requires a great deal of domain-specific knowledge - in analogy to programming it would be like the semantics of your language changing with every program you write. On the other hand, any programmer in a modern language can add useful tests to a codebase with (relatively) little effort.
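For contrast, here is the sort of low-effort software-side test I mean - a self-contained sketch (pytest assumed; the function and expected values are purely illustrative):

```python
# test_example.py - the kind of test any programmer can add in minutes (pytest).
# The function and expected values are illustrative, not from any real codebase.

def word_count(text: str) -> int:
    """Toy stand-in for real application code."""
    return len(text.split())

def test_word_count_basic():
    # The specification is unambiguous: one input, one expected output.
    assert word_count("testing is cheap here") == 4

def test_word_count_empty_string():
    assert word_count("") == 0
```

The semantics of that check never shift underneath you, which is exactly the luxury a scientific "unit test" doesn't have.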
We are talking hours versus months or even years here!
The experiment informs the ontology which informs the experiment. I don't think this is reducible to bias, although that certainly exists. Rather to me it's inherent uncertainty in the domain that experiments seek to address.
Business practice, as you use the term, evolved to serve very different needs. Automated testing is useful for building software, but that effort may be better spent in science developing new experiments and hypotheses. It's very much an open problem whether the juice is worth the squeeze - in fact the lack of such efforts is (weak) evidence that it might not be. Scientists are not stupid.
> We are talking hours versus months or even years here!
That is why there are engineers who specialize in structuring tests so they don't take so long. For example, long tests don't need to run at all if more critical short tests fail (see the sketch below). The problems you describe for astrophysics are not unique to astrophysics, even if the scale, size, and diversity of the data are unusual. Likewise, all the excuses I hear for avoiding testing are the very same excuses software developers make.
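A minimal sketch of that fail-fast ordering (pytest markers assumed; file and marker names are hypothetical):

```python
# run_tests.py - staged test execution: cheap checks gate the expensive ones.
# Marker names ("fast", "slow") are hypothetical; the pattern is what matters.
import subprocess
import sys

def run(args):
    """Run a pytest invocation and return its exit code."""
    return subprocess.call(["pytest", *args])

# Stage 1: fast, critical unit checks (seconds). Stop at the first failure.
if run(["-m", "fast", "--maxfail=1"]) != 0:
    print("Fast tests failed; skipping the slow end-to-end suite.")
    sys.exit(1)

# Stage 2: slow integration / full-pipeline tests (minutes to hours).
sys.exit(run(["-m", "slow"]))
```

The expensive suite never even starts unless the cheap, critical checks pass, so the long-running cost is only paid when it can actually tell you something.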
The reality is that these excuses are maybe 25% valid. On their face they are complete garbage, but the valid part is simple: a validation cannot occur faster than the underlying system allows. If the system being tested is remarkably slow, then any testing built on it will be, at best, just as remarkably slow. That is not a fault of the test or the testing; it is entirely the fault of that underlying system.
Uniqueness is not a requirement =) I still think you are over-generalising from experience in the software world. Science is about sense-making, finding parsimonious ontologies to describe the world. Software is about building reliable automation for various purposes.
They have orthogonal goals; why would you believe that automated testing would work the same way in both domains? I just don't see it.
Maybe you can elaborate on what you mean by automated testing of scientific hypotheses? I get the feeling we are talking past each other because we're both repeating the same points. Maybe we should focus on the 25% of excuses you've agreed are valid!
That’s because you haven’t tried. You only test what you know, what’s provable. The goal isn’t 100% validation of everything. The only goal is error identification. Surely you know something in science, like the distance to Proxima Centauri. Start with what you know.
Then when something new comes along the only goal is to see which of those known things are challenged.
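As a minimal sketch of that "start with what you know" idea - pinning a computed quantity against an accepted value with a tolerance (the parallax-to-distance relation is real physics; the tolerance and the idea of wiring this to your own pipeline's output are illustrative):

```python
# test_known_values.py - pin computed results against accepted reference values.
import pytest

def parallax_to_parsecs(parallax_arcsec: float) -> float:
    """Distance in parsecs from an annual parallax in arcseconds."""
    return 1.0 / parallax_arcsec

def test_proxima_centauri_distance():
    # Proxima Centauri: parallax ~0.7685 arcsec, accepted distance ~1.30 pc.
    distance_pc = parallax_to_parsecs(0.7685)
    assert distance_pc == pytest.approx(1.30, rel=0.01)
```

If a new measurement or a pipeline change pushes that number outside the tolerance, the test flags exactly which known thing is being challenged.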
Testing doesn’t buy certainty. It’s more like insurance: it lowers risk for a comparatively small investment of time. As with insurance, nobody has wild expectations that it will prevent a house fire, but it will prevent unexpected homelessness.