
I tested it the other day, and Claude with Reasoning got it correct every time.


The interesting point is that many fail (100% in the class I had to select), which raises the question of what distinguishes the pass class from the fail class, and the even more important question of whether the solution within the pass class is contextual or definitive.



