“Notably, O3-MINI, despite being one of the best reasoning models, frequently skipped essential proof steps by labeling them as “trivial”, even when their validity was crucial.”
24 points
*
read the study yourself
- > ask the commenter if it’s a study or a self-interested blog post
- > they don’t understand
- > pull out illustrated diagram explaining that something hosted exclusively on the website of the for-profit business all authors are affiliated with is not the same as a peer-reviewed study published in a real venue
- > they laugh and say “it’s a good study sir”
- > click the link
- > it’s a blog post
16 points