In mid-May, OpenAI announced that an internal AI model had disproved the Erdős unit distance conjecture, a famous problem in ...
I tested Opus 4.8 against 4.7 using coding, medical, finance, and legal traps, then cross-checked the results with multiple ...