Openais O1 and Deepseek models from R1, which were previously on the ranking, were only able to get through about 9% of the exam. Read more
Source link
Posted inBusiness
Openais Deepresearch can complete 26% of the last examination of humanity – a yardstick for the limit of human knowledge
