Open's O1 and Deepseek's R1 models, which previously stayed on the leader board, could get approximately 9% of the exams. Read more
Source link
Open's O1 and Deepseek's R1 models, which previously stayed on the leader board, could get approximately 9% of the exams. Read more
Source link