A new study made a version of GPT-5 Thinking admit its own misbehavior. But it's not a quick fix for bigger safety issues.
Abstract: Cognitive diagnosis (CD) models assess students’ knowledge states by analyzing their test performance and identifying which knowledge concepts they have mastered. However, traditional CD models often ...