Can AI Know What It Knows?

New research shows that language models can be trained to reliably identify concepts they’ve been taught, opening a path toward more transparent and controllable artificial intelligence.