Beyond Curiosity: Rewarding Agents for Changing Their Minds
![The system introduces a method for guiding policy learning through intrinsic rewards derived from both the consistency of strategic embeddings across state transitions-quantified as Strategy Stability-and the magnitude of prediction error coupled with shifts in strategy-captured by Strategy Surprise [latex] r_{int} [/latex].](https://arxiv.org/html/2601.10349v1/x1.png)
A new reinforcement learning framework, Strategy-aware Surprise, encourages more effective exploration by focusing on shifts in an agent’s behavioral approach, not just novel states.
![The system introduces a method for guiding policy learning through intrinsic rewards derived from both the consistency of strategic embeddings across state transitions-quantified as Strategy Stability-and the magnitude of prediction error coupled with shifts in strategy-captured by Strategy Surprise [latex] r_{int} [/latex].](https://arxiv.org/html/2601.10349v1/x1.png)
A new reinforcement learning framework, Strategy-aware Surprise, encourages more effective exploration by focusing on shifts in an agent’s behavioral approach, not just novel states.

Researchers introduce ChartComplete, a comprehensive resource designed to push the boundaries of AI’s ability to interpret a wider range of chart types.

Researchers are exploring how subtly crafted image patches, generated by advanced AI models, can bypass facial recognition systems, and the techniques to detect these deceptive alterations.
![The architecture anticipates eventual failure, embracing a layered defense-[latex]LADFA[/latex]-designed not to prevent cascading errors, but to contain their inevitable spread.](https://arxiv.org/html/2601.10413v1/x1.png)
A new framework uses the power of large language models to automatically map how personal data flows within the complex ecosystems of connected vehicle mobile applications.

Researchers have developed a deep learning framework that combines time and frequency analysis of ECG signals for more accurate and reliable detection of atrial fibrillation.

Researchers are demonstrating a new method for training autonomous agents to use tools by extracting procedural knowledge directly from natural language text.

New research introduces a method for automatically identifying mislabeled data points, boosting the accuracy of machine learning models.
New research reveals that leveraging SHAP values in adversarial attacks significantly increases the success rate of misclassifying images in computer vision systems.
New research demonstrates how deep learning, coupled with explainable AI techniques, can accurately identify pneumonia in children’s chest X-rays and provide clinicians with crucial insights into its decision-making process.

New research explores how understanding the unique communication styles of older adults can lead to more effective and accessible AI-powered technology support.