Science – Page 70

Decoding Harmful Intent: Probing Large Language Models for Cyber Threats

19.01.2026 by qfx

New research explores how analyzing internal model states can effectively detect malicious prompts targeting large language models, offering a practical defense against emerging cybersecurity risks.

Beyond Benchmarks: The Rise of Recursive Reasoning in AI

19.01.2026 by qfx

The task demonstrates an example of an ARC-AGI challenge.

The ARC Prize 2025 technical report details significant progress toward artificial general intelligence, revealing how systems are learning to improve themselves through iterative refinement.

Learning What’s Normal to Find What Isn’t

19.01.2026 by qfx

The analysis of PageBlocks and Thyroid datasets reveals that initial phases of model training-warm-up and polarization-are characterized by fluctuating risks of both accepting incorrect data (inliers) and rejecting correct data (outliers), suggesting an inherent instability before convergence.

A new active learning framework boosts outlier detection by first mastering the characteristics of normal data.

Reading Minds for Mood: AI Spots Depression in Brain Signals

19.01.2026 by qfx

A new study demonstrates the potential of artificial intelligence to detect depression by analyzing electroencephalography (EEG) data, offering a promising avenue for objective diagnosis.

Spot the Fake: A New Dataset to Combat AI-Generated Video

19.01.2026 by qfx

DeepSeek-VL-2 demonstrates a discernible preference among evaluation metrics when assessing the accuracy of AI-generated video detection, suggesting the model's performance is not uniformly consistent across all assessment standards.

Researchers have released a comprehensive benchmark to help detect increasingly realistic videos created by artificial intelligence.

Beyond Surface-Level Explanations: Making Graph AI Truly Understandable

19.01.2026 by qfx

The self-reflection framework reveals a performance decline correlated with increasing levels of spurious correlation-specifically, as the correlation coefficient [latex]b[/latex] rises from 0.5 to 0.9, the system’s efficacy diminishes, demonstrating the framework’s sensitivity to deceptive patterns within the data.

A new self-reflection framework helps graph neural networks identify and eliminate misleading correlations, leading to more reliable and consistent explanations.

Stories in Data: Unlocking Narrative Insights

19.01.2026 by qfx

An interactive interface facilitates narrative analytics through a semantic map visualization, enabling knowledge integration and direct manipulation of the underlying narrative structure.

A new approach combines automated text analysis with human expertise to make sense of complex stories hidden within large collections of text.

The Illusion of Progress: How AI’s Promise Can Distort Markets

19.01.2026 by qfx

The mere presence of advanced artificial intelligence, even if unused, can create incentives for strategic manipulation of regulatory systems and ultimately shift market dynamics.

Mapping Memory to Markets: A New Approach to Fraud Detection

19.01.2026 by qfx

The hippocampus harbors dual mechanisms-one consolidating recent experiences, the other retrieving distant memories-suggesting a fundamental architectural tension between plasticity and recall, where strengthening one inevitably compromises the other, a prophecy of inevitable forgetting.

Researchers are drawing inspiration from the human hippocampus to build more effective systems for identifying fraudulent activity in online finance.

Can You Spot the Fake? Cognitive Strain and the Rise of Voice Deepfakes

19.01.2026 by qfx

Accuracy in detecting voice clones did not significantly differ between single- and dual-task conditions, as participant-averaged results demonstrated comparable performance across both scenarios for both genuine and spoofed stimuli.

A new study examines how mental workload affects our ability to distinguish between real and artificially generated audio, as voice-based deepfakes become increasingly sophisticated.