The Reality Gap in AI Code Security

New research reveals a significant performance drop-off when deep learning and large language models are applied to detecting vulnerabilities in real-world code.

New research shows that combining cognitive principles with agent-based systems yields more reliable long-form content than simply increasing model size.

A new review finds that data quality, careful implementation, and financial domain expertise are more critical to successful reinforcement learning in finance than sophisticated algorithmic design.

Even when grounded in structured knowledge, large language models can still generate factually incorrect information. This research explores why, and offers a new approach to detecting these ‘hallucinations’.

A new approach uses reinforcement learning to significantly reduce factual errors and improve the consistency of answers from large language models across both quick queries and in-depth explanations.

A new approach combines reinforcement learning and equilibrium concepts to optimize investment portfolios even when faced with sudden, unpredictable market shifts.

A new analysis assesses how well artificial intelligence tools can automatically pull crucial data from the ever-growing body of materials science research.

Despite impressive gains in artificial intelligence, new research reveals that even the most advanced language models struggle with fundamental biases and strategic inconsistencies during complex negotiations.

Researchers have developed BugSweeper, a novel framework that leverages graph neural networks to pinpoint vulnerabilities in smart contract code with greater precision.

Researchers are harnessing the power of machine learning to isolate the faint signals from the universe’s first stars and galaxies.