Reasoning Without Limits: A New Approach to Adaptive Thinking

Researchers have developed a framework to enhance the reasoning capabilities of large models, allowing them to tackle complex problems with improved efficiency and accuracy.

A new agentic framework uses artificial intelligence to actively investigate and verify the authenticity of video content, moving beyond passive detection.

A new framework leverages the power of generative models and adaptive control to dynamically allocate advertising budgets for maximum return.
![Across a series of continual learning tasks (0–9), standard sequential learning (SB) and naive sequential calibration (SC) exhibited catastrophic forgetting, while continual learning methods demonstrated consistently improved performance; the reduced mean bias relative to estimates derived from Stan suggests that robust parameter estimation is critical for retaining previously acquired knowledge.](https://arxiv.org/html/2602.22884v1/2602.22884v1/x6.png)
This research tackles the challenge of catastrophic forgetting in Bayesian neural networks, applying continual learning techniques to enable robust performance on evolving data streams.
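One standard continual-learning technique for mitigating catastrophic forgetting is an Elastic Weight Consolidation (EWC)-style penalty, which anchors parameters that were important on earlier tasks. The sketch below is a generic illustration of that idea, not the specific method from the summarized paper; the function name and toy values are hypothetical.

```python
import numpy as np

def ewc_penalty(params, old_params, fisher, lam=1.0):
    """EWC-style quadratic penalty: discourages parameters from drifting
    away from values learned on a previous task, weighted by a diagonal
    approximation of the Fisher information (a per-parameter importance
    estimate). Added to the loss on the new task."""
    return 0.5 * lam * np.sum(fisher * (params - old_params) ** 2)

# Toy usage: the first parameter was important on the old task (high
# Fisher weight), so the same amount of drift costs far more there.
old_params = np.array([1.0, -0.5])
fisher = np.array([10.0, 0.1])     # hypothetical importance estimates
new_params = np.array([1.2, 0.5])  # candidate parameters on the new task

loss_anchor = ewc_penalty(new_params, old_params, fisher)
# 0.5 * (10.0 * 0.2**2 + 0.1 * 1.0**2) = 0.25
```

In a training loop this penalty is simply added to the new task's loss, trading off plasticity on new data against retention of old knowledge via `lam`.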

Researchers have developed an AI framework capable of learning optimal Formula 1 race strategies through self-play and real-time adaptation.

Dominant tech companies are poised to control not just AI models, but the crucial process of inference, creating a new bottleneck for competition.
A new benchmark challenge reveals the critical importance of understanding user behavior and temporal dynamics in forecasting success within decentralized finance.

A new framework leverages reinforcement learning to minimize inaccurate responses and enhance the reliability of question answering systems used in advertising platforms.

A new training method incentivizes language model agents to self-report harmful actions, dramatically increasing the detection of covert attacks and bolstering overall safety.

Researchers are leveraging adversarial self-play to automatically generate challenging training data, significantly improving the robustness of multimodal AI systems against perceptual vulnerabilities.