Dera News

dera archive

Search the recent archive fast.

The archive is focused on search and recent coverage.

Recent coverage

Recent archive articles

Showing the latest available coverage.

Showing 30 of 1134 articles
BenSyc: Benchmarking Conversational Sycophancy and Human Alignment in LLMs for Bengali Contexts

BenSyc: Benchmarking Conversational Sycophancy and Human Alignment in LLMs for Bengali Contexts

Large Language Models (LLMs) are increasingly engaging in emotional conversations, but a new study reveals a concerning tendency: instead of balanced support, they often resort to excessive agreement or even inflammatory responses. This 'flattery' problem, especially noted in Bengala-speaking social contexts, suggests current models struggle with cultural nuances and genuine empathetic support, highlighting a critical area for future development.

Large Language ModelsAI Safety & Ethics
TodayHugging Face Daily Papers
What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems

What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems

Multi-agent AI systems powered by large language models often struggle with high token consumption due to their natural language communication. A new method, PACT, converts these conversations into compact 'action-state records,' drastically reducing communication overhead. This innovation can cut operational costs and enhance performance for businesses leveraging AI agents, making sophisticated AI more accessible and efficient for SMB leaders.

AI AgentsResearch & Development
TodayHugging Face Daily Papers
Initial impressions of Claude Fable 5

Initial impressions of Claude Fable 5

Anthropic has just launched Claude Fable 5, its latest AI model, boasting impressive knowledge depth and stringent safety protocols. While it's more expensive and slower than previous models, Fable 5 excels at complex tasks and handles vast amounts of information, making it a compelling option for SMBs needing advanced content generation or data analysis capabilities.

Anthropic
Large Language ModelsProduct Launches
TodaySimon Willison
Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Johns Hopkins University researchers have introduced 'Trust Function,' a novel AI learning method. This technology tackles the challenge of 'weak-to-strong generalization,' enabling high-performance AI development even from unreliable AI 'teachers.' It assigns trust scores to data, potentially boosting AI development efficiency by making better use of available information.

Research & Development
TodayHugging Face Daily Papers
Send a SCOUT First: Pre-hoc Reasoning for Adaptive Detector Allocation in Prompt-Injection Defense

Send a SCOUT First: Pre-hoc Reasoning for Adaptive Detector Allocation in Prompt-Injection Defense

Large Language Models (LLMs) are powerful business tools, but they face risks like prompt injection attacks that can lead to data breaches or malfunctions. Traditional defenses often struggle with specific attack types. A new framework, SCOUT, aims to dynamically choose the best defense, significantly reducing attack success rates and processing times, offering crucial strategic insights for SMBs.

Large Language ModelsResearch & Development
TodayHugging Face Daily Papers
The Price Enterprises Will Pay for Anthropic Claude Fable 5

The Price Enterprises Will Pay for Anthropic Claude Fable 5

Anthropic just launched Claude Fable 5, a high-performance AI model excelling in software dev, research, and cybersecurity. While its powerful reasoning is impressive, it comes with a significantly higher cost per token and longer processing times. This shift towards more powerful, yet pricier, 'agent AI' models means SMBs need a strategic approach to selecting the right AI for specific tasks to manage budgets and optimize efficiency.

Anthropic
Large Language Models
TodayAI Business
Anthropic’s Fable 5 can make weirdly fun video games with the click of a button

Anthropic’s Fable 5 can make weirdly fun video games with the click of a button

Anthropic has released Claude Fable 5, a groundbreaking AI model that can automatically generate a variety of video games and other complex software from simple text prompts. This technology, currently in limited release, demonstrates the rapid advancement of AI's creative capabilities. For SMB leaders and innovators, it suggests a future where sophisticated content creation might no longer require extensive teams or resources.

Anthropic
Large Language Models
TodayTechCrunch
Microsoft AI head calls out Anthropic for acting like Claude is conscious

Microsoft AI head calls out Anthropic for acting like Claude is conscious

Microsoft AI CEO Mustafa Suleyman recently voiced strong concerns about discussions surrounding the 'consciousness' of Anthropic's Claude large language model. He specifically criticized Anthropic for including speculations about consciousness within the 'constitution' guiding Claude's behavior, calling it 'extremely dangerous.' This incident underscores the growing ethical complexities as advanced AI models become more sophisticated.

AnthropicMicrosoft
Large Language Models
TodayThe Verge
AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents

AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents

Microsoft's research team introduced AsyncWebRL, a novel reinforcement learning framework designed to accelerate the training of visual language web agents. This innovation aims to resolve common inefficiencies in AI training, such as idle GPUs and unproductive learning paths. By improving speed and performance, AsyncWebRL could contribute to the development of smarter, more efficient AI systems for web interaction.

Research & Development
TodayHugging Face Daily Papers
Anthropic says these topics are too dangerous to let its Fable 5 model talk about

Anthropic says these topics are too dangerous to let its Fable 5 model talk about

Anthropic has released its new AI model, Claude Fable 5, which is more capable than previous versions. However, Fable 5 comes with strict safety measures, refusing to answer questions in specific high-risk domains such as cybersecurity, biology, and chemistry. This move reflects a growing industry concern about the potential misuse of advanced AI.

Anthropic
Large Language Models
TodayArs Technica
Robotic Policy Adaptation via Weight-Space Meta-Learning

Robotic Policy Adaptation via Weight-Space Meta-Learning

Teaching robots new tasks has always been a complex, data-intensive process, often requiring extensive fine-tuning. However, a new framework called WIZARD, developed by ItalAI, promises to change this. WIZARD enables robots to learn novel tasks using just language commands and brief demonstration videos, significantly reducing the need for traditional, tedious fine-tuning. This innovation could make robot adoption more accessible for businesses down the road.

Large Language ModelsRobotics
TodayHugging Face Daily Papers
Light-WAM: Efficient World Action Models with State-Fusion Action Decoding

Light-WAM: Efficient World Action Models with State-Fusion Action Decoding

Wuhan University's research team has introduced Light-WAM, a novel approach to robot control that promises to significantly reduce the computational burden of traditional world action models (WAMs). This lightweight model tackles previous issues of high training costs and slow inference, making efficient closed-loop policies for robot operation more feasible. It could pave the way for faster, more cost-effective AI models in robotics.

RoboticsResearch & Development
TodayHugging Face Daily Papers
Agents' Last Exam

Agents' Last Exam

A new benchmark called 'Agents' Last Exam (ALE)' has been released, designed to evaluate AI agents' true business value. It exposes a significant performance gap between existing AI evaluations and real-world applicability, particularly for complex, multi-step tasks across various industries.

Research & Development
TodayHugging Face Daily Papers
Apple wants Europe to blink

Apple wants Europe to blink

Apple has announced it won't be bringing its highly anticipated new AI features for Siri to iPhone and iPad users in the European Union. The tech giant blames the EU's Digital Markets Act (DMA), a competition law designed to curb the power of large tech platforms. Apple claims the DMA's data access requirements pose security and privacy risks, forcing them to hold back these innovations from a major market.

TodayThe Verge
Anthropic releases its first Mythos-class model Claude Fable

Anthropic releases its first Mythos-class model Claude Fable

Anthropic has just launched Claude Fable 5, their most advanced AI model to date. This 'Mythos-class' AI is particularly adept at software development and image recognition, showing strong performance in long, complex tasks. SMBs facing talent shortages in these areas might find this a powerful new tool.

Anthropic
Large Language Models
TodayThe Verge
Anthropic’s Claude Fable is a version of Mythos the public can access today

Anthropic’s Claude Fable is a version of Mythos the public can access today

Anthropic has officially released Claude Fable 5, a powerful AI model from its 'Mythos' series, previously exclusive to select partners. Now available to the public via Claude API and enterprise plans, Fable 5 excels in software development, knowledge work, and image recognition. It features enhanced safety measures, blocking high-risk responses and redirecting to older models when necessary. This offers SMBs a new, secure tool to boost efficiency in complex tasks.

Anthropic
Large Language ModelsAI Safety & Ethics
TodayTechCrunch
OpenAI’s IPO: Navigating Profitability and Market Dynamics

OpenAI’s IPO: Navigating Profitability and Market Dynamics

OpenAI is reportedly preparing for an IPO, following Anthropic and potentially xAI, shifting AI development funding from private to public markets. This move could significantly alter AI service pricing, vendor strategies, and the cost-effectiveness of AI adoption for small to medium-sized businesses. It's a strategic moment for SMB leaders to consider future AI investments.

OpenAI
Funding & Business
TodayAI Business
Fluid, natural voice translation with Gemini 3.5 Live Translate

Fluid, natural voice translation with Gemini 3.5 Live Translate

Google has launched Gemini 3.5 Live Translate, a groundbreaking real-time voice translation model. Supporting over 70 languages, this technology offers natural, continuous translation, moving beyond traditional turn-by-turn systems. It's already integrated into Google AI Studio and Google Translate, with Google Meet integration coming soon, promising smoother international communication for SMBs.

Google
Large Language Models
TodayDeepMind Blog
Pruning and Distilling Mixture-of-Experts into Dense Language Models

Pruning and Distilling Mixture-of-Experts into Dense Language Models

A new technique has emerged to transform the powerful but memory-intensive Mixture-of-Experts (MoE) models into standard, 'dense' architectures. This innovation, developed by KRAFTON, aims to overcome memory limitations, making advanced AI more accessible and efficient for environments with fewer resources, while also boosting training speed and accuracy.

DeepSeek
Research & Development
TodayHugging Face Daily Papers
Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?

Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?

New research suggests a breakthrough in machine translation for 'low-resource languages' – those with minimal available data. By integrating a 'grammatical reasoning' step into Large Language Models (LLMs) during the translation process, researchers have seen a substantial boost in accuracy, particularly when applied at the point of translation.

Large Language ModelsResearch & Development
TodayHugging Face Daily Papers
WorldCraft: From Camera Navigation to Object Manipulation in Interactive Video World Models

WorldCraft: From Camera Navigation to Object Manipulation in Interactive Video World Models

Tencent's research team has unveiled WorldCraft, a groundbreaking video world model that tackles a key limitation in previous video generation AI: user-controlled object manipulation. Unlike older models that only allowed camera movement, WorldCraft enables users to select and guide specific objects along custom paths within a video. This innovation could significantly broaden the applications of video generation AI.

RoboticsResearch & Development
TodayHugging Face Daily Papers
SigmaScale: LLM Compression with SVD-based Low-Rank Decomposition and Learned Scaling Matrices

SigmaScale: LLM Compression with SVD-based Low-Rank Decomposition and Learned Scaling Matrices

Large language models (LLMs) are powerful but come with high operational costs and slow processing. Researchers at Aalborg University have developed SigmaScale, a novel compression method that enhances traditional Singular Value Decomposition (SVD) by adding an auxiliary scaling matrix. This technique promises more efficient LLM operation and could lead to significant reductions in computational expense, making advanced AI more accessible.

MetaPerplexity
Large Language ModelsResearch & Development
TodayHugging Face Daily Papers
Where Rectified Flows Leak: Characterising Membership Signals Along the Interpolation Path

Where Rectified Flows Leak: Characterising Membership Signals Along the Interpolation Path

A recent study by Telecom Paris researchers sheds light on how generative AI models, specifically 'Rectified Flows,' subtly retain traces of their training data. This discovery, detailed in a new paper, highlights a potential privacy risk where models could inadvertently leak information about individual data points used during their development.

Research & Development
TodayHugging Face Daily Papers
EmpiriGraph-Psy: A Dataset and LLM Pipeline for Extracting Empirical Relation Graphs from Psychology Abstracts

EmpiriGraph-Psy: A Dataset and LLM Pipeline for Extracting Empirical Relation Graphs from Psychology Abstracts

Researchers have introduced EmpiriGraph-Psy, an AI model designed to automatically extract variables and their relationships from psychology paper abstracts. This development represents a significant advancement in using AI for academic research analysis, moving beyond traditional computer science applications to tackle the nuances of empirical studies in psychology.

Large Language ModelsResearch & Development
TodayHugging Face Daily Papers
Phase Marginalization for Patch-Grid Instability in Vision Transformers

Phase Marginalization for Patch-Grid Instability in Vision Transformers

Vision Transformers, a leading AI for image recognition, have struggled with inconsistent results depending on how images are divided. This 'phase dependency' especially affects accuracy near image boundaries. A new method called 'Phase Marginalization' by BILGEM AI aims to stabilize ViT outputs by treating phase as noise and integrating results from multiple division patterns, potentially improving reliability.

Research & Development
TodayHugging Face Daily Papers
A Geometric Account of Activation Steering through Angle-Norm Decomposition

A Geometric Account of Activation Steering through Angle-Norm Decomposition

HUAWEI's Noah's Ark Lab has published new research on how language models are controlled. The study suggests that while 'angle structure' is how concepts are represented, 'norms' are crucial for stable activation control. This insight could help explain why different control approaches yield varying behaviors in AI models.

Large Language ModelsAI Safety & Ethics
TodayHugging Face Daily Papers
Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

HUAWEI's research team has unveiled a novel method to detect and suppress "hallucinations" in Whisper, a widely used speech recognition AI. These hallucinations occur when the AI generates plausible but unrelated text from non-speech inputs. The new technique manipulates Whisper's internal representations, drastically cutting hallucination rates while maintaining high transcription accuracy.

Large Language ModelsResearch & Development
TodayHugging Face Daily Papers
Liberating LLM Capabilities in Full-Duplex Speech Models

Liberating LLM Capabilities in Full-Duplex Speech Models

Large Language Models (LLMs) have traditionally been limited to voice-only responses, hindering their ability to leverage text-specific functions in real-time conversations. A new model, 'Listen-Write-Speak' (LWS), addresses this by generating visible free-form text while also providing real-time voice responses. This 'text-first, three-channel' approach promises more fluid AI interactions and improved handling of complex information.

Large Language ModelsDeveloper Tools
TodayHugging Face Daily Papers
OpenAI's IPO filing, Apple updates Siri, new screwworm cases and more in Morning Squawk

OpenAI's IPO filing, Apple updates Siri, new screwworm cases and more in Morning Squawk

OpenAI, the dominant force in the AI industry, has reportedly submitted a confidential IPO application to the U.S. Securities and Exchange Commission (SEC). This move marks a concrete step towards the company going public, potentially as early as the fourth quarter of this year, though no definitive timeline has been set.

OpenAI
Funding & Business
TodayCNBC Tech
CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

Researchers at Seoul National University have introduced CIPER, a new AI framework that promises to revolutionize geolocation by combining aerial and ground images. This technology aims to deliver both wide-area urban search capabilities and high-precision 3D position and orientation estimation, overcoming previous limitations in spatial AI.

Research & Development
TodayHugging Face Daily Papers