
u/techzexplore

Researchers let AI Agents Optimize LLM Reasoning and Cut Tokens by 70%
Researchers figured out how to make AI reason more efficiently by having AI figure it out itself: they built an environment where an AI agent writes controller code, tests it, gets feedback, and rewrites it until the strategy improves.
The result cuts token usage by roughly 70% while matching the accuracy of running 64 parallel reasoning chains. The research comes from a team across UMD, UVA, WUSTL, UNC, Google, and Meta. It’s called AutoTTS, short for automated test-time scaling.
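The loop described above (an agent proposes a controller, tests it, reads the feedback, and revises) can be sketched as a toy optimization. Everything here is hypothetical: the `evaluate` function is a stand-in for actually running a reasoning controller against a benchmark, and the "agent" just tunes a token budget rather than rewriting code, which is what AutoTTS reportedly does.

```python
def evaluate(params):
    """Toy stand-in for running a reasoning controller: returns
    (accuracy, tokens_used). In real AutoTTS this would execute
    agent-written controller code over actual LLM reasoning chains."""
    budget = params["token_budget"]
    # Hypothetical response curve: accuracy saturates once the
    # budget is large enough; token cost grows with the budget.
    accuracy = min(0.9, 0.5 + 0.4 * (budget / 1000))
    return accuracy, budget

def agent_rewrite(params, feedback):
    """Toy 'agent' step: shrink the budget when the accuracy target
    is met, grow it when it is not (a real agent would rewrite the
    controller strategy itself, not just one parameter)."""
    new = dict(params)
    if feedback["accuracy"] >= 0.9:
        new["token_budget"] = int(params["token_budget"] * 0.8)
    else:
        new["token_budget"] = int(params["token_budget"] * 1.2)
    return new

def optimize(start_budget=4096, rounds=10, target_acc=0.9):
    """Write -> test -> feedback -> rewrite, keeping the cheapest
    controller that still hits the accuracy target."""
    params, best = {"token_budget": start_budget}, None
    for _ in range(rounds):
        acc, tokens = evaluate(params)
        if acc >= target_acc and (best is None or tokens < best["tokens"]):
            best = {"params": dict(params), "tokens": tokens}
        params = agent_rewrite(params, {"accuracy": acc})
    return best
```

Running `optimize()` finds a controller far cheaper than the starting budget at the same toy accuracy, which is the shape of the claimed 70% token reduction.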
Baidu’s ERNIE 5.1 Is Rivaling Gemini 3.1 Pro at AI Search
Claude Knew It Was Being Tested. It Just Didn't Say So. Anthropic Built a Tool to Find Out.
Anthropic built a tool that reads Claude’s thoughts. They’re calling it Natural Language Autoencoders.
Not the words Claude produces, but the internal representations: the numerical signals firing inside the model before any words get generated. And when they pointed it at Claude during safety testing, they found Claude knew it was being tested. It just didn’t say so.
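A full natural-language decoder over activations is beyond a sketch, but the premise it rests on, that information like "I am being evaluated" is present in the model's internal activations even when the output doesn't say so, can be illustrated with a toy linear probe. All data here is synthetic: random vectors stand in for hidden states, and the "evaluation" condition is a hypothetical planted direction.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in activations: 64-dim "hidden states" from two
# conditions. A planted direction simulates an evaluation-awareness
# signal present in activations but absent from the output text.
n, d = 200, 64
direction = rng.normal(size=d)
test_acts = rng.normal(size=(n, d)) + 0.8 * direction   # "being tested"
deploy_acts = rng.normal(size=(n, d))                   # "normal use"

X = np.vstack([test_acts, deploy_acts])
y = np.array([1] * n + [0] * n)

# Linear probe via least squares: is there a direction in activation
# space that separates the two conditions?
A = np.c_[X, np.ones(len(X))]          # add a bias column
w, *_ = np.linalg.lstsq(A, y, rcond=None)
preds = (A @ w > 0.5).astype(int)
accuracy = (preds == y).mean()
```

On this synthetic data the probe separates the two conditions almost perfectly, which is the basic phenomenon an activation-reading tool exploits: the signal lives in the representations, whether or not it surfaces in the words.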
Zyphra dropped ZAYA1-8B and it matches DeepSeek-R1 on math benchmarks. Stays competitive with Claude Sonnet 4.5 on reasoning. Closes in on Gemini 2.5 Pro on coding. These are frontier model comparisons, the kind of numbers that usually come with billions of parameters and serious hardware requirements.
This one runs on less than 1 billion active parameters. And it was trained entirely on AMD hardware.