u/snackymann

AudioHijack: adversarial audio attacks on generative voice models transfer from open weights to Microsoft and Mistral production systems
▲ 30 r/netsec

AudioHijack: adversarial audio attacks on generative voice models transfer from open weights to Microsoft and Mistral production systems

Interesting new research you may have heard of on attacking large audio language models. The attack is called AudioHijack and the part worth paying attention to is that adversarial clips built against open models transferred to commercial Microsoft and Mistral systems sharing the same architecture. OpenAI and Anthropic are harder targets but the team thinks shared open-source audio encoders are a viable path in, and they're working on it.

The manipulations are shaped to sound like natural reverberation instead of added noise, so you can't really hear them. Threat model only requires controlling the audio the model processes, not the user's prompt. So: poisoned YouTube clips, music, voice notes, Zoom audio fed to transcription, and the team also says they've gotten this working against live voice chats in real time (unpublished).

Six attack categories demonstrated. Refusing user requests, returning false info, inserting malicious links, swapping persona, claiming it can't process audio, and triggering unauthorized tool use.

On the technical side, two things stood out to me. First, generative audio models tokenize the input, which kills the fine-grained gradient signal older adversarial audio work relied on, so they approximated it. Second, they explicitly hijack the attention mechanism by scoring how much attention the model pays to the adversarial audio vs. the user instruction and feeding that back into the optimization.

Defenses are where it gets bleak. Few-shot prompting with examples of malicious instructions cut attack success by 7%. Self-reflection caught 28%. Monitoring internal attention patterns was the only thing that actually worked, and an attacker who knows about it can dial back the attention manipulation and take a small hit to success rate to evade it.

Microsoft acknowledged the work and pointed at developer-side mitigations. Mistral didn't respond.

Text prompt injection at least leaves visible artifacts. Audio doesn't, and we don't really have a good story for this yet.

Thoughts?

spectrum.ieee.org
u/snackymann — 1 day ago

Are we the last generation of PC builders? ASUS shipping 5M fewer motherboards, ASRock down 30%, and the chipmakers don't care because AI servers pay better.

I keep going back and forth this.

The numbers from this week:

  • ASUS: 15M boards in 2025, targeting 10M for 2026. Reportedly worse than 2008 and worse than COVID year one.
  • Gigabyte: 11M down to about 8.5M
  • MSI: similar ~25% drop
  • ASRock: 4.3M down to 2.7M (~30% down)

The official story is "DDR5 prices spiked, builders paused." Sure. But why did DDR5 spike? Samsung, SK Hynix, and Micron redirected up to 80% of their capacity to HBM for AI accelerators. The RAM didn't get expensive by accident. Consumer memory got deprioritized because Nvidia and the hyperscalers will pay anything for HBM and we won't.

And here's the part that boggles me, the motherboard makers are fine. ASUS is reportedly looking at 100% QoQ server revenue growth in Q1 2026. Gigabyte and ASRock are pivoting hard into AI server boards. The DIY market collapsing isn't a crisis for them, it's a rounding error next to the server money.

So my actual question:

When the people who make our parts make 10x more selling to a single hyperscaler than they do to all of us combined, what's the realistic future of this hobby? GPUs already went this way. RTX 50 launched expensive and got more expensive, no new gen until 2028. Now boards. RAM is 2-4x what it was a year ago. CPUs reportedly next.

Are we cooked? Or is this a cycle that breaks when the AI capex bubble corrects? Because from where I'm sitting it looks like the consumer enthusiast PC isn't being killed by lack of demand, it's being killed by the fact that we're the lowest-margin customer in the building, and there's a much better customer in the next room.

Curious what people who've been building longer than me think. Have we been here before and will it turn around?

tomshardware.com
u/snackymann — 13 days ago