Trajectory Models for Few-Step Diffusion (22 minute read)
This paper replaces standard diffusion denoising with conditional normalizing flows to get four-step image generation without giving up exact likelihood training. The…
This paper proposes Variational Linear Attention, an online least-squares formulation that stabilizes linear attention memory with an adaptive penalty matrix. It targets a core bottleneck in long-context transformers: reducing interference while keeping attention efficient.
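To make the least-squares framing concrete, here is a minimal sketch of reading a linear-attention memory as a ridge-regularized least-squares fit over (key, value) pairs. This is a generic illustration, not the paper's method: the function name, the batch (rather than online) solve, and the fixed scalar penalty `lam` standing in for the adaptive penalty matrix are all assumptions.

```python
import numpy as np

def ls_linear_attention(K, V, q, lam=1e-2):
    """Hypothetical sketch: fit memory M = argmin_M sum_i ||M k_i - v_i||^2
    + lam ||M||^2 over stored pairs, then read out M @ q.
    K: (n, d_k) keys, V: (n, d_v) values, q: (d_k,) query.
    lam is a fixed scalar stand-in for an adaptive penalty matrix."""
    d_k = K.shape[1]
    # Normal equations for the ridge problem: M^T = (K^T K + lam I)^{-1} K^T V
    A = K.T @ K + lam * np.eye(d_k)
    M_T = np.linalg.solve(A, K.T @ V)
    return M_T.T @ q  # read-out, shape (d_v,)
```

With a vanishing penalty and enough stored pairs, the fit recovers any linear map that generated the values; the penalty term is what damps interference when keys are nearly collinear.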
State space models are moving from a niche alternative to a credible transformer competitor, with tradeoffs that matter for long-context efficiency and scaling. The piece is a…
This paper tightens the evaluation of diffusion-based OOD detectors by controlling for backbone choice and test-time budget, then proposes sparse internal feature snapshots as a…
A tight read on the deals, papers, and policy filings worth your time. No takes, no roundups of other people's tweets.