Seq vs Seq: An Open Suite of Paired Encoders and Decoders Paper • 2507.11412 • Published Jul 15, 2025 • 30
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30, 2025 • 538
ToMMeR -- Efficient Entity Mention Detection from Large Language Models Paper • 2510.19410 • Published Oct 22, 2025 • 1
DIP: Unsupervised Dense In-Context Post-training of Visual Representations Paper • 2506.18463 • Published Jun 23, 2025 • 21