Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 15 days ago • 12
Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 15 days ago • 12
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 60
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs Paper • 2510.18279 • Published Oct 21, 2025 • 4
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation Paper • 2508.13144 • Published Aug 18, 2025
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5, 2025 • 59
Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation Paper • 2506.05062 • Published Jun 5, 2025 • 15
The Generative AI Paradox: "What It Can Create, It May Not Understand" Paper • 2311.00059 • Published Oct 31, 2023 • 20
Faith and Fate: Limits of Transformers on Compositionality Paper • 2305.18654 • Published May 29, 2023 • 7
PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning Paper • 2305.19472 • Published May 31, 2023 • 1
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations Paper • 2311.08469 • Published Nov 14, 2023 • 11
SPLAIN: Augmenting Cybersecurity Warnings with Reasons and Data Paper • 2311.11215 • Published Nov 19, 2023 • 2
COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements Paper • 2306.01985 • Published Jun 3, 2023 • 1