view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 14 days ago • 90
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 28 days ago • 63
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 261
Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 46 items • Updated 9 days ago • 68
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 500
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 176
Tri Series Collection Introducing our new series of models: Tri-7B, Tri-21B, and Tri-70B-preview-SFT • 10 items • Updated Sep 10, 2025 • 8
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 508
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 395
H-Net Collection The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11, 2025 • 20
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance May 21, 2025 • 38