Article Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture Jan 5 • 39
Article Community Evals: Because we're done trusting black-box leaderboards over the community 16 days ago • 72
Article Rigth and left alignment on Large Language Models and its variants 20 days ago • 1
Pi05 Knolwedge Insulation Collection Models that I train for that matter • 6 items • Updated 20 days ago • 1
Article Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models 28 days ago • 10
GPU Acceleration and Portability of the TRIMEG Code for Gyrokinetic Plasma Simulations using OpenMP Paper • 2601.14301 • Published Jan 17 • 1
Physical AI Collection VLM and models used for Physical AI, LeRobot, Nvidia, etc. Handy • 4 items • Updated 25 days ago • 1
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 149
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published Oct 22, 2025 • 115
Article Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models Oct 20, 2025 • 20
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 181
Article Nano Banana (Gemini 2.5 Flash Image) Full Tutorial - 27 Unique Cases vs Qwen Image Edit - Free 2 Use Aug 27, 2025 • 2