CodeScout Collection RL-trained code search agents (1.7B, 4B, 14B) that outperform 2–18× larger models using only a Unix terminal. 📄 arxiv.org/abs/2603.17829 • 12 items • Updated 8 days ago • 5
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 6
artificial-hivemind Collection This collection contains datasets for the Artificial Hiveminds paper. • 4 items • Updated May 16, 2025 • 16
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 24 days ago • 23
Parallia/ClinicalEncoder25-Diagnosable-Colbert-L2-for-medical-texts Sentence Similarity • 0.4B • Updated Dec 20, 2025 • 16