GLiClass models converted to ONNX format, as well as 8-bit quantization
Carlo Moro (cnmoro)
AI & ML interests: None yet
Recent Activity
Reacted to robtacconelli's post about 4 hours ago:
Nacrith: a 135M model that out-compresses everything on natural language
What if a tiny LM could compress English text better than _every_ compressor out there, classical or neural, small or large?
Nacrith pairs SmolLM2-135M with an ensemble of online predictors and high-precision arithmetic coding.
What's inside
The standard LLM + arithmetic coding approach wastes ~75% of CDF precision on large vocabularies; our CDF-24 fix alone recovers 0.5 bpb. On top of that: a token N-gram that skips the GPU on predictable tokens, an adaptive bias head, a llama.cpp backend (7× faster than PyTorch), multi-GPU parallel compression, and a binary file format (NC06), the first LLM-based binary compressor we know of.
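The precision argument is easy to see in code. Here is a minimal sketch of the widening idea; the function name and rounding details are my own illustration, and the repo's actual CDF-24 implementation may differ:

```python
import numpy as np

CDF_BITS = 24            # widen the usual 16-bit table ("CDF-24")
TOTAL = 1 << CDF_BITS    # 16,777,216 integer probability slots

def quantize_cdf(probs: np.ndarray) -> np.ndarray:
    """Quantize a float next-token distribution into an integer CDF.

    Every token needs a count of at least 1 to stay decodable. With a
    ~49k vocabulary (SmolLM2's is 49,152) that floor alone consumes
    ~75% of a 16-bit table's 65,536 slots, but under 0.3% of a 24-bit
    one, so almost all precision goes to the model's real probabilities.
    """
    counts = np.maximum((probs * TOTAL).astype(np.int64), 1)
    counts[np.argmax(counts)] += TOTAL - counts.sum()  # absorb rounding drift
    cdf = np.zeros(len(counts) + 1, dtype=np.int64)
    np.cumsum(counts, out=cdf[1:])
    return cdf  # token t owns the interval [cdf[t], cdf[t+1]) of [0, TOTAL)
```

The arithmetic coder then narrows its range to each token's interval; the same quantization has to run identically on the decode side to stay in sync.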
Runs on a GTX 1050 Ti. ~500 MB weights, ~1.2 GB VRAM per worker.
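The GPU-skip mentioned above can be sketched as a cheap online predictor that only defers to the LLM when unsure. The order, confidence threshold, and smoothing below are assumptions of mine, not Nacrith's actual settings:

```python
from collections import Counter, defaultdict

class NgramPredictor:
    """Online order-2 token n-gram used as a cheap stand-in for the LLM."""

    def __init__(self, vocab_size: int, confidence: float = 0.95):
        self.vocab_size = vocab_size
        self.confidence = confidence
        self.counts = defaultdict(Counter)  # (t-2, t-1) -> next-token counts

    def distribution(self, context):
        """Return a smoothed distribution if the context is predictable,
        else None, signalling that the LLM (the GPU call) is needed."""
        ctx = self.counts[tuple(context[-2:])]
        total = sum(ctx.values())
        if total == 0 or ctx.most_common(1)[0][1] / total < self.confidence:
            return None
        # Laplace smoothing keeps every token decodable.
        return [(ctx[t] + 1) / (total + self.vocab_size)
                for t in range(self.vocab_size)]

    def update(self, context, token):
        self.counts[tuple(context[-2:])][token] += 1
```

Per token the encoder would pick `ngram.distribution(ctx) or llm_probs(ctx)`, so predictable stretches never touch the GPU; the decoder runs the identical rule, which keeps both sides synchronized.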
Code: https://github.com/robtacconelli/Nacrith-GPU
Space: https://huggingface.co/spaces/robtacconelli/Nacrith-GPU
Paper: https://huggingface.co/papers/2602.19626
Try it, break it, share your results. All feedback is welcome, and a star on the repo is appreciated!
Results across all systems we tested:
- alice29.txt: 0.918 bpb (−44% vs CMIX, −20% vs ts_zip), below the 2nd-order Shannon entropy bound
- enwik8 (100 MB): 0.9389 bpb (−8% vs FineZip/LLMZip's 8B model, −15% vs ts_zip)
- Unseen text: 0.723 bpb on a doc published after the training cutoff (no memorization), 26% better than FineZip/LLMZip on the same model
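For readers new to the metric: bpb is compressed bits divided by original bytes. A quick back-of-the-envelope check of the enwik8 figure above (the compressed size here is derived from the reported bpb, not measured from the repo):

```python
original_bytes = 100_000_000                  # enwik8 is exactly 100 MB
bpb = 0.9389                                  # figure reported above
compressed_bytes = bpb * original_bytes / 8   # ~11.74 MB on disk
print(f"{8 * compressed_bytes / original_bytes:.4f} bpb")  # 0.9389
```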
SmolLM2-135M by https://huggingface.co/HuggingFaceTB
Liked a Space about 4 hours ago: robtacconelli/Nacrith-GPU
Liked a model 1 day ago: badaramoni/wave-field-v4-825m