Eni Grand's picture

Eni Grand

Enigrand

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

upvoted an article 1 day ago

Safetensors is Joining the PyTorch Foundation

upvoted a collection 2 days ago

View all activity

Organizations

upvoted a paper 1 day ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published 5 days ago • 39

upvoted an article 1 day ago

Article

Safetensors is Joining the PyTorch Foundation

3 days ago

•

28

upvoted 3 collections 2 days ago

VoxCPM

5 items • Updated 4 days ago • 9

EXAONE 4.5

LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 3 items • Updated 2 days ago • 26

DFlash

Block Diffusion for Flash Speculative Decoding • 13 items • Updated 5 days ago • 47

upvoted a paper 2 days ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 46

upvoted a collection 3 days ago

Ace-Step 1.5-xl

3 items • Updated 9 days ago • 65

upvoted a collection 8 days ago

Gemma 4

8 items • Updated 8 days ago • 547

upvoted a paper 9 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 16 days ago • 51

upvoted 2 collections 10 days ago

Bonsai-Auxiliary

3 items • Updated 10 days ago • 7

Bonsai

1-bit Bonsai models • 6 items • Updated 10 days ago • 163

upvoted 2 collections 17 days ago

Open Coding Agents

13 items • Updated Mar 5 • 52

MolmoWeb

This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 6 items • Updated about 1 hour ago • 22

upvoted a paper 24 days ago

Attention Residuals

Paper • 2603.15031 • Published 26 days ago • 177

upvoted a paper 26 days ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 150

upvoted a collection about 1 month ago

Qwen3.5

21 items • Updated Mar 9 • 1.48k

upvoted 2 papers about 2 months ago

SERA: Soft-Verified Efficient Repository Agents

Paper • 2601.20789 • Published Jan 28 • 13

NOSA: Native and Offloadable Sparse Attention

Paper • 2510.13602 • Published Oct 15, 2025 • 7

upvoted 2 collections about 2 months ago

Devstral 2

A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 2 items • Updated Mar 2 • 52

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 5 days ago • 84