KW

kevineen

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

liked a model 2 days ago

OpenMOSS-Team/MOSS-SoundEffect

liked a model 2 days ago

RikkaBotan/quantized-stable-static-embedding-fast-retrieval-mrl-bilingual-ja-en

upvoted a paper 7 days ago

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Paper • 2602.22675 • Published 10 days ago • 22

upvoted an article 8 days ago

Mixture of Experts (MoEs) in Transformers

Article • 10 days ago • 121

upvoted an article 12 days ago

Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach

Article • Nov 24, 2024 • 20

upvoted a collection 14 days ago

GPT-OSS-Swallow-v0.1

4 items • Updated 16 days ago • 13

upvoted 3 papers about 1 month ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published Feb 2 • 60

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published Jan 20 • 23

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Paper • 2601.19798 • Published Jan 27 • 42

upvoted an article about 1 month ago

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

Article • Jan 20 • 40

upvoted a paper about 1 month ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published Jan 21 • 72

upvoted a collection about 2 months ago

TranslateGemma

3 items • Updated Jan 15 • 216

upvoted a paper about 2 months ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 214

upvoted a collection about 2 months ago

EnvScaler

The official datasets and models of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis" • 10 items • Updated 1 day ago • 3

upvoted a paper about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 288

upvoted an article about 2 months ago

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Article • Jan 5 • 82

upvoted a collection about 2 months ago

💧 LFM2.5

Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 12 days ago • 99

upvoted a paper 2 months ago

Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling

Paper • 2601.02346 • Published Jan 5 • 26

upvoted 3 articles 2 months ago

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

Article • Jan 5 • 64

Introducing Falcon H1R 7B

Article • Jan 5 • 58

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 792

upvoted a paper 2 months ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published Dec 26, 2025 • 25