Research Papers - a NeoCodes-dev Collection

updated Nov 22, 2025

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 63
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning

Paper • 2502.15425 • Published Feb 21, 2025 • 9
EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5, 2025 • 46
Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3, 2025 • 85
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

Paper • 2502.14669 • Published Feb 20, 2025 • 15
Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26, 2025 • 168
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

Paper • 2406.06469 • Published Jun 10, 2024 • 29
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots

Paper • 2409.10277 • Published Sep 16, 2024 • 1
ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18, 2025 • 433k • 466
Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18, 2025 • 58
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper • 2504.17432 • Published Apr 24, 2025 • 40
togethercomputer/StripedHyena-Nous-7B

Text Generation • 8B • Updated Mar 27, 2024 • 48 • 143
ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26, 2025 • 45
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 147
ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published May 29, 2025 • 22
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos

Paper • 2507.15597 • Published Jul 21, 2025 • 34
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR

Paper • 2509.23808 • Published Sep 28, 2025 • 47
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models

Paper • 2510.03561 • Published Oct 3, 2025 • 24
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501
moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated 17 days ago • 90.7k • 515
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published Nov 12, 2025 • 69
UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity

Paper • 2511.13714 • Published Nov 17, 2025 • 10
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Paper • 2511.14460 • Published Nov 18, 2025 • 20
Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 28