❄️January 2025 - Open releases from the Chinese community - a zh-ai-community Collection

zh-ai-community 's Collections

2026 January⛄️ - China Open Source Highlights

🎄December 2025 - China Open Source Highlights

🍁 November 2025 - China Open Source Highlights

🎆 October 2025 - China Open Source Highlights

🎑 September 2025 - China Open Source Highlights

🏖️ August 2025 - China Open Source Highlights

🧩 July 2025 - Open works from the Chinese community

🍉 June 2025 - Open works from the Chinese community

🌞 May 2025 - Open works from the Chinese community

🌸 April 2025 - Open releases from the Chinese community

🌙 March 2025 - Open releases from the Chinese community

🧧 February 2025 - Open releases from the Chinese community

❄️January 2025 - Open releases from the Chinese community

🧠 Reasoning model 2025

⭐ 3D models - 2025

💻 Coding models 2025

🎬 Video model 2025

🎨Image model 2025

🔊 Audio model 2025

🖼️ MLLM 2025

🧠 Reasoning Models

🎬 Video models

🔊 Audio Models

🔢 Math models

💻 Code Models

🎨 Image models

🚀 Trending Demo

📊 Trending Datasets

🏆 Leaderboards & Arenas

❄️January 2025 - Open releases from the Chinese community

updated 1 day ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 53.1k • 3.55k
deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated Feb 1, 2025 • 6.78k • 466
tencent/Hunyuan3D-2

Image-to-3D • Updated Oct 17, 2025 • 64.6k • 1.69k
tencent/Hunyuan-7B-Instruct-0124

Text Generation • Updated Jul 30, 2025 • 83 • 50
ByteDance/Sa2VA-4B

Image-Text-to-Text • 4B • Updated Sep 8, 2025 • 144k • • 91

Note A unified model for dense grounded understanding of images & videos.
ByteDance-Seed/UI-TARS-72B-DPO

Image-Text-to-Text • 73B • Updated Jan 25, 2025 • 791 • 148
deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 436k • • 12.9k

Note 660B reasoning models with MIT license
deepseek-ai/DeepSeek-R1-Zero

Text Generation • 685B • Updated Mar 27, 2025 • 2.87k • 940
MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • 456B • Updated Jul 3, 2025 • 68.1k • 282

Note A non transformer based ( ViT-MLP-LLM framework) VLM
MiniMaxAI/MiniMax-Text-01

Text Generation • 456B • Updated Jul 3, 2025 • 1.82k • 652

Note 456B LLM with 1M tokens training context
Qwen/Qwen2.5-Math-PRM-7B

Text Classification • 8B • Updated Jan 17, 2025 • 35.7k • 80

Note Math model
Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • 15B • Updated Jan 29, 2025 • 4.92k • • 331
openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Oct 5, 2025 • 79.3k • 1.28k

Note End-side multimodal LLM that supports real time conversation and video understanding.
ICTNLP/llava-mini-llama-3.1-8b

Image-Text-to-Text • 9B • Updated Jan 13, 2025 • 314 • 56
BlinkDL/rwkv-7-world

Text Generation • Updated May 31, 2025 • 104

Note RNN+Transfomers
HKUSTAudio/Llasa-3B

Text-to-Speech • 4B • Updated May 10, 2025 • 710 • 523

Note TTS
DAMO-NLP-SG/VideoLLaMA3-7B

Video-Text-to-Text • 8B • Updated Sep 2, 2025 • 84k • 71
internlm/internlm3-8b-instruct

Text Generation • 9B • Updated Feb 11, 2025 • 10.1k • 228
baichuan-inc/Baichuan-M1-14B-Base

14B • Updated Feb 20, 2025 • 151 • 31

Note Medical LLM
opencsg/Fineweb-Edu-Chinese-V2.1

Viewer • Updated Feb 27, 2025 • 958M • 30k • 57

Note Dataset designed specifically for natural language processing (NLP) tasks in the education sector.
DAMO-NLP-SG/multimodal_textbook

Updated Mar 17, 2025 • 3.81k • 156

Note A multimodel dataset for vision language pretraining , includes 6.5M images + 0.8B text from 22k hours of instructional videos
hithink-ai/MME-Finance

Viewer • Updated May 30, 2025 • 2.06k • 173 • 8
KlingTeam/GameFactory-Dataset

Updated Mar 22, 2025 • 522 • 14
m-a-p/YuE-s1-7B-anneal-zh-cot

Text Generation • 6B • Updated Mar 12, 2025 • 317 • 40
m-a-p/YuE-s1-7B-anneal-jp-kr-cot

Text Generation • 6B • Updated Mar 12, 2025 • 521 • 22
m-a-p/YuE-s1-7B-anneal-en-cot

Text Generation • 6B • Updated Mar 12, 2025 • 7.16k • 436
Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6, 2025 • 2.13M • 582
Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 2.3M • • 1.41k
Running on Zero

3.14k

Hunyuan3D-2.0

🌍

3.14k

Text-to-3D and Image-to-3D Generation
Running

65

UI-TARS

🌖

65

Find click coordinates on images based on instructions
Running

64

MiniMaxVL01

💬

64

Generate responses to text and images in a chat interface
Running on Zero

Featured

2.01k

Chat With Janus-Pro-7B

🌍

2.01k

A unified multimodal understanding and generation model.