Towards the Aha Moment of Vision-Language Models
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 9
MMInstruction/Qwen2-VL-72B-Video-T3
73B • Updated
• 1
MMInstruction/Giraffe
8B • Updated
• 3 • 2
MMInstruction/LongVA-7B-Video-T3
8B • Updated
• 17
MMInstruction/Qwen-VL-ArXivCap
Text Generation • Updated
• 3 • 4
MMInstruction/Qwen-VL-ArXivQA
Text Generation • Updated
• 3 • 4
MMInstruction/Silkie
Text Generation • Updated
• 9 • 12
MMInstruction/YingVLM
Updated
• 3 • 1
MMInstruction/YingVLM-zh
Updated
• 1
MMInstruction/YingVLM-Video
Updated
• 1
datasets 17
MMInstruction/stock_factors
Viewer
• Updated
• 48.2M • 2.23k • 1
MMInstruction/OSWorld-G
Viewer
• Updated
• 510 • 85 • 6
MMInstruction/VL-RewardBench
Viewer
• Updated
• 1.25k • 284 • 14
MMInstruction/Video-T3-QA
Viewer
• Updated
• 162k • 122 • 2
MMInstruction/SuperClevr_Val
Viewer
• Updated
• 5k • 403 • 1
MMInstruction/Clevr_CoGenT_TrainA_R1
Viewer
• Updated
• 37.8k • 514 • 48
MMInstruction/Clevr_CoGenT_TrainA_70K_Complex
Viewer
• Updated
• 70k • 640 • 8
MMInstruction/Clevr_CoGenT_ValB
Viewer
• Updated
• 5k • 18 • 2
MMInstruction/Clevr_CoGenT_ValA
Viewer
• Updated
• 5k • 365 • 1
MMInstruction/Clevr_CoAgent_TrainA_R1
Viewer
• Updated
• 2.5k • 8