Multi-modal Multilingual Instruction

university

https://m3-it.github.io

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

yaolily submitted a paper 14 days ago

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

tobiaslee authored a paper 2 months ago

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

tobiaslee submitted a paper 2 months ago

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

View all activity

Collections 1

spaces 1

VL RewardBench

Explore vision-language model performance on VL-RewardBench

models 9

MMInstruction/Qwen2-VL-72B-Video-T3

73B • Updated Dec 23, 2024 • 1

MMInstruction/Giraffe

8B • Updated Dec 17, 2024 • 3 • 2

MMInstruction/LongVA-7B-Video-T3

8B • Updated Oct 26, 2024 • 17

MMInstruction/Qwen-VL-ArXivCap

Text Generation • Updated May 6, 2024 • 3 • 4

MMInstruction/Qwen-VL-ArXivQA

Text Generation • Updated May 6, 2024 • 3 • 4

MMInstruction/Silkie

Text Generation • Updated Dec 20, 2023 • 9 • 12

MMInstruction/YingVLM

Updated Aug 16, 2023 • 3 • 1

MMInstruction/YingVLM-zh

Updated Aug 10, 2023 • 1

MMInstruction/YingVLM-Video

Updated Aug 10, 2023 • 1

datasets 17

MMInstruction/stock_factors

Viewer • Updated Dec 8, 2025 • 48.2M • 2.23k • 1

MMInstruction/OSWorld-G

Viewer • Updated May 22, 2025 • 510 • 85 • 6

MMInstruction/VL-RewardBench

Viewer • Updated May 19, 2025 • 1.25k • 284 • 14

MMInstruction/Video-T3-QA

Viewer • Updated Feb 24, 2025 • 162k • 122 • 2

MMInstruction/SuperClevr_Val

Viewer • Updated Feb 18, 2025 • 5k • 403 • 1

MMInstruction/Clevr_CoGenT_TrainA_R1

Viewer • Updated Feb 13, 2025 • 37.8k • 514 • 48

MMInstruction/Clevr_CoGenT_TrainA_70K_Complex

Viewer • Updated Feb 5, 2025 • 70k • 640 • 8

MMInstruction/Clevr_CoGenT_ValB

Viewer • Updated Feb 3, 2025 • 5k • 18 • 2

MMInstruction/Clevr_CoGenT_ValA

Viewer • Updated Feb 3, 2025 • 5k • 365 • 1

MMInstruction/Clevr_CoAgent_TrainA_R1

Viewer • Updated Feb 2, 2025 • 2.5k • 8

View 17 datasets