MBZUAI/CoME-VL
Image-Text-to-Text • Updated • 26 • 3
Natural Language Processing, Machine Learning, and Computer Vision
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework
LinguDistill: Recovering Linguistic Ability in Vision- Language Models via Selective Cross-Modal Distillation