OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published about 24 hours ago • 36
Metis-SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start Paper • 2510.25801 • Published Oct 29, 2025
DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling Paper • 2412.04905 • Published Dec 6, 2024 • 8