HoliSafe: Holistic Safety Benchmarking and Modeling with Safety Meta Token for Vision-Language Model Paper • 2506.04704 • Published Jun 5, 2025 • 3
MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model Paper • 2603.18892 • Published 17 days ago • 1
MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents Paper • 2603.09827 • Published 26 days ago • 29
MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents Paper • 2603.09827 • Published 26 days ago • 29
🛡️ Safe-VLMs Collection Safe Vision-Language Models with Visual Guard Module (https://huggingface.co/spaces/etri-vilab/Ko-LLaVA) • 9 items • Updated 16 days ago • 2