OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence Paper • 2602.08683 • Published 7 days ago • 36
ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder Paper • 2510.18795 • Published Oct 21, 2025 • 11
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset Paper • 2601.10305 • Published Jan 15 • 36
Sleeping 70113 ImgGen Diffusion ControlNetxLoRA 🐢 Transform your images into styled artwork using control patterns
Sleeping 70113 ImgGen Diffusion ControlNetxLoRA 🐢 Transform your images into styled artwork using control patterns