Wang Yu

vvangfaye

AI & ML interests

None yet

Recent Activity

liked a dataset 2 days ago

GD-ML/IntTravel_dataset

upvoted a paper 4 days ago

Code2World: A GUI World Model via Renderable Code Generation

updated a dataset 5 days ago

vvangfaye/SocioSeg

View all activity

Organizations

upvoted a paper 4 days ago

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published 5 days ago • 186

upvoted a paper 10 days ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published 12 days ago • 146

upvoted a paper 16 days ago

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Paper • 2601.20354 • Published 18 days ago • 110

upvoted a paper 17 days ago

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published 18 days ago • 118

upvoted a paper 19 days ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 23 days ago • 175

upvoted a collection 19 days ago

Urban World Model

Collection

29 items • Updated Nov 28, 2025 • 1

upvoted a paper 21 days ago

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published 25 days ago • 74

upvoted 3 papers 24 days ago

UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation

Paper • 2601.11522 • Published 30 days ago • 17

RemoteVAR: Autoregressive Visual Modeling for Remote Sensing Change Detection

Paper • 2601.11898 • Published 29 days ago • 4

Think3D: Thinking with Space for Spatial Reasoning

Paper • 2601.13029 • Published 27 days ago • 47

upvoted a paper 30 days ago

Urban Socio-Semantic Segmentation with Vision-Language Reasoning

Paper • 2601.10477 • Published about 1 month ago • 155

upvoted 2 papers about 1 month ago

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 166

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published Dec 30, 2025 • 63

upvoted a paper 3 months ago

Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum

Paper • 2510.27571 • Published Oct 31, 2025 • 19

upvoted a paper 4 months ago

Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training

Paper • 2510.12586 • Published Oct 14, 2025 • 113

upvoted a paper 5 months ago

Tree Search for LLM Agent Reinforcement Learning

Paper • 2509.21240 • Published Sep 25, 2025 • 92

upvoted a paper 9 months ago

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Paper • 2505.14231 • Published May 20, 2025 • 53

upvoted a paper 11 months ago

When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

Paper • 2503.07588 • Published Mar 10, 2025 • 7

Wang Yu

AI & ML interests

Recent Activity

Organizations

vvangfaye's activity