3 104 4

ltl

2793145003

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

upvoted a paper 23 days ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

upvoted a paper 29 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

View all activity

Organizations

None yet

upvoted a paper 19 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 25 days ago • 115

upvoted a paper 23 days ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published 24 days ago • 71

upvoted a paper 29 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 29 days ago • 167

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 114k • • 1.06k

upvoted 3 papers about 2 months ago

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Paper • 2511.12609 • Published Nov 16, 2025 • 103

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 210

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published Nov 7, 2025 • 54

upvoted 2 papers 3 months ago

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published Oct 15, 2025 • 72

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 538

upvoted 2 papers 4 months ago

Set Block Decoding is a Language Model Inference Accelerator

Paper • 2509.04185 • Published Sep 4, 2025 • 52

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

upvoted 2 papers 5 months ago

PixNerd: Pixel Neural Field Diffusion

Paper • 2507.23268 • Published Jul 31, 2025 • 51

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29, 2025 • 136

upvoted 2 papers 6 months ago

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19, 2025 • 88

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

upvoted 2 papers 7 months ago

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16, 2025 • 26

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27, 2025 • 109

liked a model 7 months ago

ByteDance-Seed/BAGEL-7B-MoT

Any-to-Any • 15B • Updated 26 days ago • 643 • 1.17k

upvoted 2 papers 8 months ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21, 2025 • 97

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Paper • 2410.10594 • Published Oct 14, 2024 • 28

ltl

AI & ML interests

Recent Activity

Organizations

ltl's activity