wo-datacraft's Collections

Text Generation - Vision

updated 23 days ago

  • google/gemma-3-27b-it

    Image-Text-to-Text • Updated Mar 21, 2025 • 1.66M • 1.9k

  • mistralai/Ministral-3-14B-Instruct-2512

    Updated Jan 15 • 339k • 256

  • Qwen/Qwen3-VL-30B-A3B-Instruct

    Image-Text-to-Text • Updated Nov 26, 2025 • 1.73M • 538

  • Qwen/Qwen3-VL-30B-A3B-Thinking

    Image-Text-to-Text • Updated Nov 26, 2025 • 165k • 191

  • moonshotai/Kimi-VL-A3B-Thinking-2506

    Image-Text-to-Text • Updated 26 days ago • 28.9k • 353

  • tencent/HunyuanOCR

    Image-Text-to-Text • Updated Jan 13 • 896k • 555

  • Qwen3 VL Demo 😻

    Running • Featured • 383

    Chat with an AI that understands text, images, and videos

  • Qwen3 VL Demo 😻

    Running • Featured • 107

    Chat with an AI assistant using text and images
