Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

19,114

Full-text search

Active filters: grpo

snap-stanford/humanlm-opinion

Text Generation • 8B • Updated 2 days ago • 22 • 6

lightx2v/Wan2.1-T2V-1.3B-longcat-step1500

Text-to-Video • Updated 4 days ago • 40 • 5

lightx2v/Wan2.1-T2V-1.3B-longcat-step500

Text-to-Video • Updated 4 days ago • 75 • 4

lightx2v/Wan2.1-T2V-1.3B-longcat-step1000

Text-to-Video • Updated 4 days ago • 13 • 3

LightningRodLabs/Golf-Forecaster

Text Generation • Updated about 12 hours ago • 19 • 3

MING-ZCH/MetaphorStar-32B

Image-Text-to-Text • 33B • Updated 2 days ago • 17 • 2

LightningRodLabs/Trump-Forecaster

Text Generation • Updated about 12 hours ago • 97 • 2

ericrisco/salamandra-7b-r1

8B • Updated Feb 18, 2025 • 21 • 2

almaghrabima/ALLaM-Thinking

7B • Updated Mar 21, 2025 • 41 • 5

Jeremmmyyyyy/gemma-3-1b-Math

Text Generation • 1.0B • Updated May 4, 2025 • 3 • 1

Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl

Reinforcement Learning • 2B • Updated Jul 5, 2025 • 27 • 3

xypkent/visjudge-7b

Image-Text-to-Text • Updated Dec 17, 2025 • 65 • 4

Paulescu/LFM2-350M-browsergym-20251224-013119

Text Generation • 0.4B • Updated Dec 24, 2025 • 39 • 2

MING-ZCH/MetaphorStar-3B

Image-Text-to-Text • 4B • Updated 2 days ago • 16 • 1

MING-ZCH/MetaphorStar-7B

Image-Text-to-Text • 8B • Updated 2 days ago • 14 • 1

Jarrodbarnes/opensec-gdpo-4b

Text Generation • 4B • Updated 3 days ago • 81 • 1

sdan/jokegen2-1t-rl

Updated 19 days ago • 7

ragtag1/qwen3-8b-historical-final

Text Generation • Updated 5 days ago • 13 • 1

onuryozcu/llama

Text Generation • 0.1B • Updated Mar 10, 2025 • 4

amiguel/promptTuning

8B • Updated Feb 16, 2025

sergiopaniego/Qwen2-0.5B-GRPO-test

Updated Oct 3, 2025

Novaciano/ESP-NSFW-GRPO-1B-Sin_Censura-GGUF

1B • Updated Jan 28, 2025 • 69 • 4

nbd22/Llama-3.1-8B-Instruct-GRPO-gsm8k-ft-lora

Updated Jan 28, 2025

sergiopaniego/Qwen2-0.5B-GRPO

Updated Jan 31, 2025

philschmid/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Jan 30, 2025 • 7 • 8

spinech/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Apr 28, 2025 • 4

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • 2B • Updated Feb 2, 2025 • 1 • 1

yooneo/qwen-0.5b-r1-aha

Updated Jan 31, 2025

yooneo/qwen-1.5b-r1-aha

Updated Jan 31, 2025

spinech/qwen2.5-3b-r1-rearc-stage1

Text Generation • 3B • Updated Apr 28, 2025 • 9