-
-
-
-
-
-
Inference Providers
Active filters:
ModelOpt
Text Generation
•
0.4B
•
Updated
•
169
nvidia/gpt-oss-120b-Eagle3-long-context
Text Generation
•
0.2B
•
Updated
•
4.2k
•
57
jonlizardo/affine-gpt-oss-120b-light
Text Generation
•
0.2B
•
Updated
•
1
nvidia/Phi-4-multimodal-instruct-FP8
6B
•
Updated
•
30.6k
•
4
nvidia/Phi-4-reasoning-plus-FP8
15B
•
Updated
•
527
•
3
nvidia/Phi-4-reasoning-plus-NVFP4
8B
•
Updated
•
6.96k
•
6
nvidia/Llama-3.1-8B-Instruct-NVFP4
5B
•
Updated
•
95.2k
•
6
Text Generation
•
5B
•
Updated
•
9.31k
•
13
Text Generation
•
8B
•
Updated
•
5.01k
•
3
Text Generation
•
8B
•
Updated
•
17.6k
•
5
Text Generation
•
15B
•
Updated
•
3.07k
•
2
Text Generation
•
17B
•
Updated
•
13.2k
•
5
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
669
•
7
nvidia/gpt-oss-120b-Eagle3-short-context
Text Generation
•
Updated
•
5.84k
•
14
nvidia/DeepSeek-V3.1-NVFP4
Text Generation
•
394B
•
Updated
•
61.7k
•
12
nvidia/gpt-oss-120b-Eagle3-throughput
Text Generation
•
Updated
•
795
•
33
Daemontatox/Qwen3-L-NVFP4
Text Generation
•
133B
•
Updated
•
1
nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4
Text Generation
•
120B
•
Updated
•
671
•
1
nvidia/Qwen3-Coder-480B-A35B-Instruct-NVFP4
Text Generation
•
241B
•
Updated
•
67
eugene141759/affine-best-5FsZP1ipNDE6Esg9rf8AnepyXQFC8xRKQFWPRRFr15p9covj
Text Generation
•
394B
•
Updated
•
44