Inference Optimization
community
Recent Activity
ChibuUkachi updated a model about 9 hours ago: inference-optimization/ctest-Qwen3-8B-speculator.dflash
ChibuUkachi published a model about 9 hours ago: inference-optimization/ctest-Qwen3-8B-speculator.dflash
ChibuUkachi updated a model about 11 hours ago: inference-optimization/MiniMax-M2.5-NVFP4
Team members: 15
inference-optimization's models: 323
inference-optimization/Qwen3-8B_5.5_bits_mode_noise • 6B • Updated Mar 12 • 9
inference-optimization/Qwen3-8B_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 8
inference-optimization/Qwen3-8B_5_bits_mode_heuristic • 6B • Updated Mar 12 • 8
inference-optimization/Qwen3-8B_5_bits_mode_noise • 6B • Updated Mar 12 • 9
inference-optimization/Qwen3-8B_5_bits_mode_hybrid • 6B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_heuristic • 7B • Updated Mar 12 • 9
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_noise • 7B • Updated Mar 12 • 7
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_hybrid • 7B • Updated Mar 12 • 10
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_heuristic • 7B • Updated Mar 12 • 15
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_noise • 7B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_hybrid • 7B • Updated Mar 12 • 6
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_heuristic • 6B • Updated Mar 12 • 7
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_noise • 6B • Updated Mar 12 • 10
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_hybrid • 6B • Updated Mar 12 • 9
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_heuristic • 6B • Updated Mar 12 • 7
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_noise • 6B • Updated Mar 12 • 9
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 13
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_heuristic • 6B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_noise • 6B • Updated Mar 12 • 8
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_hybrid • 6B • Updated Mar 12 • 11
inference-optimization/sarvam-105b-FP8-Dynamic • Text Generation • 106B • Updated Mar 9 • 3
inference-optimization/sarvam-30b-FP8-Dynamic • Text Generation • 32B • Updated Mar 9 • 60 • 1
inference-optimization/sarvam-30b-NVFP4 • Text Generation • 19B • Updated Mar 9 • 25 • 1
inference-optimization/sarvam-105b-NVFP4 • 61B • Updated Mar 9 • 3 • 1
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic • 35B • Updated Mar 6 • 11
inference-optimization/Kimi-K2-Instruct-0905-BF16-FP8-BLOCK • Text Generation • 1T • Updated Mar 6 • 6
inference-optimization/MiniMax-M2.5-BF16 • Text Generation • 229B • Updated Mar 6 • 137
inference-optimization/gpt-oss-20b-FP8-Dynamic • 21B • Updated Mar 5 • 11
inference-optimization/test_qwen3_next_mtp • Updated Mar 4 • 3
inference-optimization/test_tencentbac_fastmtp • Updated Mar 4 • 3