inference-optimization/Llama-3.1-8B-Instruct-5-bits-mode-hybrid-per-tensor 5B • Updated 4 days ago • 21
inference-optimization/Llama-3.1-8B-Instruct-5-bits-mode-heuristic-per-tensor 5B • Updated 4 days ago • 23
inference-optimization/Llama-3.2-3B-Instruct-7-bits-mode-noise-per-tensor 3B • Updated 4 days ago • 23
inference-optimization/Llama-3.2-3B-Instruct-7-bits-mode-hybrid-per-tensor 3B • Updated 4 days ago • 20
inference-optimization/Llama-3.2-3B-Instruct-7-bits-mode-heuristic-per-tensor 3B • Updated 4 days ago • 25
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-noise-per-tensor 3B • Updated 4 days ago • 20
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-hybrid-per-tensor 3B • Updated 4 days ago • 25
inference-optimization/Llama-3.2-3B-Instruct-6.5-bits-mode-heuristic-per-tensor 3B • Updated 4 days ago • 21
inference-optimization/Llama-3.2-3B-Instruct-6-bits-mode-noise-per-tensor 3B • Updated 4 days ago • 27
inference-optimization/Llama-3.2-3B-Instruct-6-bits-mode-hybrid-per-tensor 3B • Updated 4 days ago • 32
inference-optimization/Llama-3.2-3B-Instruct-6-bits-mode-heuristic-per-tensor 3B • Updated 4 days ago • 53
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-noise-per-tensor 3B • Updated 4 days ago • 26
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-hybrid-per-tensor 3B • Updated 4 days ago • 22
inference-optimization/Llama-3.2-3B-Instruct-5.5-bits-mode-heuristic-per-tensor 3B • Updated 4 days ago • 23
inference-optimization/Llama-3.2-3B-Instruct-5-bits-mode-noise-per-tensor 3B • Updated 4 days ago • 20
inference-optimization/Llama-3.2-3B-Instruct-5-bits-mode-hybrid-per-tensor 3B • Updated 4 days ago • 22
inference-optimization/Llama-3.2-3B-Instruct-5-bits-mode-heuristic-per-tensor 3B • Updated 4 days ago • 22
inference-optimization/Llama-3.2-1B-Instruct-7-bits-mode-noise-per-tensor 1B • Updated 4 days ago • 23
inference-optimization/Llama-3.2-1B-Instruct-7-bits-mode-hybrid-per-tensor 1B • Updated 4 days ago • 24
inference-optimization/Llama-3.2-1B-Instruct-7-bits-mode-heuristic-per-tensor 1B • Updated 4 days ago • 22
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-noise-per-tensor 1B • Updated 4 days ago • 22
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-hybrid-per-tensor 1B • Updated 4 days ago • 21
inference-optimization/Llama-3.2-1B-Instruct-6.5-bits-mode-heuristic-per-tensor 1B • Updated 4 days ago • 22
inference-optimization/Llama-3.2-1B-Instruct-6-bits-mode-noise-per-tensor 1B • Updated 4 days ago • 29
inference-optimization/Llama-3.2-1B-Instruct-6-bits-mode-hybrid-per-tensor 1B • Updated 4 days ago • 21
inference-optimization/Llama-3.2-1B-Instruct-6-bits-mode-heuristic-per-tensor 1B • Updated 4 days ago • 23
inference-optimization/Llama-3.2-1B-Instruct-5.5-bits-mode-noise-per-tensor 1B • Updated 4 days ago • 23
inference-optimization/Llama-3.2-1B-Instruct-5.5-bits-mode-hybrid-per-tensor 1B • Updated 4 days ago • 23
inference-optimization/Llama-3.2-1B-Instruct-5.5-bits-mode-heuristic-per-tensor 1B • Updated 4 days ago • 25
inference-optimization/Llama-3.2-1B-Instruct-5-bits-mode-noise-per-tensor 1B • Updated 4 days ago • 22