inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-noise-per-tensor 7B • Updated 5 days ago • 23
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-hybrid-per-tensor 7B • Updated 5 days ago • 21
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-heuristic-per-tensor 7B • Updated 5 days ago • 28
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-noise-per-tensor 7B • Updated 5 days ago • 20
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-hybrid-per-tensor 7B • Updated 5 days ago • 34
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-heuristic-per-tensor 7B • Updated 5 days ago • 26
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-noise-per-tensor 6B • Updated 5 days ago • 32
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-hybrid-per-tensor 6B • Updated 5 days ago • 33
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-heuristic-per-tensor 6B • Updated 5 days ago • 41
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-noise-per-tensor 6B • Updated 5 days ago • 23
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-hybrid-per-tensor 6B • Updated 5 days ago • 28
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-heuristic-per-tensor 6B • Updated 5 days ago • 23