inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-noise-per-tensor 7B • Updated 28 days ago • 55
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-hybrid-per-tensor 7B • Updated 28 days ago • 52
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-heuristic-per-tensor 7B • Updated 28 days ago • 58
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-noise-per-tensor 7B • Updated 28 days ago • 51
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-hybrid-per-tensor 7B • Updated 28 days ago • 66
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-heuristic-per-tensor 7B • Updated 28 days ago • 55
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-noise-per-tensor 6B • Updated 28 days ago • 62
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-hybrid-per-tensor 6B • Updated 28 days ago • 64
inference-optimization/Llama-3.1-8B-Instruct-6-bits-mode-heuristic-per-tensor 6B • Updated 28 days ago • 70
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-noise-per-tensor 6B • Updated 28 days ago • 52
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-hybrid-per-tensor 6B • Updated 28 days ago • 59
inference-optimization/Llama-3.1-8B-Instruct-5.5-bits-mode-heuristic-per-tensor 6B • Updated 28 days ago • 53
inference-optimization/Llama-3.1-8B-Instruct-5-bits-mode-noise-per-tensor 5B • Updated 28 days ago • 46