NM Testing

company

Recent Activity

nm-autobot updated a model about 4 hours ago

nm-testing/w8a8_static_asym-e2e

nm-autobot updated a model about 4 hours ago

nm-testing/w8a8_dynamic_asym-e2e

nm-autobot updated a model about 4 hours ago

nm-testing/w8a16_grouped_quant-e2e

nm-testing's models (559)

nm-testing/DeepSeek-V2-Lite-FP8-BLOCK-Fused

16B • Updated Feb 6 • 3

nm-testing/woah

0.6B • Updated Feb 4 • 1

nm-testing/Meta-Llama-3-8B-Instruct-NVFP4A16-GPTQ

5B • Updated Jan 29 • 4

nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-GPTQ-ActOrder

5B • Updated Jan 29 • 3

nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-GPTQ

5B • Updated Jan 29 • 3

nm-testing/Meta-Llama-3-8B-Instruct-NVFP4

5B • Updated Jan 28 • 3

nm-testing/Meta-Llama-3-8B-Instruct-MXFP4A16-GPTQ

5B • Updated Jan 28 • 2

nm-testing/Speculator-Qwen3-30B-MOE-VL-Eagle3

0.4B • Updated Jan 22 • 475

nm-testing/Qwen3-0.6B-FP8_BLOCK

0.6B • Updated Jan 20 • 157

nm-testing/Qwen3-0.6B-W4A16-G128

0.6B • Updated Jan 20 • 2

nm-testing/Llama-3.2-1B-Instruct-DEBUG-STRAWBERRY

1B • Updated Jan 14 • 2

nm-testing/Llama-3.2-1B-Instruct-DEBUG-COUNTER

1B • Updated Jan 14 • 2

nm-testing/TinyLlama-1.1B-compressed-tensors-kv-cache-scheme

Text Generation • 1B • Updated Jan 14 • 352

nm-testing/TinyLlama-1.1B-Chat-v1.0-kvcache-fp8-attn_head

1B • Updated Jan 14 • 321

nm-testing/TinyLlama-1.1B-Chat-v1.0-kvcache-fp8-tensor

1B • Updated Jan 14 • 8.94k

nm-testing/Meta-Llama-3-8B-Instruct-awq-NVFP4

5B • Updated Jan 7 • 6

nm-testing/testing-llama3.1.8b-2layer-eagle3

1B • Updated Jan 5 • 604

nm-testing/CDH-test-nvfp4-awq

5B • Updated Dec 19, 2025 • 3

nm-testing/granite-4.0-h-small-FP8-dynamic

Text Generation • 32B • Updated Dec 3, 2025 • 37

nm-testing/tinysmokeqwen3moe-W4A16-first-only-CTstable

2.93M • Updated Nov 25, 2025 • 18.2k

nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head

Updated Nov 25, 2025

nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor

Updated Nov 25, 2025

nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head

Updated Nov 25, 2025

nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

Updated Nov 25, 2025

nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

Updated Nov 25, 2025

nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head

Updated Nov 25, 2025

nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Tensor

Updated Nov 25, 2025

nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Head

Updated Nov 25, 2025

nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

Updated Nov 25, 2025

nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head

Updated Nov 25, 2025