steven hobbs
shobbs
·
AI & ML interests
Vision
HRL
Edge
Recent Activity
updated a collection 6 days ago
video updated a collection 6 days ago
embed RAG updated a collection 20 days ago
NSFWOrganizations
papers
embed RAG
small and fast
bio
video llm llava
-
NVILA: Efficient Frontier Visual Language Models
Paper • 2412.04468 • Published • 61 -
unsloth/GLM-4.1V-9B-Thinking-GGUF
Image-Text-to-Text • 9B • Updated • 2.31k • 41 -
zai-org/GLM-4.5V
Image-Text-to-Text • 108B • Updated • 177k • • 717 -
rednote-hilab/dots.vlm1.inst
Image-Text-to-Text • Updated • 37 • 81
arm
Mobile use aka smart phone actions dataset
storytime
think and learn
-
deepseek-ai/DeepSeek-R1-0528
Text Generation • 685B • Updated • 3.27M • • 2.45k -
unsloth/ERNIE-4.5-300B-A47B-PT-GGUF
Text Generation • 299B • Updated • 915 • 9 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 561k • 147 -
cerebras/GLM-4.6-REAP-218B-A32B-FP8
Text Generation • Updated • 10 • 44
NSFW
vision
-
google/paligemma2-28b-pt-896
Image-Text-to-Text • Updated • 83 • 51 -
lmstudio-community/olmOCR-7B-0225-preview-GGUF
Image-Text-to-Text • 8B • Updated • 228 • 12 -
vidore/colqwen2.5-v0.2
Visual Document Retrieval • Updated • 92.2k • 99 -
vidore/colpali-v1.3
Visual Document Retrieval • Updated • 51.7k • 97
image art
video
Mobile use aka smart phone actions dataset
papers
storytime
embed RAG
think and learn
-
deepseek-ai/DeepSeek-R1-0528
Text Generation • 685B • Updated • 3.27M • • 2.45k -
unsloth/ERNIE-4.5-300B-A47B-PT-GGUF
Text Generation • 299B • Updated • 915 • 9 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 561k • 147 -
cerebras/GLM-4.6-REAP-218B-A32B-FP8
Text Generation • Updated • 10 • 44
small and fast
NSFW
bio
vision
-
google/paligemma2-28b-pt-896
Image-Text-to-Text • Updated • 83 • 51 -
lmstudio-community/olmOCR-7B-0225-preview-GGUF
Image-Text-to-Text • 8B • Updated • 228 • 12 -
vidore/colqwen2.5-v0.2
Visual Document Retrieval • Updated • 92.2k • 99 -
vidore/colpali-v1.3
Visual Document Retrieval • Updated • 51.7k • 97
video llm llava
-
NVILA: Efficient Frontier Visual Language Models
Paper • 2412.04468 • Published • 61 -
unsloth/GLM-4.1V-9B-Thinking-GGUF
Image-Text-to-Text • 9B • Updated • 2.31k • 41 -
zai-org/GLM-4.5V
Image-Text-to-Text • 108B • Updated • 177k • • 717 -
rednote-hilab/dots.vlm1.inst
Image-Text-to-Text • Updated • 37 • 81
image art
arm