Ai-Model
Image-Text-to-Text • 25B • Updated • 61.8k • 637
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 6.63M • 2.95k
SWivid/F5-TTS
Text-to-Speech • Updated • 671k • 1.16k
D-Edit
84
FacePoke
2.21k • Import a portrait, click to move the head!
Expression Editor
1.64k • Quickly edit the expression of a face
F5-TTS
2.85k • F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
FLUX.1 [dev]
9.43k • Generate images from text prompts with FLUX.1 diffusion model
Face Recognition SDK
234 • Face Recognition
Open NotebookLM
1.09k • Personalised Podcasts For All - Available in 13 Languages
PMRF
313 • A Gradio demo for Posterior-Mean Rectified Flow (PMRF)
stabilityai/stable-diffusion-3.5-large
Text-to-Image • Updated • 54.1k • 3.43k
genmo/mochi-1-preview
Text-to-Video • Updated • 5.32k • 1.32k
Freepik/flux.1-lite-8B-alpha
Text-to-Image • Updated • 243 • 427
rhymes-ai/Allegro
Text-to-Video • Updated • 196 • 264
CohereLabs/aya-expanse-8b
Text Generation • 8B • Updated • 18.7k • 424
deepseek-ai/Janus-1.3B
Any-to-Any • 2B • Updated • 3.76k • 595
Pangea
50 • A Fully Open Multilingual Multimodal LLM for 39 Languages
Etched/oasis-500m
Updated • 61 • 490
microsoft/OmniParser
Image-Text-to-Text • Updated • 252 • 1.71k
OuteAI/OuteTTS-0.1-350M
Text-to-Speech • Updated • 250 • 302
tencent/Tencent-Hunyuan-Large
Text Generation • Updated • 300 • 616
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation • 71B • Updated • 8.76k • 2.06k
tencent/HunyuanVideo
Text-to-Video • Updated • 1.21k • 2.16k
zai-org/CogVideoX-5b
Text-to-Video • Updated • 35.1k • 672
LanguageBind/Open-Sora-Plan-v1.2.0
Updated • 1 • 47
microsoft/phi-4
Text Generation • Updated • 512k • 2.24k
TRELLIS
4.78k • Scalable and Versatile 3D Generation from images
Search Your Face Online
836 • Track your online presence with reverse face search
Kolors Virtual Try-On
10k • Generate a virtual try-on image of a person wearing a garment
DeepSeek-R1 WebGPU
556 • Next-generation reasoning model that runs locally in-browser
AnyCoder
3.21k • Generate code snippets with AI
tencent/Hunyuan3D-2
Image-to-3D • Updated • 79.8k • 1.73k
openbmb/MiniCPM-o-2_6
Any-to-Any • 9B • Updated • 129k • 1.29k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • Updated • 164k • 769
Magic Face
244 • Transform Your Face Into Legendary Characters!
Llasa 3b Tts
314 • Zero Shot voice cloning with llasa 3b (Unofficial Demo)
mistralai/Mistral-Small-24B-Instruct-2501
Updated • 124k • 950
Pyramid Flow
669 • Generate videos from text prompts and optional images
microsoft/OmniParser-v2.0
Updated • 858 • 1.32k
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech • Updated • 1.63k • 1.1k
agentica-org/DeepScaleR-1.5B-Preview
Text Generation • 2B • Updated • 13.1k • 577
stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text • 132B • Updated • 165 • 458
hexgrad/Kokoro-82M
Text-to-Speech • Updated • 9.68M • 6.03k
black-forest-labs/FLUX.1-dev
Text-to-Image • Updated • 697k • 12.7k
NousResearch/DeepHermes-3-Llama-3-8B-Preview
Text Generation • 8B • Updated • 306 • 352