rovo's Collections
Audio

updated 25 days ago

  • fishaudio/fish-speech-1.5

    Text-to-Speech • Updated Mar 25, 2025 • 6.31k • 745

  • suno/bark

    Text-to-Speech • Updated Oct 4, 2023 • 18.1k • 1.53k

  • SWivid/F5-TTS

    Text-to-Speech • Updated Mar 21, 2025 • 587k • 1.17k

  • NexaAI/OmniAudio-2.6B

    Audio-Text-to-Text • 0.6B • Updated Dec 13, 2024 • 1.51k • 289

  • 3DAudio-Spectrum-Analyzer

    Space • Running • 20 • One-minute creation by AI Coding Autonomous Agent • https://huggingface.co/spaces/VIDraft/mouse-webgen

  • sesame/csm-1b

    Text-to-Speech • 2B • Updated Dec 1, 2025 • 244k • 2.38k

  • argmaxinc/whisperkit-coreml

    Automatic Speech Recognition • Updated Apr 24 • 10.1M • 182