CT-Punc

Punctuation Restoration — automatically add punctuation to ASR output text.

CT-Punc (Controllable Time-delay Punctuation) restores punctuation marks for unpunctuated text, commonly used as a post-processing step after speech recognition.

Quick Start

from funasr import AutoModel

# Standalone punctuation restoration
model = AutoModel(model="funasr/ct-punc", hub="hf", device="cuda")
result = model.generate(input="我们今天讨论三个议题首先是产品发布其次是市场策略最后是团队建设")
print(result[0]["text"])
# → 我们今天讨论三个议题,首先是产品发布,其次是市场策略,最后是团队建设。

Use as Part of ASR Pipeline

from funasr import AutoModel

model = AutoModel(
    model="funasr/paraformer-zh",
    hub="hf",
    vad_model="funasr/fsmn-vad",
    punc_model="funasr/ct-punc",
    device="cuda",
)
result = model.generate(input="audio.wav")
# Output text includes punctuation automatically

Features

  • Chinese and English punctuation restoration
  • Low latency, suitable for streaming pipelines
  • Integrates seamlessly with FunASR ASR models

Links

Downloads last month
875
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Spaces using funasr/ct-punc 3