CT-Punc
Punctuation Restoration — automatically add punctuation to ASR output text.
CT-Punc (Controllable Time-delay Punctuation) restores punctuation marks for unpunctuated text, commonly used as a post-processing step after speech recognition.
Quick Start
from funasr import AutoModel
# Standalone punctuation restoration
model = AutoModel(model="funasr/ct-punc", hub="hf", device="cuda")
result = model.generate(input="我们今天讨论三个议题首先是产品发布其次是市场策略最后是团队建设")
print(result[0]["text"])
# → 我们今天讨论三个议题,首先是产品发布,其次是市场策略,最后是团队建设。
Use as Part of ASR Pipeline
from funasr import AutoModel
model = AutoModel(
model="funasr/paraformer-zh",
hub="hf",
vad_model="funasr/fsmn-vad",
punc_model="funasr/ct-punc",
device="cuda",
)
result = model.generate(input="audio.wav")
# Output text includes punctuation automatically
Features
- Chinese and English punctuation restoration
- Low latency, suitable for streaming pipelines
- Integrates seamlessly with FunASR ASR models
Links
- GitHub: FunASR
- Docs: modelscope.github.io/FunASR
- Downloads last month
- 875