Introduction

This repository hosts the Kokoro model for the React Native Executorch library. It can perform speech synthesis in 8 different languages, including fine-tuned Polish and German.

The models support input shape dynamism and cover the input range of 1 up to 128 tokens.

Additionally, the repository contains essential resources for G2P (grapheme-to-phoneme) preprocessing (see v0.9.0 branch) required by the Kokoro model, including simple word-by-word phonemization models (also in ExecuTorch format).

If you'd like to run these models in your own ExecuTorch runtime, refer to the official documentation for setup instructions.

Compatibility

These models were exported using v1.0.0 version of ExecuTorch and no forward compatibility is guaranteed. Older versions of the runtime may not work with these files.

The models are intended to be used within the React Native ExecuTorch package. If you want to use them outside the package, make sure your runtime is compatible with the ExecuTorch version used to export the .pte files and follow the example script to run the models.

Repository Structure

The repository contains 3 main directories:

phonemizer - data files required by the Phonemis package - responsible for input preprocessing part of React Native ExecuTorch Kokoro pipeline.
voices - a collection of pre-computed speaker embeddings used by the Kokoro model to synthesize speech with specific vocal characteristics.
xnnpack - exported, XNNPACK-optimized Kokoro runtime modules.

Downloads last month: 37,776

Model tree for software-mansion/react-native-executorch-kokoro

Base model

yl4579/StyleTTS2-LJSpeech

Finetuned

hexgrad/Kokoro-82M

Quantized

(50)

this model

Collection including software-mansion/react-native-executorch-kokoro

Text to Speech

Collection

1 item • Updated Apr 28 • 1