Introduction

This repository hosts the Kokoro model for the React Native ExecuTorch library. The model is split into two main parts, a duration predictor and a synthesizer, both ready for use in the ExecuTorch runtime.

The models support dynamic input shapes and accept inputs of 1 to 128 tokens.
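
Because a single call takes at most 128 tokens, longer inputs have to be split before inference. The sketch below shows one simple way to do that. The helper name `chunk_tokens` is hypothetical, and fixed-size splitting is an assumption made for illustration; a real pipeline would more likely split at sentence or pause boundaries.

```python
from typing import List, Sequence

MAX_TOKENS = 128  # upper bound of the supported input range


def chunk_tokens(tokens: Sequence[int], max_len: int = MAX_TOKENS) -> List[List[int]]:
    """Split a token sequence into consecutive chunks of 1..max_len tokens.

    An empty input yields no chunks, since the models need at least 1 token.
    """
    if max_len < 1:
        raise ValueError("max_len must be at least 1")
    return [list(tokens[i:i + max_len]) for i in range(0, len(tokens), max_len)]


chunks = chunk_tokens(list(range(300)))
print([len(c) for c in chunks])  # [128, 128, 44]
```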

Additionally, the repository contains essential resources for G2P (grapheme-to-phoneme) preprocessing required by the Kokoro model, including simple word-by-word phonemization models (also in ExecuTorch format).

If you'd like to run these models in your own ExecuTorch runtime, refer to the official documentation for setup instructions.

Compatibility

These models were exported using ExecuTorch v1.0.0, and no forward compatibility is guaranteed. Older versions of the runtime may not work with these files.

The models are intended to be used within the React Native ExecuTorch package. If you want to use them outside the package, make sure your runtime is compatible with the ExecuTorch version used to export the .pte files and follow the example script to run the models.
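
When running the files outside the package, it can help to check the installed runtime version first. This is a minimal sketch, assuming the runtime is installed as the `executorch` Python package and that a plain `major.minor.patch` comparison is sufficient. The helper names are hypothetical, and passing the check does not guarantee that the files will load.

```python
import re
from importlib import metadata
from typing import Optional, Tuple

EXPORT_VERSION = "1.0.0"  # ExecuTorch version used to export the .pte files


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn '1.0.0', '1.0' or '1.1.0+cpu' into a 3-part integer tuple.

    Pre-release suffixes such as 'rc1' are ignored (an assumption that
    keeps the sketch short).
    """
    parts = []
    for piece in version.split("+")[0].split(".")[:3]:
        match = re.match(r"\d+", piece)
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def runtime_is_new_enough(installed: str, exported: str = EXPORT_VERSION) -> bool:
    """Return True when the installed runtime is not older than the export version."""
    return parse_version(installed) >= parse_version(exported)


def installed_executorch_version() -> Optional[str]:
    """Return the installed executorch version, or None if it is not installed."""
    try:
        return metadata.version("executorch")
    except metadata.PackageNotFoundError:
        return None


version = installed_executorch_version()
if version is None:
    print("executorch is not installed")
elif runtime_is_new_enough(version):
    print(f"executorch {version} is not older than {EXPORT_VERSION}")
else:
    print(f"executorch {version} is older than {EXPORT_VERSION}; the files may not load")
```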

Repository Structure

The repository contains 3 main directories:

  • phonemizer - data files required by the Phonemis package, which handles the input preprocessing stage of the React Native ExecuTorch Kokoro pipeline.
  • voices - a collection of pre-computed speaker embeddings used by the Kokoro model to synthesize speech with specific vocal characteristics.
  • xnnpack - exported, XNNPACK-optimized Kokoro runtime modules.
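
To fetch only some of these directories, a filtered download is one option. This is a sketch, assuming the `huggingface_hub` Python package is installed and that the files sit under the directory names listed above. `patterns_for` and `download_dirs` are hypothetical helper names.

```python
from typing import Iterable, List

REPO_ID = "software-mansion/react-native-executorch-kokoro"


def patterns_for(directories: Iterable[str]) -> List[str]:
    """Build glob patterns that match everything under the given directories."""
    return [f"{name.strip('/')}/*" for name in directories]


def download_dirs(directories: Iterable[str]) -> str:
    """Download only the selected directories and return the local snapshot path."""
    # Imported lazily so patterns_for works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, allow_patterns=patterns_for(directories))


print(patterns_for(["xnnpack", "voices"]))  # ['xnnpack/*', 'voices/*']
```

Calling `download_dirs(["xnnpack", "voices"])` requires network access and returns the path of the local snapshot.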