piper-plus-demo / README.md
ayousanz's picture
Update from GitHub Actions - 2025-08-22 02:29:06
7f37b04 verified

A newer version of the Gradio SDK is available: 5.49.1

Upgrade
metadata
title: Piper TTS Demo
emoji: πŸŽ™οΈ
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 3.50.2
app_file: app.py
pinned: false
license: mit

Piper TTS Demo

A web-based demo for Piper TTS, featuring high-quality text-to-speech synthesis for Japanese and English.

Features

  • πŸ‡―πŸ‡΅ Japanese TTS: High-quality Japanese speech synthesis using OpenJTalk phonemization
  • πŸ‡ΊπŸ‡Έ English TTS: Natural English speech synthesis
  • πŸš€ Fast Inference: ONNX Runtime for efficient CPU-based inference
  • πŸŽ›οΈ Adjustable Parameters: Control speech speed, expressiveness, and phoneme duration
  • 🌐 Web Interface: Easy-to-use Gradio interface

Models

This demo includes:

  • Japanese (Medium): CSS10-based Japanese model with natural intonation
  • English (Test): English model for demonstration

Usage

  1. Select a model from the dropdown
  2. Enter your text in the input field
  3. Adjust advanced settings if needed
  4. Click "Generate Speech" to synthesize

Technical Details

  • Framework: ONNX Runtime (CPU inference)
  • Phonemization:
    • Japanese: pyopenjtalk
    • English: Character-based fallback
  • Audio: 16-bit PCM WAV output

Local Development

# Clone the repository
git clone https://github.com/ayutaz/piper-plus.git
cd piper-plus/huggingface-space

# Install requirements
pip install -r requirements.txt

# Run the app
python app.py

Credits

License

This project is licensed under the MIT License. See the original Piper repository for more details.


Last updated: 2025-01-21 - Using Gradio 4.44.1 for stable deployment