File size: 1,828 Bytes
bb82738
62e3411
 
 
 
bb82738
7f37b04
4821af3
bb82738
62e3411
bb82738
 
62e3411
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3275e74
 
 
1b3d1f6
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
---
title: Piper TTS Demo
emoji: πŸŽ™οΈ
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 3.50.2
app_file: app.py
pinned: false
license: mit
---

# Piper TTS Demo

A web-based demo for [Piper TTS](https://github.com/ayutaz/piper-plus), featuring high-quality text-to-speech synthesis for Japanese and English.

## Features

- πŸ‡―πŸ‡΅ **Japanese TTS**: High-quality Japanese speech synthesis using OpenJTalk phonemization
- πŸ‡ΊπŸ‡Έ **English TTS**: Natural English speech synthesis
- πŸš€ **Fast Inference**: ONNX Runtime for efficient CPU-based inference
- πŸŽ›οΈ **Adjustable Parameters**: Control speech speed, expressiveness, and phoneme duration
- 🌐 **Web Interface**: Easy-to-use Gradio interface

## Models

This demo includes:
- **Japanese (Medium)**: CSS10-based Japanese model with natural intonation
- **English (Test)**: English model for demonstration

## Usage

1. Select a model from the dropdown
2. Enter your text in the input field
3. Adjust advanced settings if needed
4. Click "Generate Speech" to synthesize

## Technical Details

- **Framework**: ONNX Runtime (CPU inference)
- **Phonemization**: 
  - Japanese: pyopenjtalk
  - English: Character-based fallback
- **Audio**: 16-bit PCM WAV output

## Local Development

```bash
# Clone the repository
git clone https://github.com/ayutaz/piper-plus.git
cd piper-plus/huggingface-space

# Install requirements
pip install -r requirements.txt

# Run the app
python app.py
```

## Credits

- Piper TTS by [Rhasspy](https://github.com/rhasspy/piper)
- Japanese enhancements by [ayutaz](https://github.com/ayutaz/piper-plus)

## License

This project is licensed under the MIT License. See the original [Piper repository](https://github.com/rhasspy/piper) for more details.

---
_Last updated: 2025-01-21 - Using Gradio 4.44.1 for stable deployment_