feat: update link in tech report
Browse files- index.html +13 -9
index.html
CHANGED
|
@@ -10,7 +10,7 @@
|
|
| 10 |
<meta name="keywords" content="latex.css,css library,class-less css,latex css" />
|
| 11 |
<meta property="og:title"
|
| 12 |
content="MiniMax-Speech Tech Report | Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder" />
|
| 13 |
-
<meta property="og:url" content="https://
|
| 14 |
<meta property="og:description"
|
| 15 |
content=" MiniMax-Speech, an autoregressive Transformer-based Text-to-Speech (TTS) model that generates high-quality speech" />
|
| 16 |
<meta property="og:type" content="website" />
|
|
@@ -28,9 +28,11 @@
|
|
| 28 |
Encoder</h4>
|
| 29 |
<p class="author">
|
| 30 |
MiniMax Team <span class="date">May 2025</span><br />
|
| 31 |
-
<a style="font-size: 1.1rem;" target="_blank"
|
| 32 |
-
href="https://huggingface.co/spaces/MiniMaxAI/MiniMax-Speech-Tech-Report/blob/main/MiniMax_Speech.pdf">[Tech
|
| 33 |
Report]</a>
|
|
|
|
|
|
|
|
|
|
| 34 |
</p>
|
| 35 |
</header>
|
| 36 |
|
|
@@ -57,13 +59,16 @@
|
|
| 57 |
control
|
| 58 |
via LoRA; text to voice (T2V) by synthesizing timbre features directly from text description; and professional
|
| 59 |
voice
|
| 60 |
-
cloning (PVC) by fine-tuning timbre features with additional data.
|
| 61 |
-
<a href="https://www.minimax.io/audio">MiniMax Audio</a> and
|
| 62 |
-
explore our powerful TTS features.
|
| 63 |
</p>
|
| 64 |
</div>
|
| 65 |
|
| 66 |
<nav role="navigation" class="toc">
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 67 |
<h2>Contents</h2>
|
| 68 |
<ol>
|
| 69 |
<li>
|
|
@@ -232,9 +237,8 @@
|
|
| 232 |
features based
|
| 233 |
on the text content, whereas OneShot adheres more strictly to the speaker characteristics (prosody, speech
|
| 234 |
rate,
|
| 235 |
-
emotions, etc.)
|
| 236 |
-
|
| 237 |
-
technical report for details).
|
| 238 |
</p>
|
| 239 |
<div class="scroll-wrapper" style="margin-top: 2rem;">
|
| 240 |
<table style="width: 100%;">
|
|
|
|
| 10 |
<meta name="keywords" content="latex.css,css library,class-less css,latex css" />
|
| 11 |
<meta property="og:title"
|
| 12 |
content="MiniMax-Speech Tech Report | Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder" />
|
| 13 |
+
<meta property="og:url" content="https://minimax-ai.github.io/tts_tech_report" />
|
| 14 |
<meta property="og:description"
|
| 15 |
content=" MiniMax-Speech, an autoregressive Transformer-based Text-to-Speech (TTS) model that generates high-quality speech" />
|
| 16 |
<meta property="og:type" content="website" />
|
|
|
|
| 28 |
Encoder</h4>
|
| 29 |
<p class="author">
|
| 30 |
MiniMax Team <span class="date">May 2025</span><br />
|
| 31 |
+
<a style="font-size: 1.1rem;" target="_blank" href="https://arxiv.org/abs/2505.07916">[Tech
|
|
|
|
| 32 |
Report]</a>
|
| 33 |
+
<a style="font-size: 1.1rem; margin-left: 1rem;" target="_blank"
|
| 34 |
+
href="https://huggingface.co/datasets/MiniMaxAI/TTS-Multilingual-Test-Set">[Multilingual Test Set]</a>
|
| 35 |
+
<a style="font-size: 1.1rem; margin-left: 1rem;" target="_blank" href="https://github.com/MiniMax-AI">[GitHub]</a>
|
| 36 |
</p>
|
| 37 |
</header>
|
| 38 |
|
|
|
|
| 59 |
control
|
| 60 |
via LoRA; text to voice (T2V) by synthesizing timbre features directly from text description; and professional
|
| 61 |
voice
|
| 62 |
+
cloning (PVC) by fine-tuning timbre features with additional data.
|
|
|
|
|
|
|
| 63 |
</p>
|
| 64 |
</div>
|
| 65 |
|
| 66 |
<nav role="navigation" class="toc">
|
| 67 |
+
<h2>Explore MiniMax-Speech</h2>
|
| 68 |
+
<p>Welcome to visit
|
| 69 |
+
<a href="https://www.minimax.io/audio">MiniMax Audio</a> and
|
| 70 |
+
explore our powerful TTS features.
|
| 71 |
+
</p>
|
| 72 |
<h2>Contents</h2>
|
| 73 |
<ol>
|
| 74 |
<li>
|
|
|
|
| 237 |
features based
|
| 238 |
on the text content, whereas OneShot adheres more strictly to the speaker characteristics (prosody, speech
|
| 239 |
rate,
|
| 240 |
+
emotions, etc.). For details of Zero-Shot and One-Shot, refer to the <a
|
| 241 |
+
href="https://arxiv.org/abs/2505.07916" target="_blank">technical report</a>.
|
|
|
|
| 242 |
</p>
|
| 243 |
<div class="scroll-wrapper" style="margin-top: 2rem;">
|
| 244 |
<table style="width: 100%;">
|