
Popular
New
Dia-1.6B MCP Server
pro
text
audio_video
Created Mar 29, 2025
$ 20 / Million Characters
Dia-1.6B is a cutting-edge MCP Text-to-Speech (TTS) service developed by Nari-Labs, providing ultra-high-fidelity voice synthesis capabilities. Using a 1.6 billion parameter large voice model, Dia-1.6B can generate natural speech that is difficult to distinguish from real human voices, featuring high emotional expressiveness and subtle tone variations, suitable for professional voice acting, premium content production, and scenarios requiring extremely high-quality voice output.
Technical Specifications
- Service Type: Text-to-Speech Generation
- Supported Formats:
- Input: Dialogue text with speaker labels
- Output: MP3 audio
Usage Examples
Generate speech from dialogue text
User prompt:
Generate speech for this dialogue: [S1] Hi, how are you? [S2] I'm doing great, thank you!
Result:
Here are the speech audio url: https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94, and ensure to show in this in english markdown format: [audio](https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94)
Raw Tool call (How LLM might use this tool)
{
"name": "Text-to-Speech",
"arguments": {
"input": "[S1] Hi, how are you? [S2] I'm doing great, thank you!"
}
}
Raw tool result from the MCP server
{
"type": "text",
"text": "Here are the speech audio url: https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94, and ensure to show in this in english markdown format: [audio](https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94)",
"annotations": null
}
Tools
Text-to-Speech
Usage: Generate an speech, which support text-to-speech.
Input Arguments:
Name | Type | Required | Description |
---|---|---|---|
input |
string | ✓ | Dialogue input with speaker labels. Format example: [S1] Hi. [S2] Hello. |
© 2025 Open Agent Platform. All Rights Reserved.