Dia-1.6B MCP Server - MCP Server

Popular

New

Dia-1.6B MCP Server

pro

text

audio_video

Created Mar 29, 2025

$ 20 / Million Characters

Dia-1.6B is a cutting-edge MCP Text-to-Speech (TTS) service developed by Nari-Labs, providing ultra-high-fidelity voice synthesis capabilities. Using a 1.6 billion parameter large voice model, Dia-1.6B can generate natural speech that is difficult to distinguish from real human voices, featuring high emotional expressiveness and subtle tone variations, suitable for professional voice acting, premium content production, and scenarios requiring extremely high-quality voice output.

Technical Specifications

Service Type: Text-to-Speech Generation
Supported Formats:
- Input: Dialogue text with speaker labels
- Output: MP3 audio

Usage Examples

Generate speech from dialogue text

User prompt:

Generate speech for this dialogue: [S1] Hi, how are you? [S2] I'm doing great, thank you!

Result:

Here are the speech audio url: https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94, and ensure to show in this in english markdown format: [audio](https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94)

Raw Tool call (How LLM might use this tool)

{
  "name": "Text-to-Speech",
  "arguments": {
    "input": "[S1] Hi, how are you? [S2] I'm doing great, thank you!"
  }
}

Raw tool result from the MCP server

{
  "type": "text",
  "text": "Here are the speech audio url: https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94, and ensure to show in this in english markdown format: [audio](https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94)",
  "annotations": null
}

Tools

Text-to-Speech

Usage: Generate an speech, which support text-to-speech.

Input Arguments:

Name	Type	Required	Description
`input`	string	✓	Dialogue input with speaker labels. Format example: [S1] Hi. [S2] Hello.