mcp-image
Popular
New
Dia-1.6B MCP Server
pro
text
audio_video
Created Mar 29, 2025
$ 20 / Million Characters
Dia-1.6B is a cutting-edge MCP Text-to-Speech (TTS) service developed by Nari-Labs, providing ultra-high-fidelity voice synthesis capabilities. Using a 1.6 billion parameter large voice model, Dia-1.6B can generate natural speech that is difficult to distinguish from real human voices, featuring high emotional expressiveness and subtle tone variations, suitable for professional voice acting, premium content production, and scenarios requiring extremely high-quality voice output.

Technical Specifications

  • Service Type: Text-to-Speech Generation
  • Supported Formats:
    • Input: Dialogue text with speaker labels
    • Output: MP3 audio

Usage Examples

Generate speech from dialogue text

User prompt:

Generate speech for this dialogue: [S1] Hi, how are you? [S2] I'm doing great, thank you!

Result:

Here are the speech audio url: https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94, and ensure to show in this in english markdown format: [audio](https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94)

Raw Tool call (How LLM might use this tool)

{
  "name": "Text-to-Speech",
  "arguments": {
    "input": "[S1] Hi, how are you? [S2] I'm doing great, thank you!"
  }
}

Raw tool result from the MCP server

{
  "type": "text",
  "text": "Here are the speech audio url: https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94, and ensure to show in this in english markdown format: [audio](https://storage.oaphub.ai/19/192881631470551040/13853096369321311679b73?k=48ab0f94)",
  "annotations": null
}

Tools

Text-to-Speech

Usage: Generate an speech, which support text-to-speech.

Input Arguments:

Name Type Required Description
input string Dialogue input with speaker labels. Format example: [S1] Hi. [S2] Hello.
© 2025 Open Agent Platform. All Rights Reserved.