
Popular
Flux-1.1-Pro-Ultra
Flux-1.1-Pro-Ultra is an advanced MCP image generation service that can quickly generate high-quality, creatively rich images based on text descriptions. Suitable for various scenarios such as design, marketing materials, and artistic creation, supporting multiple styles and detail control.
pro
image

Popular
SearXNG MCP Server
SearXNG is a powerful MCP metasearch engine service that can simultaneously query multiple search engines and integrate results, providing rich internet information for AI models. It supports privacy-protected search and can filter and sort search results according to needs, enabling AI to obtain real-time network information.
base
text
search

Popular
Remove-Bg
Remove background from an image.
pro
image

Flux-Kontext-Max
Flux-Kontext-Max, by black-forest-labs, is a premium text-based image editing model. It stands out with exceptional performance and enhanced typography generation. Using simple natural language prompts, users can easily perform complex edits on existing images, including changing styles, adjusting object details, swapping backgrounds, and modifying in-image text. It's a powerful tool for transforming text instructions into precise, high-quality visual results, ideal for in-depth image transformations.
pro
image

Google-Imagen-4
Imagen 4 is our best text-to-image model yet, with photorealistic images, near real-time speed, and sharper clarity — to bring your imagination to life.
pro
image

Speech-02-HD
Minimax's Speech-02-HD is a high-quality Text-to-Audio (T2A) model optimized for high-fidelity applications like voiceovers and audiobooks. It offers rich voice synthesis features, including emotional expression control and multilingual support. Users can choose from various built-in voices or use their own cloned voices, and can finely adjust speed, volume, pitch, and emotion to create detailed and natural-sounding speech. This makes it an ideal choice for professional audio content creation.
pro
audio_video

Dia-1.6B MCP Server
Dia-1.6B is a cutting-edge MCP Text-to-Speech (TTS) service developed by Nari-Labs, providing ultra-high-fidelity voice synthesis capabilities. Using a 1.6 billion parameter large voice model, Dia-1.6B can generate natural speech that is difficult to distinguish from real human voices, featuring high emotional expressiveness and subtle tone variations, suitable for professional voice acting, premium content production, and scenarios requiring extremely high-quality voice output.
pro
text
audio_video

StabilityAI: SD3.5
SD3.5 is the latest generation MCP image generation service launched by StabilityAI, based on leading Stable Diffusion technology. Compared to previous generations, SD3.5 has significant improvements in image quality, detail representation, and creative generation, capable of producing ultra-high-quality, diverse style images, suitable for various professional scenarios such as artistic creation, design, and content production.
pro
image

Flux-Kontext-Pro
Flux-Kontext-Pro, developed by black-forest-labs, is a cutting-edge text-based image editing model. It allows you to perform diverse, high-quality edits on existing images using precise text prompts. It excels at tasks such as style transfer, changing clothes or objects, swapping backgrounds, and even modifying text within images. The Pro version boasts superior prompt following and stable output, ensuring your edits meet your requirements and maintain image consistency. It is a powerful tool for professional image editing.
pro
image

Voice-Cloning
Clone voices to use with Minimax's speech-02-hd and speech-02-turbo.
pro
audio_video

New
OpenAI Whipser
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition, translation, and language identification.
pro
audio_video

Speech-02-Turbo
Minimax's Speech-02-Turbo is an advanced Text-to-Audio (T2A) model offering high-quality voice synthesis. It is specifically designed for low-latency real-time applications, providing quick responses. It supports various built-in system voices and personal voices cloned via minimax/voice-cloning. You can freely adjust speed, volume, and pitch, and control or auto-detect emotional expression. With multilingual capabilities, it is an ideal choice for developing applications requiring real-time voice interaction.
pro
audio_video

Playwright
Cross-browser. Playwright supports all modern rendering engines including Chromium, WebKit, and Firefox. Cross-platform. Test on Windows, Linux, and macOS, locally or on CI, headless or headed. Cross-language. Use the Playwright API in TypeScript, JavaScript, Python, .NET, Java. Test Mobile Web. Native mobile emulation of Google Chrome for Android and Mobile Safari. The same rendering engine works on your Desktop and in the Cloud.
pro
text