multimodal

Multi-provider media generation — images, video, audio, and transcription via a unified interface

stdio v1.3.1

Connection Configuration

To connect, register this server in your AI agent's MCP configuration. With the claude CLI, run in your terminal:

claude mcp add io.github.rsmdt--multimodal -- npx

Server Details

Transport: stdio
Authentication: None
Version: v1.3.1
Server Name: io.github.rsmdt/multimodal
Last Updated: Mar 3, 2026

Get Started

How to install and connect this MCP server.

Required Configuration

Environment variables this server accepts. All of them are listed as optional; the API keys are marked as secrets.

Variable             Secret   Required
OPENAI_API_KEY       yes      Optional
XAI_API_KEY          yes      Optional
GEMINI_API_KEY       yes      Optional
ELEVENLABS_API_KEY   yes      Optional
BFL_API_KEY          yes      Optional
MEDIA_OUTPUT_DIR     no       Optional
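
Because every key is optional, it can help to check which ones are actually set before launching the server. The following is a minimal sketch, not part of the server itself. The variable names are copied from the table above; the helper name `configured_keys` is hypothetical, and the idea that a provider is only usable when its key is set is an assumption that this listing does not confirm.

```python
import os
from typing import Mapping

# Variable names copied from the configuration table above.
PROVIDER_KEYS = (
    "OPENAI_API_KEY",
    "XAI_API_KEY",
    "GEMINI_API_KEY",
    "ELEVENLABS_API_KEY",
    "BFL_API_KEY",
)


def configured_keys(env: Mapping[str, str]) -> list[str]:
    """Return the provider key variables set to a non-empty value.

    Hypothetical helper: it only inspects the environment and does not
    contact any provider or validate the keys.
    """
    return [name for name in PROVIDER_KEYS if env.get(name, "").strip()]


if __name__ == "__main__":
    found = configured_keys(os.environ)
    if found:
        print("Provider keys set:", ", ".join(found))
    else:
        # Assumption: with no keys set, no provider can be reached.
        print("No provider keys set.")
    # MEDIA_OUTPUT_DIR is not a secret, so printing it is harmless.
    print("MEDIA_OUTPUT_DIR:", os.environ.get("MEDIA_OUTPUT_DIR", "(not set)"))
```

Passing the environment in as a mapping, rather than reading `os.environ` inside the function, keeps the check easy to test with a plain dictionary.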