multimodal
Multi-provider media generation — images, video, audio, and transcription via a unified interface
stdio v1.3.1
Connection Configuration
Add this to your AI agent's MCP config file to connect.
Run in your terminal:
claude mcp add io.github.rsmdt--multimodal -- npx
Server Details
Transport
stdio
Authentication
None
Version
v1.3.1
Server Name
io.github.rsmdt/multimodal
Last Updated
Mar 3, 2026
Get Started
How to install and connect this MCP server.
Required Configuration
Environment variables to provide when running this server. Each one is listed as optional.
| Variable | Required |
|---|---|
| OPENAI_API_KEY (secret) | Optional |
| XAI_API_KEY (secret) | Optional |
| GEMINI_API_KEY (secret) | Optional |
| ELEVENLABS_API_KEY (secret) | Optional |
| BFL_API_KEY (secret) | Optional |
| MEDIA_OUTPUT_DIR | Optional |
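
Because every variable in the table is optional, it can be useful to confirm which ones are actually exported before starting the server. The following is a minimal sketch, assuming a POSIX-like shell where `printenv` is available. The function name `check_media_env` is hypothetical and not part of the server. Only the variable names come from the table above.

```shell
# Hypothetical helper, not part of the server: reports which of the
# documented variables are exported in the current environment.
check_media_env() {
  for name in OPENAI_API_KEY XAI_API_KEY GEMINI_API_KEY \
              ELEVENLABS_API_KEY BFL_API_KEY MEDIA_OUTPUT_DIR; do
    # printenv only sees exported variables, which is what a child
    # process such as the server would inherit.
    if [ -n "$(printenv "$name")" ]; then
      echo "$name: set"
    else
      echo "$name: not set"
    fi
  done
}

check_media_env
```

The helper prints one line per variable and never prints the values themselves, so secrets stay out of the terminal history and logs.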