You're done when
A short vertical reel — your photo, moving, with a natural voiceover and captions — ready for Reels, Shorts or TikTok, in about half an hour and with no video timeline.
A reel that moves and talks normally takes three crafts: animating a still, recording a voice, and editing the two together. Each MCP connector only knows its own tool — Kling makes video, ElevenLabs makes voice — and neither knows the other is in the chain. The repos document one tool each; an app-store listing just hands you the install. Nobody covers the seam.
This recipe is the seam. You bring one photo (or generate one in Freepik), Kling gives it motion, ElevenLabs reads your line in a real-sounding voice, and you lay the two together into a vertical clip. The gotchas above — keep the motion small so faces don’t warp, ask ElevenLabs for a conversational delivery, mind two separate credit meters — are the parts no single tool’s docs mention, because they only show up when you chain the tools.
Checked 2026-06-14: Kling and ElevenLabs connectors both reachable. Kling is community (needs your Kling API keys); ElevenLabs is the official connector.
If something breaks
- I don't see Kling to turn on
- Fix: Kling is a community connector, not official: add the Kling MCP (npm package mcp-kling) and paste your Kling API keys. There's no one-click 'sign in' like an official connector.
- ElevenLabs reads it like a robot announcer
- Fix: Ask for a specific delivery — 'warm, natural, conversational, not announcer-style' — and pick a voice that fits. The default read is flat; the description is what changes it.
- The animated clip warps faces or hands
- Fix: Keep the motion small — slow zoom, gentle drift. Big camera moves are where Kling distorts; subtle motion stays believable.
- It ran out of credits
- Fix: Kling renders from your plan's credits and ElevenLabs from your character quota. Lower the resolution or length, or generate fewer takes while you dial in the prompt.
- I don't have a photo to start from
- Fix: Generate one first: Freepik (official — add it with your Freepik API key) can make a clean vertical image from a text description, then feed that into step 1.