mcps
K E F D

Recipe

Turn a photo into a narrated reel

Animate one still in Kling, add a natural voiceover in ElevenLabs, and post a vertical reel — no video editor, no studio voice.

~30 minutes beginner 4 connectors for Anyone who wants a short video from a photo or a sentence — no editing software

What you'll need

Do this, in order

  1. 1

    Animate the photo in Kling

    Animate this photo into a 5-second vertical 9:16 clip — slow zoom-in, gentle drift, keep it realistic. [attach the photo]

    You'll get: Kling returns a short clip that turns your still into motion.

  2. 2

    Record the voiceover in ElevenLabs

    Read this line for a short reel in a warm, natural, conversational voice — not announcer-style: '[your one-liner]'.

    You'll get: ElevenLabs returns a voiceover audio file you can play back.

  3. 3

    Put them together

    Lay this voiceover over the Kling clip, add captions that match the words, and export a 1080x1920 MP4.

    You'll get: A finished vertical reel with captions, ready to post. (No Descript? Drop the clip and the audio into any free editor — both pieces are already done.)

You're done when

A short vertical reel — your photo, moving, with a natural voiceover and captions — ready for Reels, Shorts or TikTok, in about half an hour and with no video timeline.

A reel that moves and talks normally takes three crafts: animating a still, recording a voice, and editing the two together. Each MCP connector only knows its own tool — Kling makes video, ElevenLabs makes voice — and neither knows the other is in the chain. The repos document one tool each; an app-store listing just hands you the install. Nobody covers the seam.

This recipe is the seam. You bring one photo (or generate one in Freepik), Kling gives it motion, ElevenLabs reads your line in a real-sounding voice, and you lay the two together into a vertical clip. The gotchas above — keep the motion small so faces don’t warp, ask ElevenLabs for a conversational delivery, mind two separate credit meters — are the parts no single tool’s docs mention, because they only show up when you chain the tools.

Checked 2026-06-14: Kling and ElevenLabs connectors both reachable. Kling is community (needs your Kling API keys); ElevenLabs is the official connector.

If something breaks

I don't see Kling to turn on
Fix: Kling is a community connector, not official: add the Kling MCP (npm package mcp-kling) and paste your Kling API keys. There's no one-click 'sign in' like an official connector.
ElevenLabs reads it like a robot announcer
Fix: Ask for a specific delivery — 'warm, natural, conversational, not announcer-style' — and pick a voice that fits. The default read is flat; the description is what changes it.
The animated clip warps faces or hands
Fix: Keep the motion small — slow zoom, gentle drift. Big camera moves are where Kling distorts; subtle motion stays believable.
It ran out of credits
Fix: Kling renders from your plan's credits and ElevenLabs from your character quota. Lower the resolution or length, or generate fewer takes while you dial in the prompt.
I don't have a photo to start from
Fix: Generate one first: Freepik (official — add it with your Freepik API key) can make a clean vertical image from a text description, then feed that into step 1.

People ask their AI

“turn a photo into a reel with a voiceover”“make a talking reel from one picture”“animate my product photo and narrate it”

Related