How to make an Instagram carousel video in 2026
A practical how-to covering both Instagram carousel formats: the swipeable static image post and the multi-clip video carousel, with a full workflow from idea to publish.
Instagram calls everything a "carousel post." That label covers two completely different things, and most tutorials conflate them. Before you build anything, get clear on which format you're actually making.
Format A: static image carousel — a swipeable deck of still images (2–10 slides). No motion, no audio. The format that blew up for educational content, quote decks, and listicles. Your audience swipes through it like a PDF.
Format B: video carousel — a multi-clip video post where each clip plays in sequence. Users still swipe, but they're watching video segments rather than static images. Less common, higher production friction, used mainly for product demos and before/after sequences.
This guide covers both. The workflow diverges at the design step, so read through once and then follow the branch that matches your format.
Step 1: Choose your idea and format together
The format should follow the content type, not the other way around.
Static image carousels perform well for:
- Lists ("7 reasons why…")
- Step-by-step tutorials with discrete stages
- Quote + context pairings
- Before/after comparisons where a single image captures the state
Video carousels perform well for:
- Product walkthroughs where motion shows the interaction
- Multiple short clips that each stand alone but build a sequence
- Side-by-side comparison where the motion is the point
If you're genuinely unsure: default to static. Static image carousels are faster to produce, cheaper to iterate on, and Instagram's algorithm treats them similarly to video in terms of saves and replays.
Step 2: Write a structured script
A carousel is not a social post with images slapped on. It's a scripted sequence with a job.
The six-part structure that works reliably:
- Hook slide — the single most important slide. It shows in the feed thumbnail. One bold claim or question. No body copy, no lists.
- Setup — one or two slides that frame the problem or context.
- Payload — three to six slides where the value lives. Each slide should carry one idea, not three.
- Pattern interrupt — optional. A stat, a counterintuitive point, something that makes the reader pause.
- Payoff — the conclusion or lesson, stated plainly.
- CTA slide — follow, save, DM, link in bio. One action. Not five.
For a static carousel, write headline + one or two supporting lines per slide. For a video carousel, write a spoken line or two per clip — this becomes the script you'll read or voice over.
Aim for six to ten slides total. Under six and the carousel feels incomplete; over ten and completion rates fall off.
Step 3: Design the slides
Aspect ratios
Instagram supports two main aspect ratios for carousels:
- 1:1 (square, 1080×1080 px) — the default. Shows a square crop in the feed grid. Works on every surface.
- 4:5 (portrait, 1080×1350 px) — takes up more vertical real estate in the feed, which means more attention per scroll. Slightly cropped in the grid view, but the additional height in feed is worth it for most content.
4:5 is the better default choice for carousels in 2026 because the feed is primarily vertical-scroll. Use 1:1 only if your visuals genuinely need square framing (infographic grids, product photography).
Instagram does not support 9:16 for carousel posts — that's Reels territory. If your design is full-portrait (9:16), consider posting as a Reel instead.
What makes a slide actually readable
- One idea per slide. If you are writing more than two lines of body copy, split it.
- High-contrast text. Dark background with light text, or a strong overlay if you're using a photo background.
- Consistent visual language across slides. Same fonts, same color palette, same layout grid. Consistency is what makes a carousel feel designed versus assembled.
- The "swipe right →" cue on every slide except the last. It's a small thing and it meaningfully increases completion.
For video carousel clips
Each clip in an Instagram video carousel should be between 3 and 15 seconds. Shorter than 3 seconds and it feels like an error; longer than 15 and you have enough content for a Reel. Shoot or render at 4:5 if possible, 1:1 as a fallback.
If you're adding voiceover: keep the narration tight and sync it to what's on screen. Instagram auto-mutes video in the feed, so either your visuals carry the story without sound or you add text captions.
Step 4: Generate or capture your visuals
For static image carousels, you have three routes:
- Design by hand in Canva, Figma, or Adobe Express. Full control, but each slide takes 5–15 minutes.
- Template tool like ContentDrips or Postwise. Faster, but you're locked to their template library and there's no video output — you'll export PNGs and upload manually.
- AI generation from a prompt. This is the fastest path if the tool produces structured layouts rather than freeform images. A tool that understands carousel primitives (title slide, checklist, big-stat, CTA) will output something usable; a generic image generator will not.
For video carousel clips, you either shoot footage, screen-record a workflow, or generate short video segments. If you're adding voiceover or AI narration, record or generate audio separately, then sync in a video editor.
Step 5: Add voiceover (video carousels only)
Static image carousels have no audio track — skip this step if that's your format.
For video carousels, voiceover is optional but it increases watch time. Options:
- Record your own voice — highest quality, most personal, requires editing.
- AI text-to-speech — faster to iterate, no recording setup. Quality varies a lot by provider. Professional-grade TTS (ElevenLabs, Google Chirp) produces broadcast-quality results; generic online converters do not.
- No voiceover, text captions only — works if your visuals are strong enough. Requires good type design on the clips.
If you're building a faceless brand or posting at daily volume, AI voiceover is the practical choice. The generation is instant and you can re-run a slide without re-recording the whole track.
Step 6: Render and export
For static image carousels: export each slide as a PNG or JPEG at the target aspect ratio. Instagram requires at minimum 1080 px on the short side. Export at 2x if your design tool supports it.
Keep all slides at the same aspect ratio. Instagram rejects mixed-aspect carousels.
For video carousel clips: export each clip as an MP4 with H.264 encoding. Maximum file size per clip is 250 MB; minimum resolution is 600×600 px (go higher). The audio should be AAC at 128 kbps or higher.
You will upload each clip individually in the Instagram app — there's no bulk upload API for video carousels from third-party tools today. If you want programmatic publishing, you need the static image format (the Graph API supports CAROUSEL_ALBUM for image carousels directly to feed).
Step 7: Write your caption and hashtags
The caption does two jobs: get the right person to swipe, and serve as the context if someone shares the post without the slides.
Structure that works:
- First line (the "more" hook) — the sentence visible before the fold. Make it the same hook as your first slide or a question that earns the swipe.
- Explain the payoff in two to three lines.
- Call to action — one. Save this. DM "template" for the link. Follow for more. Not all three.
- Hashtags — five to ten targeted hashtags is the range most creators report returning real discovery. Broad hashtags (#motivation, #marketing) return almost nothing at scale; specific hashtags relevant to your niche do.
Caption length: Instagram allows up to 2,200 characters. For carousels you rarely need more than 150–200 words. The content is in the slides; the caption supports it.
Step 8: Post
For static image carousels, you have two publishing routes:
Manual (Instagram app): tap the + icon, select Post, tap the multi-select icon to choose up to 10 images, arrange order, choose filters, write caption, post. This works but doesn't scale if you're posting daily.
Programmatic (Graph API): for teams or tools with Meta API access, the CAROUSEL_ALBUM flow creates child media containers first, then a parent container, then calls /media_publish. This is how third-party tools like Slidereel publish directly to your feed without the manual upload step.
For video carousels, programmatic publishing via the API is not available for multi-clip video carousels in the same way — you'll upload clips directly in the app.
Tools comparison
| Tool | Best for | Limitations | Pricing |
|---|---|---|---|
| Canva | Design-heavy static carousels | Slow per-slide; no video generation; manual upload only | Free tier exists; Pro ~$15/mo (canva.com/pricing) |
| ContentDrips | Quick static-only carousel generation | No video output at all; limited template library; no direct IG publish | $15–$26/mo (contentdrips.com/pricing) |
| CapCut | Manual video carousel editing | Fully manual; no structured AI generation; best for one-off clips you already have | Free + CapCut Pro ~$10/mo (capcut.com/pro-pricing) |
| Slidereel | Generating both formats from one prompt | Disclosure: this is our product. Video carousels (voiced, slide-based MP4s rendered in ~25 seconds) plus matching static image carousels — from the same input, published directly to Instagram (static carousel) and TikTok/Facebook/YouTube Shorts (video). 100 credits free, no card required. | Starter $19/mo (/pricing) |
The honest comparison: if you already have your visuals and you want to lay them out beautifully, Canva is still the best design tool. If you produce carousels at volume — daily posting across multiple platforms — a generation tool that handles scripting, design, and publishing in one step takes about five minutes per carousel versus 30–45 minutes doing each step manually.
ContentDrips is the closest competitor to Slidereel for static carousels but produces only images, not video. If your strategy requires both formats (static for Instagram, video for TikTok and YouTube Shorts), you'd need two separate tools — or one topic typed into Slidereel.
What makes a carousel worth saving
Instagram's save rate is the signal that matters most for carousel distribution. Saves indicate the viewer found the content reference-worthy — not just entertaining — and that signal carries more weight in distribution than likes.
Content that gets saved reliably:
- Reference carousels — "the definitive list of X" that someone wants to come back to.
- Checklist carousels — actionable steps that feel incomplete without saving for later.
- Stat carousels — hard numbers that someone will cite in a conversation or a post.
- Template carousels — fill-in-the-blank frameworks the viewer can use themselves.
The common thread: the viewer needs to return to it. If your carousel is purely entertaining or inspirational, it gets watched and forgotten. If it's useful as a reference, it gets saved.
See it in action
Slidereel generates a static image carousel (published directly to Instagram feed) and a voiced video carousel (published to TikTok, Facebook, and YouTube Shorts) from a single prompt — start to all four platforms in under a minute. A 6-slide static carousel costs 9 credits; a full 8-slide voiced video carousel costs ~63 credits.
Start free → 100 credits, no card
Related reading
- Best AI tools for Instagram Reels — if you want video-first content for Instagram rather than carousels.
- Best AI tools for LinkedIn carousels — LinkedIn carousels use a different aspect ratio (4:5) and different audience expectations; the format overlap is real but the strategy differs.
- Slidereel vs ContentDrips — detailed head-to-head if you're evaluating static-carousel-only tools versus a video-plus-carousel workflow.
Type a topic, get a voiced multi-slide video in ~30 seconds. 100 free credits, no card.
Try it on a topic →