SRT to Video: turn subtitle files into a captioned video

Paste the contents of your .srt subtitle file. ngram reads each timestamped caption line, plans a scene and narration to match it, and returns a branded captioned video you edit in plain language.

Input · SRT to VideoReady
chars 0 / 4000

Trusted by teams at

Amazon
Amazon
Google
Google
Microsoft
Microsoft
Nvidia
Nvidia
Apple
Apple
Walmart
Walmart
Salesforce
Salesforce
Reddit
Reddit
CVS Health
CVS Health
PayPal
PayPal
John Deere
John Deere
Snap Inc.
Snap Inc.
Amazon
Amazon
Google
Google
Microsoft
Microsoft
Nvidia
Nvidia
Apple
Apple
Walmart
Walmart
Salesforce
Salesforce
Reddit
Reddit
CVS Health
CVS Health
PayPal
PayPal
John Deere
John Deere
Snap Inc.
Snap Inc.
Veeva Systems
Veeva Systems
DocuSign
DocuSign
DP World
DP World
Genpact
Genpact
Parker Hannifin
Parker Hannifin
Bio-Rad
Bio-Rad
Imperva
Imperva
ITV
ITV
HubSpot
HubSpot
Rocket Mortgage
Rocket Mortgage
Tektronix
Tektronix
Diligent
Diligent
Times Internet
Times Internet
Veeva Systems
Veeva Systems
DocuSign
DocuSign
DP World
DP World
Genpact
Genpact
Parker Hannifin
Parker Hannifin
Bio-Rad
Bio-Rad
Imperva
Imperva
ITV
ITV
HubSpot
HubSpot
Rocket Mortgage
Rocket Mortgage
Tektronix
Tektronix
Diligent
Diligent
Times Internet
Times Internet
Deel
Deel
Zapier
Zapier
Delhivery
Delhivery
SafetyCulture
SafetyCulture
Demandbase
Demandbase
PingCAP
PingCAP
Quizizz
Quizizz
Apryse
Apryse
Improvado
Improvado
Taggbox
Taggbox
Matrixport
Matrixport
Glasswall
Glasswall
ContractSafe
ContractSafe
Deel
Deel
Zapier
Zapier
Delhivery
Delhivery
SafetyCulture
SafetyCulture
Demandbase
Demandbase
PingCAP
PingCAP
Quizizz
Quizizz
Apryse
Apryse
Improvado
Improvado
Taggbox
Taggbox
Matrixport
Matrixport
Glasswall
Glasswall
ContractSafe
ContractSafe

How it works

Four steps from a raw .srt file to a captioned video.

No subtitle-burner that just stamps text on footage you don't have, and no timeline to nudge cues by hand. The caption lines you already wrote become the script, and the script becomes a storyboard you can argue with before anything renders.

01

Paste the SRT or drop a file URL

Open the .srt in any text editor and paste the cue numbers, timestamps, and caption lines. ngram parses the SubRip structure, keeps the cue order, and reads the in/out times so pacing follows the original captions.

02

Cues become a video script

ngram groups consecutive caption lines into scenes, finds the hook and the close, and tightens the wording for spoken pacing. The phrasing from your subtitles survives where it reads well; the timestamps guide how long each scene holds.

03

Review the storyboard before render

Each scene shows its caption line, the visual direction, and the duration pulled from the cue timing. Merge two cues, rewrite a line, swap a visual, or ask for a shorter cut in plain language, and the changes flow back through the script.

04

Export with captions burned in

One render produces 16:9 for YouTube, 1:1 for LinkedIn, and 9:16 for Reels, Shorts, and approved social channels. Captions are burned in from your SRT text and styled with the Brand Kit, with voiceover reading the same lines.

Output controls

Smart defaults from the cues. Real knobs when you need them.

Caption-line-to-scene mapping

ngram reads the numbered cues and decides where one caption line ends a scene and the next begins one. Long captions split, rapid-fire cues merge, so the storyboard reads as scenes rather than one card per subtitle.

Timing follows the timestamps

The in/out times in the .srt set a baseline duration for each scene. Keep the original pacing for a faithful cut, or ask the agent to re-pace the whole video to a target length and the cue timing rescales with it.

Script-first review

Read the full script, assembled from your caption lines, before any visual generates. Cut a section, fix a line, or rewrite the close in plain English, and every edit re-flows the scene plan downstream.

AI Visuals per caption

Each scene gets a brand-matched image or short generative clip tied to what that caption line says. No keyword-matched stock footage that ignores the words on screen.

Voiceover that reads the captions

Narrate the caption lines in a default ngram voice, your cloned voice, or a multilingual ElevenLabs voice. The voiceover and the burned-in captions stay in lockstep because they come from the same SRT text.

Caption styling from the Brand Kit

Burned-in captions inherit the Brand Kit's caption preset: font, color, weight, and position. The subtitle file gave you the words; the Brand Kit gives them a consistent on-screen look across every render.

Re-render in another language

Translate the caption lines and regenerate voiceover and on-screen text in the target language without rebuilding the storyboard. One SRT becomes a localized cut for each market you ship in.

Three ratios in one render

16:9 for YouTube and embeds, 1:1 for LinkedIn feed, 9:16 for Reels and approved social channels. Smart reframing keeps the caption text readable across every aspect ratio.

Use cases

Where a subtitle file already holds the whole video.

Explainer video

Turn a captioned explainer transcript into a video

The .srt from an old explainer or webinar already carries the full narration. Paste it and ngram rebuilds the explainer with fresh visuals and voiceover, no re-recording the script line by line.

See use case
Training content

Rebuild a training module from its caption track

Course captions and SOP subtitles convert straight into a branded training video with step pacing pulled from the cue timing. The words employees already read on screen become the script.

See use case
Tutorial video

Re-skin a tutorial from its subtitle file

Have the SRT but not the footage? ngram reads the tutorial captions and generates matching scenes, so the steps stay in order and the on-screen text matches what the narrator says.

See use case
Customer onboarding

Caption files become onboarding videos

Onboarding scripts saved as .srt convert into a short branded walkthrough with captions burned in. New customers see the steps spoken and written, which lifts activation in week one.

See use case
Help center

Help-article subtitles into support clips

A subtitle export from a recorded answer carries the exact wording support already approved. Paste it, get a captioned help clip that embeds inline in Zendesk or Intercom, no re-shoot.

See use case
Social distribution

Repurpose caption files into social clips

An old SRT is a ready-made script for a fresh 9:16 cut. Paste it, regenerate the visuals on-brand, and ship a captioned clip to LinkedIn and approved social channels without rewriting a word.

See use case
Lecture recap

Lecture captions into a focused recap

A 60-minute lecture's caption file holds far more than a recap needs. ngram trims the cues to the core points and rebuilds a tighter video, with captions on every scene for students who skim on mute.

See use case
Customer training

Feature-training captions into release videos

Subtitle files from a feature walkthrough convert into a captioned training clip CS can attach to every release. The narration text drives both the voiceover and the on-screen captions.

See use case

Tools that pair with this converter

Sharpen the captions before. Edit the video after.

All ngram tools

Editing the video further

Take the SRT-to-video output past the first cut

Built for teams

Teams sitting on caption files that should be videos.

All solutions

Integrations

Convert subtitle files to video where they already pile up.

Wire the converter into your storage, your CRM, your agent stack, or your publishing tools. Every integration ships with a working SRT-to-video recipe you can fork.

REST APIMCP serverWebhooksProgrammatic SRT-to-video runs in ~20 lines against the REST API.

How it compares

If you've been using something else for srt to video.

VEED, Kapwing, and Flixier read an .srt to burn subtitles onto footage you already have. They don't build the video. ngram treats the caption lines as the script, plans a storyboard you can edit before render, and generates the scenes and voiceover to match, so an SRT with no footage still becomes a finished video.

FeaturengramVEEDKapwingFlixier
Needs existing footageNo. Generates scenes from the caption linesYes, burns onto your videoYes, burns onto your videoYes, burns onto your video
How the SRT is usedCaption lines become a hook-body-CTA scriptOverlay text on a clipOverlay text on a clipOverlay text on a clip
Storyboard review before renderFull scene-by-scene plan, editable in plain languageCaption editor onlyCaption editor onlyCaption editor only
Visual generation per captionAI Visuals matched to each line's meaningNoneStock-library matchingNone
Voiceover from the caption textElevenLabs + cloned voice, any supported languageLimited TTSLimited TTSLimited TTS
Brand Kit on every sceneLogo, fonts, colors, motion, caption presetCaption-style presetsTemplate-basedTemplate-based
Aspect ratios per render16:9, 1:1, 9:16 from one renderOne ratio per exportOne ratio per exportOne ratio per export
Re-render in another languageTranslate captions + voiceover, keep the storyboardManual re-subtitleManual re-subtitleManual re-subtitle
API + agentic accessREST, MCP server, Zapier, n8n, MakeAPI on paid plansAPI on paid plansLimited API

FAQ

Common questions about srt to video

Open your .srt file in any text editor and paste the contents. ngram parses the cue numbers, timestamps, and caption lines, groups them into scenes, and rewrites the text as a video script. You review the storyboard in plain language, then export in 16:9, 1:1, and 9:16 with captions burned in and voiceover reading the same lines.

Still curious?

SRT → Video

Ready to turn a subtitle file into a captioned video?

Paste the .srt, review the storyboard, export in three ratios with captions burned in. Roughly five minutes from paste to publish.