Subtitles for online courses: how to caption Teachable, Kajabi, Thinkific & Podia videos (2026)

Course platforms moved from optional to expected on captions over the last three years. Coursera and Udemy require them. Teachable, Kajabi, Thinkific, and Podia all support them. Students filter for captioned courses and leave reviews complaining when they are missing. If your course library is uncaptioned in 2026, you are leaving completion rate, conversion, and goodwill on the table.

Why captions matter more for courses than for social

Course viewers are not scrolling — they are studying. They re-watch. They search. They take notes. Captions support all three behaviors: re-watching is faster when you can scan a transcript, search jumps directly to the segment that mentioned a keyword, and notes get easier when the words are visible.

Comprehension: 12–18% lift on retention quizzes when captions are available, even for native speakers.
Accessibility: opens enrollment to deaf and hard-of-hearing students. Required for ADA / Section 508 compliance in US institutions.
Internationalization: a captioned English course can be auto-translated by the platform into 30+ languages without you re-recording.
Searchability: VTT/SRT transcripts feed in-platform search and SEO crawlers — your lessons become discoverable on Google.

The practical workflow

01Export each lesson as a final MP4. Do not caption the rough cut — you will redo it.
02Upload to a caption tool (SoCaptions, Whisper API, Rev). Generate the transcript in 30–90 seconds per lesson.
03Review the transcript. Fix product names, technical terms, acronyms, and numbers. The model gets verbs right but proper nouns wrong.
04Export an SRT file (universal) or VTT (for web-native players).
05Upload the SRT/VTT alongside the video on your platform. Teachable, Kajabi, Thinkific, and Podia all accept SRT.
06For platforms without native caption support, burn captions into the MP4 instead.

Platform-specific notes

Teachable: SRT upload per lecture in the lecture editor. Toggle captions on by default.
Kajabi: VTT upload via the lesson video settings. Multiple language tracks supported.
Thinkific: SRT or VTT upload per video. Captions are toggleable for students.
Podia: VTT upload per lesson video.
Self-hosted (Vimeo, Mux, Bunny): SRT or VTT track on the player. Easy.
Custom HTML5 player: VTT via the <track> element.

Edit pass: what to fix and what to leave

AI transcription gets 92–97% accuracy on clean studio audio. The 3–8% it misses is concentrated in product names, jargon, and numbers — exactly the words that matter most in a course. Spend your edit time there.

Fix: product names, library names, function names, acronyms, numbers, prices, URLs, brand names.
Fix: timing drift greater than ~250ms — captions that lag the speaker break flow.
Leave: ums, ahs, false starts. They make the transcript human.
Leave: minor punctuation. Most viewers do not notice missing commas.

Translation for global enrollment

Once you have a clean English SRT, translation is trivial. Most caption tools offer one-click translation to 30+ languages, and even rough machine translation lifts comprehension for non-native speakers. For paid courses sold internationally, the translation pays for itself within the first month of enrollment.

ROI math

If your course has 100 lessons and captions add 4 minutes per lesson, that is ~7 hours of work. If captions lift completion by 5% and completion lifts review-driven referrals, you make that time back inside one month for any course earning $2k/mo or more.

FAQ

Do online courses need subtitles?

Yes, for two reasons. First, accessibility: US educational institutions subject to ADA and Section 508 must provide equivalent captions for video content. WCAG 2.1 AA (the enterprise and institutional standard) requires captions for all prerecorded synchronized media. Second, retention: studies consistently show 12–18% higher comprehension and quiz scores when subtitles are available, even for native speakers. Udemy and Coursera both require captions for courses submitted to their marketplace.

What subtitle format do course platforms use?

SRT is universally accepted — Teachable, Kajabi, Thinkific, Podia, Udemy, Coursera, and Skillshare all accept SRT. VTT is supported by Kajabi, Mux, and self-hosted HTML5 players using the <track> element. If you're not sure which format a platform accepts, start with SRT — it converts to VTT in seconds and has never broken on any major LMS.

How do I add subtitles to Teachable videos?

In Teachable, open the course editor, click the lecture you want to caption, click the video block, and look for the Captions section in the video settings panel. Upload your SRT file and assign it a language. Teachable renders the captions as a toggleable CC track in the Teachable player. Repeat per lecture — Teachable doesn't have a bulk upload option as of 2026.

How do I add subtitles to Kajabi videos?

In Kajabi, go to the Product, open the post (lesson), and click the video thumbnail to open video settings. There is a Subtitles/Captions section where you can upload a VTT file per language. Kajabi's player supports multiple language tracks, so if you have translations, upload each as a separate VTT. English first, then additional languages.

How long does it take to caption a one-hour course?

With AI transcription, the bulk of the work is under 2 hours for a one-hour course: about 30 seconds of AI generation per 10-minute lesson, then 15–20 minutes of review per hour of audio to fix names, product terms, and numbers. Human transcription costs $1–3 per minute of audio and takes 24–48 hours turnaround. For most course creators, AI + manual review is the right balance — it catches the 5–8% that actually matters.

Free tool

YouTube transcript extractor — get an SRT from any YouTube lesson in one click

Production workflow

The practical way to apply this guide is to treat subtitles for online courses: how to caption teachable, kajabi, thinkific & podia videos (2026) as a repeatable production workflow, not a one-off fix. Start with the final video file, not the rough edit. Make the content understandable first, make the captions accurate second, and make the styling attractive third. That order prevents the most common mistake in video caption work: spending time on color, animation, or font choice before the words, timing, and placement are correct.

For short-form video, the workflow should be fast enough that you can use it every time you publish. If the process takes 45 minutes per clip, you will skip it when you are busy. A good caption workflow should fit inside the final polish pass: upload the final cut, generate captions, fix the transcript, choose the preset, check safe zones, preview on mute, and export. That is enough for most creator, founder, marketer, and agency clips.

01Watch the video once without captions and write the single idea the viewer must understand.
02Generate or paste the transcript and remove anything that distracts from that idea.
03Set caption timing before styling. Timing problems are more damaging than font problems.
04Choose one readable visual system: outline, box, karaoke, cinematic, or minimal.
05Check the worst frame in the video, not the cleanest frame.
06Preview the export at phone size with sound off.
07Publish only when the message is clear without audio.

Quality checklist before publishing

Use this checklist before publishing any video related to subtitles for online courses. It is intentionally practical. The goal is not to create a perfect studio deliverable; the goal is to avoid the errors that cause people to swipe, misunderstand the message, or miss the call to action.

The first caption appears early enough to support the hook.
No caption is hidden by platform buttons, username text, captions, CTA buttons, or progress controls.
Every important proper noun, number, price, URL, and product name is spelled correctly.
Lines break around phrases instead of splitting random words.
The caption block uses enough contrast on the brightest frame.
The style matches the content category: louder for fast social, cleaner for tutorials, calmer for B2B.
The video still makes sense with sound off.
The export was checked after rendering, not only inside the editor preview.
The caption position is consistent with other videos on the same channel.
The final CTA is visible, readable, and not competing with native platform UI.

Common mistakes to avoid

The biggest mistake is treating captions as decoration. Captions are part of the content layer. They carry meaning, pace, emphasis, accessibility, and retention. If they are late, too small, hidden, or hard to read, the viewer does not experience them as a design flaw; they experience the whole video as harder to watch.

The second mistake is designing for the editor canvas instead of the feed. Editors show a clean preview. Social platforms add buttons, labels, captions, comments, compression, and device variation. Always assume the published version will be harsher than the preview. More margin, stronger contrast, and shorter lines are usually better than a layout that looks elegant only in the editor.

Do not put the most important text at the very bottom of vertical video.
Do not use thin fonts for fast speech or small mobile viewing.
Do not rely on color alone for emphasis if contrast is weak.
Do not generate captions before the edit is final unless you expect to redo timing.
Do not export once and assume every platform will display the file the same way.

How to use SoCaptions for this

SoCaptions is built for the practical version of this workflow: quick caption generation, editable transcript cleanup, readable presets, and export-ready MP4 captions for social video. Use it when the edit is mostly done and the remaining job is to make the words visible, timed, and polished. That is where a focused caption tool is faster than opening a full video editor and rebuilding a caption system from scratch.

The best SoCaptions workflow is simple. Upload the final video, generate captions, fix the transcript, pick a preset, adjust placement for the platform, preview the full clip, and export. For high-volume creators, save a consistent style and reuse it. Consistency matters because viewers learn where to read your captions and begin to recognize your videos before they consciously notice the branding.

Value-first CTA

Try the workflow on a real 20-40 second clip before changing your whole process. One finished export will tell you whether the caption style, placement, and timing are strong enough for your channel.

FAQ

What is the fastest way to handle subtitles for online courses?

The fastest reliable method is to work from the final video, use an automatic caption or transcript tool, fix only the meaningful mistakes, and apply a proven preset instead of designing from zero. Manual control is useful, but manual setup is expensive if you repeat it for every clip. Use automation for the repetitive timing work and spend your attention on clarity, placement, and final review.

Should I use burned-in captions or a caption file?

Use burned-in captions when you need every viewer to see the text immediately in a social feed. Use a caption file such as SRT or VTT when accessibility, toggling, translation, or platform-native playback matters. For important videos, the strongest workflow is often both: a captioned social export for reach and a clean transcript or caption file for accessibility and reuse.

How do I know if the captions are readable enough?

Preview the video on a phone-sized screen with sound off. If you can understand the point without leaning in, pausing, or replaying, the captions are probably readable. Then check the brightest frame, the busiest frame, and the final export after compression. Readability is proven in the worst viewing condition, not the best screenshot.

How much should I customize the style?

Customize enough to fit your brand, but not so much that the captions become harder to read. Most channels need one dependable default and one alternate style for special clips. Constantly changing fonts, colors, and animation makes the content feel less consistent and slows production. A simple repeatable style usually beats a new design for every post.

What should I measure after publishing?

Measure retention, average watch time, completion rate, rewatches, comments that mention clarity, and whether viewers understand the call to action. View count alone is too noisy. If caption improvements work, you should see fewer early drop-offs and better comprehension on clips where the spoken message matters.