Create Captions
Add AI-generated captions to your videos with customizable styling
For AI agents: a documentation index is at /llms.txt. Every page is also available as markdown, just append .md to the URL.
Overview
Add professional captions to your videos using AI-powered transcription. This endpoint generates accurate captions with various styling options, emoji support, and keyword highlighting. Perfect for making content accessible and engaging on social media platforms.Rate Limiting
This endpoint is rate limited to 10 requests per minute per API key.sourceUrl or uploadId, but not both.Video Requirements
Duration
Maximum: 15 minutes
File Size
Format
Audio Quality
Plan Limits
| Plan | Max Resolution | Concurrent Projects |
|---|---|---|
| Creator | 1080p | 3 |
| Studio | 4K (2160p) | 10 |
Response
processing- Video is being transcribed and captions are being generatedcompleted- Captions have been generated successfullyfailed- Processing failed due to an error
Upload- Uploaded fileGeneric- External URL
Example Request
Example Response
Processing Workflow
- Audio Extraction - Audio is extracted from the video file
- Transcription - AI transcribes the speech with precise timing
- Caption Generation - Captions are formatted with your chosen style
- Enhancement - Emojis and highlights are added if enabled
- Rendering - Final video is rendered with embedded captions
Caption Features
AI Transcription
Style Presets
Emoji Support
Keyword Highlights
Best Practices
- Audio Quality: Clear audio produces more accurate transcriptions
- Language Selection: Specify the language for better accuracy
- Style Matching: Choose presets that match your brand and content type
- Translation: Use translation for multilingual audiences
Use Cases
Video Platforms
Accessibility Compliance
Social Media Tools
Learning Management
Next Steps
After creating a caption project:- Monitor progress with Get Project Status
- Retrieve the captioned video with Get Project Clips
- Download and distribute your captioned content
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Direct URL to a video file (alternative to uploadId)
Upload ID from a previously uploaded file (alternative to sourceUrl)
Caption style preset ID. Defaults to system_beasty if not specified.
Primary language of the video content (auto-detected if not provided)
Language to translate captions to
Script format for transcription output
native, roman Whether to add contextual emojis to captions
Whether to highlight keywords in captions
Output resolution (only applicable when sourceUrl is provided)
720, 1080, 1440, 2160 Response
Successful response
queued, prepped, draft, processing, finalizing, completed, invalid, expired, failed, error clipping, captions, reframe, dubbing, transcription Upload, Youtube, Vimeo, TwitchVod, Twitter, RumbleEmbed, Generic talking, screenshare, gaming landscape, portrait, square native, roman