Skip to main content

AI Transcription

Model:
Upload a file or record live to get a transcript with accurate timestamps. Supports speaker diarization, SRT/VTT subtitle export and 100+ languages with automatic detection. Cost is proportional to clip length. Runs on Whisper large-v3 and Parakeet (self-hosted), with Wizper and ElevenLabs STT available on paid plans.

Drag and drop audio/video, or click to browse

Up to 500MB, MP3, WAV, MP4, WebM and M4A

OpenAI Whisper, 99 languages, high accuracy.
Token estimate for this clip
Buy tokens
Recording: 0:00

Record from your microphone — your transcript appears after you click Stop.

Transcript
Loading…

Converting your audio to text...

Longer files take more time to process.

Common transcription uses

Interviews + podcasts

Speaker diarization labels each participant separately. Export SRT for a video editor or plain text for a written article.

Auto captions + subtitles

Upload your video, choose SRT or WebVTT, then use /video/subtitle/ to apply the captions. Two steps from clip to captioned video.

Meeting notes

Upload a Zoom or Teams recording to get a full transcript with speaker labels. Run the result through /write/summarize/ for concise bullet-point minutes.

Lectures + lessons

Turn a 90-minute lecture into text, then use /study/flashcards/ or /write/summarize/ to build study materials from it.

Foreign-language audio

Whisper identifies 99 languages automatically. Transcribe in the source language, then pass the text to /translate/ to convert it.

Legal + medical

Word-level timestamps, speaker labels and JSON export give you the detail needed for court-reporter or clinical-note work.

Rewind.ai transcription vs the alternatives

What you getRewind.aiOtter.aiDescriptRev.com
Free daily usage5K+ tokens/day300 minutes/mo1 hr/month
EngineWhisper large-v3, ParakeetProprietaryProprietaryHuman + AI
Languages99English-focused2230+
Speaker diarization
SRT / VTT exportPaidPaid
Public APILimitedLimited
Live streaming STT (free) Paid
Sign-up requiredNoYesYesYes
Competitor figures are based on publicly listed free tiers as of 2026. Verify with each provider before relying on them.
Advanced options
Result
Tokens running low. Get More Tokens
Want better results? Premium models (GPT-5, Claude, Gemini) deliver higher quality.View Plans

Love Rewind.ai? Tell your friends!

Sign up to get a referral link and earn 25,000 tokens per friend.

Want more?Sign up for 5K tokens/day plus a 10K bonus
Sign Up Free
Loading…

Processing your request...

Rewind.ai transcription uses Whisper large-v3. Handles audio files, video files and live microphone input. Speaker diarization, 99 languages, SRT/VTT/TXT export included.

How to Use AI Transcription

1
Upload your file

Drop in an audio or video file. No account needed to start.

2
Transcribe

Whisper turns the speech into text in seconds, with timestamps if you want them.

3
Edit and export

Fix anything that needs it, then copy the text or download it as SRT.

Call the transcription API directly

The endpoint follows the OpenAI REST format and accepts a bearer token, so whatever HTTP client you already use will work without changes. Token usage is metered the same way as in the browser.

curl -X POST https://api.rewind.ai/v1/stt/ \
  -H "Authorization: Bearer sk-rewind-..." \
  -H "Content-Type: application/json" \
  -d '{"file": "@audio.mp3", "language": "auto"}'

AI Transcription FAQ

Free AI Transcription converts audio and video files to text using the Whisper speech-recognition model. Upload a file and get text back in seconds.

Yes! Transcription costs ~4 tokens per second of audio. A 5-minute file costs ~1,200 tokens. You get 2,500/day free.

Whisper supports 99+ languages with automatic language detection. Just upload your audio and it detects the language automatically.

MP3, WAV, M4A, FLAC, OGG, MP4, WEBM and most common audio/video formats.

Whisper is one of the most accurate STT models available, comparable to commercial services. Accuracy varies by audio quality and language.

Yes! Choose between plain text or timestamped output (SRT subtitle format).

Up to 25MB for anonymous users, 100MB for signed-in users. For larger files, split them first.

No! Transcribe files immediately without an account.

Yes, use /transcribe/video/, upload an MP4/WebM/MOV and we extract the audio and transcribe it.

Our transcription uses the same Whisper model and is completely free. Otter charges $8-24/month, Rev charges per minute.

The transcribed text is fully editable, copy, modify and download as needed.

Yes! Access our transcription API at /api/ for batch processing.

Sign up free for 10,000 tokens

Create Free Account

No credit card required

How would you rate this tool?

Love Rewind.ai? Tell your friends!

Rate this page