Transcription
Explore the best Transcription tools powered by AI
Voice AI platform for developers.
Enterprise Voice AI platform designed for developers building voice-first products using speech-to-text, text-to-speech, or speech-to-speech APIs. Over 200,000 developers build with Deepgram's voice-native foundational models, accessed via APIs or self-managed software. Start building with $200 in free credits!
Transform speech into meaning with one robust API
Build new AI products with voice data leveraging AssemblyAI’s industry-leading Speech AI models for accurate speech-to-text, speaker detection, sentiment analysis, chapter detection, PII redaction, and more.
Join 5,000+ industry-leading companies—including Fireflies.ai, Glean, and Loop—unlocking the power of voice data and launching best-in-class products and experiences.
Voice AI for developers
Build, test and deploy voicebots in minutes rather than months.
Fast and 100% offline speech-to-text for macOS
Introducing Paraspeech - the fastest offline speech-to-text app for Mac: • 100% offline and on-device • just 40ms cold-start time • 300ms transcriptions • barely touches the battery and needs just 200MB of RAM • No subscription!
All-in-one platform for voice and chatbots
Smartly.AI is an innovative SaaS platform for creating, monitoring and deploying voice and chat applications.
Multilingual speech-to-text API trained on 100M+ utterances
Speechflow is a multilingual Speech-to-Text API that offers state-of-the-art accuracy in 13 languages, not just English. According to Speechflow, this is the first time that languages other than English have reached the same level of recognition accuracy as English.
Voice AI Suite for Enterprises
Our conversational AI agents talk, text, and email your customers for you. Super fast and super safe. They work right inside the tools you already use. No messy add-ons, no data leaks. Just smooth, fast help for every customer.
Capture every conversation
The most accurate transcription, translation and analytics platform for English, Arabic, Indian and mixed languages. Transcribe any file or real-time speech in a user-friendly platform, or integrate VoiceAI into your applications with just a few lines of code.
Speech-to-Text and Text-to-Speech with AI Power
Voiser's AI-powered platform offers accurate speech-to-text and natural-sounding text-to-speech services in over 75 languages. Perfect for content creators, podcasters, and businesses seeking high-quality voiceovers and transcripts.
Voiser - AI-powered tool
AI-powered, text-to-speech, speech-to-text, high-quality voiceovers, transcripts, 75 languages, 135 dialects, personalized voiceovers, fine-tune, accuracy, transcribing, editor, API, developers.
Voiser is an AI-powered platform that offers text-to-speech and speech-to-text services in over 75 languages and many accents. With a large selection of voices to choose from, users can generate natural-sounding voiceovers in minutes and fine-tune them with the Voiser Studio. The speech-to-text service is advertised as highly accurate, making transcribing audio files into text a breeze. Voiser's API allows developers to integrate its features into their own applications.
Key Features
- Text-to-speech and speech-to-text services in over 75 languages and many accents
- A large selection of voices to choose from
- Fine-tune voiceovers with the Voiser Studio
- Speech-to-text service advertised as highly accurate
- Easy-to-use editor to customize transcripts
- API for developers to integrate into their own applications
Main Use Case
- Create high-quality voiceovers for videos, podcasts, and presentations
- Transcribe audio files into text quickly and accurately
- Customize voiceovers and transcripts to fit
The next generation of the Phi family from Microsoft
Microsoft introduces Phi-4-multimodal & Phi-4-mini! 🚀 Phi-4-multimodal integrates speech, vision & text for seamless interactions, while Phi-4-mini excels in text tasks with high accuracy. Now available on Azure AI Foundry, HuggingFace & NVIDIA API Catalog.
AI That Hears How You Speak
Nova Sonic is Amazon's speech-to-speech AI on Bedrock. It understands how you speak (tone, pace) and responds with an adaptive, expressive voice in real time.
Boost watch time and engagement with burned-in subtitles
Generate beautiful subtitles. Double watch time and engagement.
LexiTalk AI creates your personalized English environment.
Language is a tool for communication. True mastery comes from using it, not memorizing it.
At LexiTalk AI, you enter a personalized English environment where everything revolves around using English: 🎙️ AI speaking practice with instant feedback 🎧 Personalized podcasts from your saved words 🎮 Fun vocabulary games to make learning stick 📊 Smart exam prep for IELTS / TOEFL / TOEIC 📝 The unique 5-Step Vocabulary Method
Building LexiTalk AI — https://www.lexitalkai.com
More Than Transcribe - AI Voice Typing on Desktop & Mobile
Voice Typing Software on Every Device, All Your Apps.
More than transcription: use your voice to write, translate, and create across Windows, macOS, iOS, and Android, inside any software you already use.
Offline Mac AI with full privacy and encrypted chats
PureLocalAI is a fully offline AI assistant for Mac. Brainstorm privately with unrestricted AI, chat with sensitive & lengthy PDFs, use speech-to-text & voice replies, create custom AI personas, and enjoy AES-256 encryption — all local, no cloud, full control.
Capture tasks instantly with your voice: fast, simple, smart
ListN turns your spoken thoughts into clear, organized tasks — no typing needed. Capture ideas 3× faster than typing. Perfect for busy professionals, parents, and anyone juggling too much at once.
CareScribe
Smarter Clinical Documentation Powered by AI
CareScribe is India’s first AI-powered clinical documentation platform for doctors. It allows you to speak in Hindi, Tamil, Telugu, Kannada, Malayalam, or English, and instantly generates accurate notes and compliant records in your native language.
FFTrans
FFTrans Parakeet is an offline transcription app for macOS. It's completely free, runs entirely offline, and includes speaker diarization using the latest parakeet-tdt-0.6b-v3 model. No cloud, no tracking: just fast, local processing.
Turn Your Voice into Notes and Smart Reminders
Simply speak naturally, and Voicely instantly converts your words into voice notes, smart reminders, and categorized notes, no typing needed. It understands your intent, automatically detects smart tags, dates and times.