Text to Speech & Speech to Text
Type text and hear it read aloud, or speak to get typed text. 100% browser-native — free, fast and private.
How the Text to Speech tool works
Text to Speech: Type or paste any text into the text area. Select a voice from the dropdown (voices are provided by your browser and operating system). Adjust the speed, pitch and volume sliders. Click Speak and your browser will read the text aloud. Everything runs locally — your text is never sent to any server.
Speech to Text: Switch to the Speech to Text tab, select your language and click the microphone button. Start speaking and the browser will convert your spoken words into typed text in real time. You can copy the result or continue dictating.
Why the Text to Speech Tool is Useful
There's a well-known proofreading trick that professional editors use: read your work aloud. Hearing text read back exposes errors that silent reading misses — awkward phrasing, repeated words, sentences that are grammatically correct but rhythmically wrong. This tool does that automatically. Paste your draft, click Speak, and hear every sentence. Beyond proofreading, it's a practical accessibility tool for anyone who prefers audio to reading, a study aid for processing dense material differently, and a pronunciation guide for language learners. The reverse tab — speech to text — turns any microphone into a dictation tool, so you can capture thoughts hands-free and see them typed in real time. Both features run entirely in your browser using the Web Speech API, which means no server, no uploads, no account — just open the page and use it.
Key Features
- Browser-native voice selection: All voices installed on your operating system appear in the dropdown. Most modern browsers on Windows, macOS, and mobile devices offer multiple voices across many languages and accents.
- Language filter: A second dropdown lets you filter voices by language code so you can quickly find an English, Hindi, Spanish, or French voice without scrolling through a long unfiltered list.
- Speed, pitch, and volume sliders: Control the speaking rate from 0.1× (very slow, useful for language learners) to 3× (fast, useful for quickly scanning long text). Pitch and volume can also be independently adjusted.
- Pause and resume: The Speak button becomes a Pause button mid-playback. Resume from where you paused rather than starting over, which is useful for long passages where you want to stop and re-read a section.
- Speech to Text tab: Switch to the STT tab to dictate into a textarea. 13 languages are supported including English (US/UK/India), Hindi, Spanish, French, German, Japanese, Chinese, Arabic, and more. Continuous recognition with auto-restart keeps capturing until you click Stop.
- Character counter: A live character count below the text area helps when you're working within a character limit for a social post, email subject line, or other constrained format.
Real-Life Use Cases
- Proofreading and editing: Paste your article, essay, or email draft and listen to it at 1× speed. You'll catch repeated words, missing articles, and unnatural sentence constructions that you've been reading past because your brain autocorrects familiar text.
- Accessibility for vision impairment or dyslexia: Listening to written content is often significantly easier than reading it for people with dyslexia, visual impairment, or reading fatigue. The tool requires no accessibility software installation.
- Language learning and pronunciation: Paste a sentence in a language you're learning and hear a native voice read it. Adjust speed to 0.7× to hear each word clearly before attempting your own pronunciation.
- Hands-free dictation: Use the Speech to Text tab to capture meeting notes, ideas, or drafts while your hands are busy with something else — cooking, commuting, or taking physical notes at a whiteboard.
- Content creation voice drafts: Dictate a rough draft of a blog post or script into the STT tab, then copy the text to a writing tool for editing. Faster for many people than typing a first draft from scratch.
Who Can Use This Tool
Writers and editors who proofread by listening, students with dyslexia or reading difficulties who process information better through audio, language learners who want to hear correct pronunciation, professionals who dictate rather than type, developers testing the Web Speech API, content creators drafting scripts or voiceovers, and anyone with long-form text they'd rather hear than read. The tool works on desktop and mobile browsers — Chrome and Edge offer the broadest voice selection and the most reliable speech recognition support.
Tips & Best Practices
- Slow down for proofreading: Set the speed to 0.8× or 0.9× when listening for errors. At 1× the voice moves at a natural reading pace, which can be fast enough that you miss mistakes. Slower playback gives your brain time to notice inconsistencies.
- Filter by language for multilingual content: If you're working with content in multiple languages, use the language filter to switch quickly between a voice for each language rather than scrolling through an unfiltered list every time.
- Use the STT tab for brain dumps: When you have a lot to say and don't want the friction of typing, speak your thoughts into the STT tab and clean up the text afterward. Most people speak faster than they type and with fewer editing interruptions.
- Speak clearly for better STT accuracy: The browser's speech recognition engine performs best with a clear microphone, moderate speaking pace, and minimal background noise. Headphone microphones typically outperform laptop built-ins for dictation quality.
- STT works best in Chrome or Edge: Firefox has limited or no Web Speech API support. For the speech-to-text tab specifically, Chrome or Edge on desktop gives the most consistent recognition performance.