The Speed Gap: Speaking vs Typing
The average person types at 40 words per minute. The average person speaks at 130-150 words per minute. Professional typists reach 80-100 WPM. Professional speakers and auctioneers exceed 200 WPM. No matter how you compare, speaking is fundamentally faster than typing. But why?
The answer lies in the neuroscience and biomechanics of human communication. Speaking is one of the most natural and practiced skills humans possess. Typing is a relatively recent invention that requires a complex translation from thought to finger movements. Understanding this gap explains why voice typing is not just a convenience but a fundamental productivity advantage.
The Neuroscience of Speech
Speech Is Hardwired
Humans have been speaking for at least 100,000 years. Our brains have evolved specialized regions for speech production: Broca's area for speech motor planning and Wernicke's area for language comprehension. These neural circuits are deeply optimized through both evolution and years of daily practice from early childhood. By adulthood, speech production is almost entirely automatic.
Typing Is Learned
Typing, by contrast, is a skill most people learn in their teens or twenties. It requires translating language into a learned motor sequence: identifying each letter, locating it on the keyboard, and pressing it with the correct finger. This translation step creates a bottleneck that does not exist in speech. Even expert typists must perform this letter-by-letter conversion, which fundamentally limits speed.
Cognitive Load
Research shows that typing requires more conscious attention than speaking. When you type, part of your cognitive resources are devoted to the mechanics of typing: letter selection, hand positioning, error correction. When you speak, these mechanical concerns are absent, freeing your full cognitive capacity for composing your message. This is why many writers find that dictated prose flows more naturally and captures their thinking more authentically.
The Biomechanics
Speech Uses Larger Muscle Groups
Speech production involves the diaphragm, larynx, tongue, lips, and jaw — relatively large muscle groups capable of rapid, coordinated movement. Typing involves only the small muscles of the fingers, which have more limited speed and endurance. The larger muscle groups used in speech can sustain rapid output for much longer without fatigue.
Parallel vs Sequential Processing
Speech production is highly parallel: your brain plans upcoming words while you are still producing the current one. The articulatory system can coarticulate sounds, blending them together for efficiency. Typing is more sequential: you press one key, then the next. Even touch typists who overlap keystrokes are limited by the sequential nature of pressing individual keys.
Fatigue and RSI
The small, repetitive motions of typing cause fatigue and, over time, repetitive strain injuries. Speaking does not carry these risks. For people who write extensively, the physical sustainability of voice typing is a significant advantage. Read more about speech-to-text for RSI prevention.
The Numbers
Let us quantify the productivity difference for a knowledge worker who writes 5,000 words per day (emails, documents, messages, notes):
- At 40 WPM typing: 125 minutes (over 2 hours) of pure typing time
- At 150 WPM speaking: 33 minutes of dictation time
- Time saved: 92 minutes per day
- Weekly savings: 7.7 hours
- Annual savings: ~400 hours (10 full work weeks)
Even accounting for the editing time that dictated text requires, the net savings are substantial. Most users find that editing a dictated draft takes about 20-30% of the time saved on initial composition, resulting in a net productivity gain of 50-60%.
Quality Differences
Dictated Text Sounds More Natural
Because speech is more natural than typing, dictated text often has a more conversational, readable quality. Sentences flow better, vocabulary is more varied (you use words in speech that you might not bother typing), and the overall rhythm of the prose feels more human. This is why content creators and book authors increasingly dictate their first drafts.
Thinking While Speaking vs Thinking While Typing
When typing, people often stop mid-sentence to think, creating fragmented writing that requires substantial revision. When speaking, the pressure to continue produces more complete, flowing thoughts. The result is first drafts that, while they need editing, often have better structure and flow than typed first drafts.
Making the Switch
If you are convinced by the science but new to voice typing, here is how to start:
- Download Scrybapp on your Mac. It runs Whisper AI locally for private, fast transcription.
- Start with emails and messages — these are short, conversational, and forgiving of imperfection.
- Gradually expand to documents, notes, and longer-form writing.
- Read our accuracy improvement guide and 10 voice typing tips.
The initial adjustment period is typically 3-5 days. After that, most people find voice typing natural and wonder how they managed without it.
Get Started
Download Scrybapp with 3 minutes of free transcription and experience the 4x speed difference yourself. Your brain is built for speech — let it work the way it was designed to.
Related: 10 tips for Mac users, RSI prevention, best Mac dictation apps.