Use Cases15 min read

How to Write a Book Using Voice Dictation

A practical guide to writing novels, non-fiction books, and long-form manuscripts using speech-to-text dictation, covering workflow, tools, productivity tips, and common pitfalls.

Scrybapp

Scrybapp Team

The Case for Dictating Your Book

The idea of dictating a book is not new. Winston Churchill dictated many of his works. Barbara Cartland, one of the most prolific novelists in history, dictated all of her 723 novels. More recently, bestselling authors like Kevin J. Anderson, Monica Leonelle, and Dean Wesley Smith have championed dictation as a tool for increasing writing speed without sacrificing quality.

The appeal is simple mathematics. Most people type at 30 to 60 words per minute. Most people speak at 130 to 170 words per minute. If you can dictate prose at even half your natural speaking speed — accounting for pauses, restarts, and thinking time — you are still producing words at roughly twice your typing speed. For a book-length project, this difference is transformative.

Consider a 80,000-word novel. At a typing speed of 50 words per minute with a net output (accounting for editing-as-you-go) of perhaps 1,000 words per hour, that novel requires 80 hours of pure writing time. At a dictation speed of 2,500 to 3,000 words per hour (a commonly reported rate for experienced dictators), the same novel requires 27 to 32 hours. That is 50 hours saved — more than a full work week.

But speed is only part of the story. Many authors who dictate report that the quality of their prose changes — not worse, but different. Dictated prose tends to have a more natural rhythm, more varied sentence structure, and a stronger narrative voice. This makes sense: when you speak your story, you are essentially telling it, and human beings have been telling stories orally for millennia before writing was invented.

Getting Started: The Setup

Choosing Your Dictation Software

For book-length dictation on a Mac, you need a tool that is accurate, fast, and — critically — reliable for extended sessions. Scrybapp is built for exactly this use case. It uses OpenAI's Whisper AI model running locally on your Mac, which means there is no internet dependency (you can dictate anywhere, including on an airplane or in a cabin in the woods), no cloud privacy concerns (your unpublished manuscript stays on your device), and no subscription fees eating into your writing budget.

The system-wide compatibility means Scrybapp works in whatever writing application you prefer: Scrivener, Ulysses, Google Docs, Microsoft Word, iA Writer, or any other tool. There is no need to dictate into one application and then transfer the text elsewhere.

Hardware for Book Dictation

Your microphone choice matters for extended dictation sessions:

  • USB desk microphone — Best for home office dictation. Models like the Rode NT-USB Mini or Blue Yeti Nano provide excellent audio quality for dictation without the bulk of a studio microphone.
  • Headset microphone — Ideal if you pace while dictating (many authors do). A wireless headset gives you freedom of movement while maintaining consistent microphone distance.
  • Lapel microphone — Good for dictating while walking. Some authors dictate during walks, either outdoors or on a treadmill, using a lapel mic connected to their Mac or iPhone.
  • AirPods or earbuds with microphone — Convenient and portable. Audio quality is not as high as a dedicated microphone, but it is sufficient for dictation in most environments.

The key principle is consistency. Whatever microphone you choose, use it consistently so that you and the speech recognition model settle into a reliable pattern.

The Dictation Writing Process

Phase 1: Outlining (Typed or Dictated)

Most authors who dictate successfully do some form of outlining first. The level of detail varies enormously — from a brief one-page synopsis to a detailed scene-by-scene breakdown — but having a roadmap prevents the most common problem in dictation: losing your place.

When you type, the words on the screen serve as a constant reminder of where you are in the narrative. When you dictate with your eyes closed or while pacing, that visual anchor is absent. An outline — whether on a nearby screen, printed on paper, or memorized — provides the structural guidance that keeps the narrative on track.

Some authors outline by typing (it is a more analytical task), while others dictate their outlines as well. Either approach works. The important thing is to have a plan before you start dictating prose.

Phase 2: First Draft Dictation

This is where the magic happens. Here are the principles that successful book dictators follow:

  • Do not edit while dictating. This is the most important rule. When you see a transcription error or a clumsy phrase in the text, resist the urge to stop and fix it. Keep dictating forward. Editing and creating are separate cognitive processes, and mixing them destroys the flow state that makes dictation so productive.
  • Speak in complete sentences. Rather than dictating fragmentary phrases and assembling them, speak in full sentences. This produces more natural prose and fewer transcription errors.
  • Describe what you see. For fiction writers, a useful technique is to visualize the scene and describe what you see, hear, and feel. This produces vivid, sensory prose that reads well.
  • Use character voices. When dictating dialogue, speak in the character's voice — their rhythm, word choice, and emotional tone. This produces dialogue that sounds authentic rather than flat.
  • Accept imperfection. Your dictated first draft will be rougher than a typed first draft. It will have transcription errors, awkward transitions, and places where you lost your train of thought. This is normal and expected. The revision process will clean it up, and the time saved in creation more than compensates for the additional editing.

Phase 3: Revision and Editing

After dictating a chapter or section, the revision process is different from revising a typed draft. Here is what to expect:

Transcription errors are the most obvious difference. Words that sound similar may be confused (their/there/they're, complement/compliment, etc.), and occasional words may be completely wrong. These are easy to spot and fix during a read-through.

Homophone issues are common. Dictation software cannot always distinguish between words that sound identical, so you will need to check for correct usage of their/there/they're, its/it's, your/you're, and similar pairs.

Punctuation may need attention. While modern speech-to-text engines handle punctuation reasonably well — inserting periods, commas, and question marks based on speech patterns — you may need to adjust punctuation during editing.

Prose rhythm will be different from your typed writing. This is not necessarily a problem — many authors find their dictated prose has a more natural flow — but you may want to adjust the rhythm during editing to match your established voice.

For revision, many authors use Microsoft Word's Track Changes or Scrivener's revision mode to mark and refine their dictated text. The combination of dictated first draft plus careful typed revision produces excellent results.

Fiction vs. Non-Fiction Dictation

Fiction

Fiction dictation works best for narrative prose: scene descriptions, action sequences, dialogue, and internal monologue. These are the most “verbal” elements of fiction — the parts that benefit most from the natural rhythm of speech.

Some elements of fiction writing are harder to dictate: complex plotting, timeline management, and continuity tracking. These analytical tasks are often better handled by typing and visual tools (outlines, timelines, character sheets). The most productive fiction authors use dictation for the generative work (creating prose) and typing for the structural work (planning and organizing).

Genre fiction authors — romance, thriller, fantasy, science fiction — tend to be the most enthusiastic adopters of dictation because their genres reward consistent, prolific output. An author who can dictate a novel draft in three to four weeks rather than eight to ten has a significant competitive advantage in fast-moving genre markets.

Non-Fiction

Non-fiction dictation is in some ways easier and in some ways harder than fiction. It is easier because non-fiction is inherently explanatory — you are explaining something you already know, which maps naturally to speech. It is harder because non-fiction often requires precision, citations, and technical terminology that demand careful attention during revision.

The most effective non-fiction dictation workflow involves thorough research and outlining before any dictation begins. Once you know what you want to say in each section, dictating the explanation is fast and natural. Think of it as giving a lecture or explaining the topic to a friend — that is essentially what non-fiction dictation is.

For non-fiction books that include data, statistics, or quoted material, a hybrid approach works well: dictate the narrative and explanatory content, then type in the specific numbers, quotes, and citations during the editing phase.

Building Your Dictation Speed

If you have never dictated extended prose before, expect a learning curve. Most authors report that their first dictation sessions feel awkward and slow — slower than typing, in some cases. This is normal. You are building a new skill.

Week 1-2: Getting Comfortable

Start with short sessions of 15 to 20 minutes. Focus on getting comfortable with the mechanics: activating dictation, speaking at a natural pace, and resisting the urge to edit. Do not worry about speed or quality. The goal is simply to build the habit of producing words by voice.

Week 3-4: Building Speed

Extend your sessions to 30 to 45 minutes. By now, the mechanical aspects should feel more natural, and you can focus on flow. Try dictating with your eyes closed or while looking away from the screen — this prevents the visual distraction of watching text appear and encourages a more natural speaking rhythm.

Month 2+: Peak Productivity

After a month of regular practice, most authors report dictation speeds of 2,000 to 4,000 words per hour of net output (accounting for pauses and restarts). Some authors report sustained speeds of 5,000 or more words per hour once they are fully comfortable with the process. At these speeds, a full novel draft can be completed in 20 to 40 hours of dictation time.

Common Challenges and Solutions

The Inner Critic

The biggest obstacle to dictation is not technical — it is psychological. Many writers have a powerful inner critic that edits as they create. When typing, this manifests as backspacing, deleting, and rewriting. When dictating, the inner critic has nowhere to go, and the resulting discomfort can be paralyzing.

The solution is to give yourself explicit permission to produce bad first drafts. Remind yourself that every word you dictate can be revised later. The goal of dictation is to get words on the page, not to get perfect words on the page.

Self-Consciousness

Speaking your book aloud can feel strange, especially if you live with other people or work in a shared space. Solutions include: dictating in a private room, using a closet or car as a dictation booth, dictating during walks, or dictating quietly (modern speech-to-text engines work well even at conversational volume).

Dialogue Attribution

A practical challenge in fiction dictation is handling dialogue tags and attribution. Many authors develop shorthand conventions: saying “new paragraph” before dialogue, using character names before their lines (“Sarah said...”), or dictating dialogue and narration in separate passes.

Maintaining Continuity

When dictating a novel, it is easy to lose track of character details, timeline, and plot threads. Keep a reference document (character sheet, timeline, plot outline) visible while dictating. Some authors use a second monitor or an iPad positioned nearby for this purpose.

Tools and Integrations for Book Authors

Scrybapp integrates with all the major writing tools used by book authors:

  • Scrivener — The most popular book-writing software. Dictate directly into Scrivener documents and use its organizational features to manage your manuscript.
  • Ulysses — A popular Mac-native writing app. Dictation works seamlessly in the clean, distraction-free interface.
  • Microsoft Word — Still the standard for manuscript submission. See our Word dictation guide for specific tips.
  • Google Docs — Useful for collaborative writing and sharing with editors. See our Google Docs dictation guide.
  • iA Writer, Bear, Obsidian — Markdown-focused writing apps that work well with dictation for authors who prefer plain text.

Because Scrybapp operates at the system level, it works in any text input field on your Mac. If you can type in it, you can dictate in it. Explore our full integration list for more details.

The Economics of Book Dictation

For professional authors, the economics are compelling. Consider an author who writes two books per year, each 80,000 words. By switching from typing to dictation, they save approximately 100 hours per book, or 200 hours annually. That time can be reinvested in writing additional books (increasing annual output from 2 to 3 or even 4 books), in marketing and promotion, or in simply maintaining a healthier work-life balance.

The cost of entry is minimal. Scrybapp is a one-time purchase of 39 euros — less than the price of most writing craft books. A decent microphone costs $30 to $80. The total investment is under $100, with a potential return of hundreds of hours per year.

Compare this to hiring a transcriptionist (hundreds to thousands of dollars per project) or paying for a subscription-based dictation service (a recurring monthly cost that adds up over the years of a writing career). Scrybapp's one-time pricing model is particularly attractive for authors, who tend to have irregular income and benefit from predictable, one-time expenses.

Success Stories and Productivity Data

The author community has accumulated significant data on dictation productivity. Commonly reported statistics include:

  • Average typing speed for authors: 800 to 1,500 words per hour of net output
  • Average dictation speed after learning period: 2,000 to 4,000 words per hour of net output
  • Time to complete a 80,000-word first draft by typing: 53 to 100 hours
  • Time to complete a 80,000-word first draft by dictation: 20 to 40 hours
  • Additional editing time for dictated drafts vs. typed drafts: 10 to 20% more (offset by the much faster creation time)

Authors who have successfully transitioned to dictation consistently report that the total time from concept to finished manuscript is shorter with dictation, even accounting for the additional editing time.

Getting Started with Your Book

If you have been thinking about writing a book — or if you are mid-book and struggling with the pace — voice dictation may be the tool that gets you across the finish line. The technology is mature, the learning curve is manageable, and the productivity gains are real.

Start small. Download Scrybapp and use the free 3-minute trial to dictate a scene, a chapter opening, or an outline. Get comfortable with the mechanics. Then commit to dictating for 30 minutes a day for two weeks. By the end of that period, you will have a clear sense of whether dictation works for your writing process — and you will likely have several thousand more words of your manuscript than you would have produced by typing alone.

For more on building an efficient writing workflow, see our guides for writers, our post on voice-based note taking, and our comparison of the best speech-to-text tools for Mac.

Try Scrybapp Free

Experience the fastest, most private speech-to-text on macOS. 3 minutes free, no sign-up required.

Download for macOS