for authors · 3 min read

research interviews and dictated drafts that stay on your machine.

book projects produce two kinds of audio that don't belong on a third-party server: source interviews you've promised to keep confidential, and manuscript dictation that's still yours and yours alone. on-device transcription handles both without the audio ever leaving your laptop.

when this matters for authors

most transcription tools are built for short, disposable audio: a meeting, a podcast episode, a one-off interview. a book project doesn't look like that. the audio accumulates over years, the material is unpublished by definition, and a publisher contract or a source agreement may already constrain where the file can sit.

the concrete concerns:

two jobs this fits

authors send us audio for two recurring tasks. they look different and the privacy argument lands the same way for both.

workflow

  1. capture the audio. record dictation in the browser with the voice recorder, or upload existing files from a phone, a field recorder, a zoom call with a source.
  2. open audiohighlight, select private mode. transcription runs locally in the browser via WebGPU + Whisper. nothing uploads.
  3. review the transcript. fix speaker labels for "me" and the interviewee once each, and they propagate across the file. click any word to replay the audio at that second — the verification loop you'll want when a quote goes into the book.
  4. export. .docx for scrivener, word, or google docs. .md for obsidian or a manuscript repo. the long-form export also works for chapter drafts: it collapses the speaker-by-speaker structure into clean prose you can edit into a chapter.
  5. close the tab. the audio file stays on your machine; the transcript is wherever you saved it. nothing on our side.

where on-device fits and where it doesn't

what we don't claim

we don't claim verbatim accuracy. for direct quotes that go in the book, listen to the audio with the transcript open and verify the ones that matter. the editor is built for this — click the word, hear the second.

we don't claim the on-device model handles every accent and language equally. english is best. for non-english source interviews, cloud mode is more accurate; private mode degrades on languages outside the on-device model's strong set.

pricing

$0.25 per minute. a 90-minute oral-history interview costs $22.50; an hour of dictation runs $15. private mode and cloud mode are the same price. no subscription, no minimum.

book-length projects accumulate fast — 10+ hours of audio is common, 50+ hours not unusual on biography or oral-history work. at that scale, a flat per-project rate often makes more sense than per-minute. write hello@audiohighlight.com with a rough total runtime and we'll quote a flat rate for the project.

related

lifetime deal while we're in beta.

join the waitlist to get a lifetime deal — your first month free, plus 50% off forever. private invite when we ship; no drip campaign.