Desktop app + web workspace for dictation and transcription

DictateAnywhere

Use the desktop app to dictate into any application with a global hotkey, then turn raw voice into polished notes, summaries, emails, and documents in VerbScribe. Live meeting intelligence from your files and web helps you answer questions while the conversation is still happening.

Hotkey
Dictation Anywhere
Live
Meeting Answers
RAG
File-Grounded Context
BYOK
Provider Control
Live Recording
Botless Meeting Capture
Bot Meeting Mode
Audio Upload
YouTube Links
Chrome Extension
Desktop App

Dictate Anywhere on Your Desktop

Download VerbScribe Flow for Windows, macOS, or Linux. Use the global hotkey to transcribe your voice directly into any application.

VerbScribe Flow

Version 0.1.12~5 MB
Global Hotkey
Win+Alt+V to start dictating anywhere
Instant Paste
Transcribed text pasted directly into any app
Works Everywhere
System-wide dictation in any application
Windows 10 or later
Available for Windows, macOS, and Linux

Powered by industry-leading AI providers

D
DeepgramSpeech
A
AssemblyAISpeech
G
Google GeminiAI
O
OpenAIAI
A
AnthropicAI
O
OpenRouterAI
X
X.AI GrokVoice
S
StripePayments
D
DeepgramSpeech
A
AssemblyAISpeech
G
Google GeminiAI
O
OpenAIAI
A
AnthropicAI
O
OpenRouterAI
X
X.AI GrokVoice
S
StripePayments
Core Capabilities

Everything You Need.
Focused On The Work.

VerbScribe starts as a fast dictation tool and becomes a meeting sidekick when your files, web research, and follow-up reports matter.

Dictate Anywhere

Press the desktop hotkey, speak naturally, and insert clean text into email, chat, documents, CRM notes, or any focused app.

Meeting Intelligence

Run botless capture from your desktop or invite a visible notetaker. Live insights surface answers, risks, and follow-ups while people are still talking.

Knowledge-Grounded Output

Connect project files, Gemini File Search stores, and web research so answers and reports can cite the context you actually use.

Custom Writing Styles

Turn rough speech into emails, summaries, reports, action items, or your own prompts without changing tools.

Custom Prompt Styles

Turn rough speech into finished work.

Styles are reusable AI prompts. Create your own, trigger them by voice, and let VerbScribe combine your dictation with clipboard context, your selected model, and Gemini File Search when document grounding is enabled.

Voice workflow
One hotkey, one command, one output
RAG optional
1Copied context
Customer email, draft, or selected reference text
2Spoken command
reply to this: confirm availability and ask for the deployment date
3Style prompt
Respond to Email, Executive Brief, Action Items, or your own prompt
4Knowledge
Gemini File Search when RAG is enabled and a store is configured
5Result
A polished reply, summary, brief, report, or file-grounded answer

Reply from the clipboard

reply to this:

Copy an email, speak your intent, and VerbScribe writes the reply in the active text field.

Trigger any style by voice

select style professional email:

Use spoken commands for formal emails, informal notes, summaries, briefs, and custom styles.

Ground output in your files

answer using my documents

When RAG is enabled, Gemini File Search can provide the context behind the generated answer.

Keep model control

chosen in the web app

Use the model you selected in VerbScribe settings, with included quota or your own provider keys.

Example styles users can build
Sales follow-upTechnical support replyCRM noteExecutive briefMeeting summaryProposal paragraphAction itemsKnowledge-grounded answer
Try Desktop Dictation

Four Powerful Modes.
Infinite Possibilities.

Whether you need quick transcription, conversational AI, voice agents, or meeting assistance — VerbScribe adapts to your workflow.

Botless Meeting Capture

Start VerbScribe locally during Teams, Google Meet, Zoom, or a phone call. You remain responsible for participant consent and local recording rules, then the meeting assistant can surface file and web context inside your private workspace.

Bot Meeting Mode

Invite a visible VerbScribe Notetaker when the meeting owner and participants should see a bot in the call. Consent is explicit, transcript insights are stored for the owner, reports are owner-reviewed, and external sharing waits for approval.

Single Mode

One-shot transcription & transformation

Record or paste text, apply an AI style, and get your transformed output. Perfect for quick voice-to-email, voice memos, or content creation.

  • Real-time transcription display
  • Apply custom AI styles
  • Export to TXT, PDF, DOCX
  • Speaker diarization
  • Timestamp navigation
Try Single Mode
VerbScribe — Single Mode
Recording...
00:32

Input From Anywhere

Voice, files, YouTube, or text — VerbScribe handles it all. Transform any audio or text into exactly what you need.

Live Recording

Real-time voice capture with instant transcription. Premium providers for professional accuracy.

Deepgram Nova-3AssemblyAIGemini+1

Audio Upload

Upload MP3, WAV, M4A, and more. Process recordings, podcasts, or voice memos.

MP3WAVM4A+2

YouTube Videos

Paste any YouTube URL. We extract and transcribe the audio automatically.

Video TranscriptionAuto-captionsLectures+1

Text Input

Paste existing text for AI transformation. Apply styles without transcription.

Copy/PasteMarkdownPlain Text

Browser Extension

Transcribe directly into any text field on the web. Gmail, Docs, Slack, anywhere.

ChromeFirefoxEdge

Chrome Extension

Transcribe anywhere on the web. Your styles and context travel with you.

Learn More
AI Transformation

Powered by the World's
Best AI Models

Access 400+ AI models through OpenRouter. Use your own API keys (BYOK) and choose the perfect model for each task.

GPT-4o
Claude 3.5
Gemini Pro
Llama 3.1
Mistral
Qwen
DeepSeek
Command R+
+ 392 more

Including free tier models — no API key required to get started

Document RAG

Context-aware intelligence

Upload your documents and let AI use them as context. Get responses grounded in your actual content — not generic answers.

Document Upload
Upload PDF, DOCX, TXT files. AI reads and uses them as context.
Gemini File Search
Enterprise-grade RAG with Google Cloud infrastructure.
Local Embeddings
Privacy-first option using local Xenova transformers.
PDFDOCXTXTMarkdown

Custom Prompt Styles

Reusable instructions for repeat work

Create unlimited prompt styles for the outputs you write every day. Use them from the web app or trigger matching styles by voice from the desktop app.

Email
ProfessionalCasualReplyFollow-up
Business
Executive BriefCRM NoteProposalSupport Reply
Notes
Meeting NotesAction ItemsSummaryBullet Points
Research
File-Grounded AnswerReportStudy GuideBriefing
Study Mode

Generate interactive quizzes and study materials from any content

BYOK (Bring Your Own Keys) — Use your own API keys for full control, privacy, and cost management.
Start free with our included quota, upgrade anytime with your own keys.

Browser Extension

VerbScribe.
Everywhere.

No other transcription tool does this. Our Chrome extension brings voice transcription with AI styles and document context to every text field on the web. Write emails, messages, documents — all with your voice.

Voice Input Anywhere

Click the floating mic button to start transcribing into any text field.

Apply Your Styles

Your custom styles travel with you. Transform voice into emails, notes, or any format.

Document Context

Access your uploaded documents for context-aware responses anywhere on the web.

Per-Site Memory

The extension remembers your preferred style for each website you use.

mail.google.com/compose
To:team@company.com
Subject:Q4 Project Update

Hi Team,

I wanted to share a quick update on our Q4 project milestones. We've made significant progress...

|

Active Stylegmail.com
Gmail & Email ClientsGoogle Docs & OfficeSlack & TeamsNotion & NotesSocial MediaCRM SystemsSupport TicketsAny Text Field

Export & Study Features

Your transcriptions, your format. Export to any file type or transform content into interactive study materials.

Export Formats

Download in any format

TXT
Plain text
PDF
Document
DOCX
Word
MD
Markdown
HTML
Interactive
JSON
Data
50+ Languages
Input & output
Timestamps
Navigation
Speakers
Diarization

Study Mode

Transform content into learning materials

Interactive Quizzes
Generate multiple-choice quizzes from any content. Export as HTML for standalone use.
Study Notes
AI-generated summaries optimized for learning and retention.
Action Items
Extract tasks, decisions, and follow-ups from meeting transcripts.
Quiz Preview
Q: What is the main benefit of RAG technology?
A) SpeedB) ContextC) CostD) Size
Simple Pricing

Start Free. Scale as You Grow.

Start with the desktop app and core transcription for free. Upgrade only if you need unlimited advanced AI workflows or provider-level control.

Free

Desktop dictation + core workflow

£0forever

Start dictating and transcribing without complex setup.

  • Desktop app with global hotkey
  • Instant paste into any app
  • Browser-based transcription
  • 15 desktop transcriptions/month
  • 5 premium/API transcriptions/month
  • 20 AI transforms/month
  • 10 documents up to 5 MB each
  • TXT and PDF exports
  • Chrome extension
Start Free
Most Popular

Premium

For power users

£9/month

Unlimited usage plus full provider control with your own keys.

  • Everything in Free, plus:
  • Unlimited desktop and API transcription
  • Unlimited AI transforms
  • Unlimited documents
  • Connect Deepgram, OpenRouter, and Gemini
  • Choose your preferred AI models
  • Provider usage billed through your own accounts
  • Best fit for heavy daily usage
Get Premium

Premium+

Managed AI included

Coming Soon

All features with our managed API keys.

  • Everything in Premium
  • Our Deepgram API key
  • Our OpenRouter API key
  • Your Gemini key for private File Search
  • Deepgram Nova-3 (99.8%)
  • X.AI Grok Voice Agent
  • Built-in TTS
  • Priority support
Get Notified

The desktop app works on the free plan. BYOK is an advanced option for power users who want unlimited usage and provider control.

BYOK provider usage is billed through your Deepgram, OpenRouter, or Google accounts.

See example provider costs for common transcription, summary, and RAG workflows.

Premium+ coming soon with managed keys at a fixed monthly rate.

FAQ

Frequently Asked Questions

Yes! VerbScribe offers a generous free tier with browser-based transcription and free AI models via OpenRouter. You can use all core features without paying anything. Premium features like Deepgram, AssemblyAI, and Gemini Voice Agent are available through our Premium plan or by bringing your own API keys (BYOK).

BYOK allows you to connect your own provider accounts for services like Deepgram, OpenRouter, and Gemini. VerbScribe charges for the app, workspace, sync, styles, reports, and orchestration; your provider usage stays under your own provider account so you keep direct control over usage and spend.

Accuracy depends on the speech provider you choose. Our premium providers (Deepgram Nova-3 and AssemblyAI) achieve up to 99.8% accuracy with clear audio. Browser-based transcription uses your device's built-in Web Speech API, which is good for casual use but less accurate in challenging conditions.

RAG (Retrieval-Augmented Generation) allows AI to reference your uploaded documents when generating responses. Upload PDFs, DOCX, or TXT files, and the AI will use that content as context. This means your responses are grounded in your actual documents, not generic knowledge.

Our Chrome extension adds a floating microphone button to any text field on the web. Click it, speak, and your voice is transcribed and optionally processed with your chosen AI style before being inserted. Your styles and document context travel with you to any website.

VerbScribe supports 50+ languages for transcription, including English, Spanish, French, German, Chinese, Japanese, Korean, Arabic, Hindi, and many more. You can also transcribe in one language and have the AI output in a different language for translation workflows.

Our Meeting Assistant mode is designed specifically for meetings. It provides real-time transcription with speaker diarization, automatic action items extraction, proactive AI insights, and even live web search to look up topics discussed during the meeting.

You can export transcriptions to TXT, PDF, DOCX, Markdown, and JSON formats. Study mode also lets you export interactive quizzes as standalone HTML files that work without any internet connection.

VerbScribe uses encrypted transport and encrypts stored provider keys at rest. Some workflows, including desktop Flux transcription, route audio through VerbScribe so it can connect securely to the selected speech provider. Read the trust and data-flow guide for the full breakdown.

Through OpenRouter integration, you have access to 400+ AI models including GPT-4o, Claude 3.5, Gemini Pro, Llama 3.1, Mistral, and many more. You can choose the best model for each task, and many models have free tiers available.

Still have questions?

Contact Support

Ready to Dictate
Anywhere?

Download the desktop app for instant dictation in any application, then use VerbScribe on the web to turn transcripts into polished notes, summaries, and exports.

No credit card requiredFree tier foreverWindows, macOS, and LinuxCancel anytime