Now with Gemini Voice Agent & RAG 2.0

Voice toIntelligence

The only transcription tool with a brain. Real-time speech-to-text with live web search, document RAG, and 400+ AI models. Free forever with BYOK.

400+
AI Models
50+
Languages
99.8%
Accuracy
< 1s
Latency
Live Recording
Audio Upload
YouTube Links
AI Styles
Chrome Extension

Powered by industry-leading AI providers

D
DeepgramSpeech
A
AssemblyAISpeech
G
Google GeminiAI
O
OpenAIAI
A
AnthropicAI
O
OpenRouterAI
E
ElevenLabsTTS
S
StripePayments
D
DeepgramSpeech
A
AssemblyAISpeech
G
Google GeminiAI
O
OpenAIAI
A
AnthropicAI
O
OpenRouterAI
E
ElevenLabsTTS
S
StripePayments
Core Capabilities

Everything You Need.
Nothing You Don't.

Enterprise-grade transcription meets AI transformation. Built for professionals who demand accuracy, speed, and flexibility.

Real-Time Transcription

Crystal-clear voice-to-text with Deepgram Nova-3 or AssemblyAI. Sub-second latency with 99.8% accuracy.

400+ AI Models

Access GPT-4, Claude, Gemini, Llama, and 400+ models via OpenRouter. Transform text into any format.

Document RAG

Upload PDFs, DOCX, TXT files. AI uses your documents as context for smarter, personalized responses.

Custom Styles

Create unlimited writing styles. Transform voice into emails, blog posts, meeting notes, or any format.

50+ Languages

Transcribe in one language, output in another. Native fluency with automatic language detection.

Chrome Extension

Transcribe directly into any text field on the web. Your styles and context travel with you.

Four Powerful Modes.
Infinite Possibilities.

Whether you need quick transcription, conversational AI, voice agents, or meeting assistance — VerbScribe adapts to your workflow.

Single Mode

One-shot transcription & transformation

Record or paste text, apply an AI style, and get your transformed output. Perfect for quick voice-to-email, voice memos, or content creation.

  • Real-time transcription display
  • Apply custom AI styles
  • Export to TXT, PDF, DOCX
  • Speaker diarization
  • Timestamp navigation
Try Single Mode
VerbScribe — Single Mode
Recording...
00:32

Input From Anywhere

Voice, files, YouTube, or text — VerbScribe handles it all. Transform any audio or text into exactly what you need.

Live Recording

Real-time voice capture with instant transcription. Premium providers for professional accuracy.

Deepgram Nova-3AssemblyAIGemini+1

Audio Upload

Upload MP3, WAV, M4A, and more. Process recordings, podcasts, or voice memos.

MP3WAVM4A+2

YouTube Videos

Paste any YouTube URL. We extract and transcribe the audio automatically.

Video TranscriptionAuto-captionsLectures+1

Text Input

Paste existing text for AI transformation. Apply styles without transcription.

Copy/PasteMarkdownPlain Text

Browser Extension

Transcribe directly into any text field on the web. Gmail, Docs, Slack, anywhere.

ChromeFirefoxEdge

Chrome Extension

Transcribe anywhere on the web. Your styles and context travel with you.

Learn More
AI Transformation

Powered by the World's
Best AI Models

Access 400+ AI models through OpenRouter. Use your own API keys (BYOK) and choose the perfect model for each task.

GPT-4o
Claude 3.5
Gemini Pro
Llama 3.1
Mistral
Qwen
DeepSeek
Command R+
+ 392 more

Including free tier models — no API key required to get started

Document RAG

Context-aware intelligence

Upload your documents and let AI use them as context. Get responses grounded in your actual content — not generic answers.

Document Upload
Upload PDF, DOCX, TXT files. AI reads and uses them as context.
Gemini File Search
Enterprise-grade RAG with Google Cloud infrastructure.
Local Embeddings
Privacy-first option using local Xenova transformers.
PDFDOCXTXTMarkdown

Custom Styles

Transform text into any format

Create unlimited custom styles with your own prompts. Voice-to-email, meeting notes, blog posts, or any format you need.

Email
ProfessionalCasualFollow-upSales
Content
Blog PostSocial MediaNewsletterPress Release
Notes
Meeting NotesAction ItemsSummaryBullet Points
Study
FlashcardsQuizStudy GuideLecture Notes
Study Mode

Generate interactive quizzes and study materials from any content

BYOK (Bring Your Own Keys) — Use your own API keys for full control, privacy, and cost management.
Start free with our included quota, upgrade anytime with your own keys.

Browser Extension

VerbScribe.
Everywhere.

No other transcription tool does this. Our Chrome extension brings voice transcription with AI styles and document context to every text field on the web. Write emails, messages, documents — all with your voice.

Voice Input Anywhere

Click the floating mic button to start transcribing into any text field.

Apply Your Styles

Your custom styles travel with you. Transform voice into emails, notes, or any format.

Document Context

Access your uploaded documents for context-aware responses anywhere on the web.

Per-Site Memory

The extension remembers your preferred style for each website you use.

mail.google.com/compose
To:team@company.com
Subject:Q4 Project Update

Hi Team,

I wanted to share a quick update on our Q4 project milestones. We've made significant progress...

|

Active Stylegmail.com
Gmail & Email ClientsGoogle Docs & OfficeSlack & TeamsNotion & NotesSocial MediaCRM SystemsSupport TicketsAny Text Field

Export & Study Features

Your transcriptions, your format. Export to any file type or transform content into interactive study materials.

Export Formats

Download in any format

TXT
Plain text
PDF
Document
DOCX
Word
MD
Markdown
HTML
Interactive
JSON
Data
50+ Languages
Input & output
Timestamps
Navigation
Speakers
Diarization

Study Mode

Transform content into learning materials

Interactive Quizzes
Generate multiple-choice quizzes from any content. Export as HTML for standalone use.
Study Notes
AI-generated summaries optimized for learning and retention.
Action Items
Extract tasks, decisions, and follow-ups from meeting transcripts.
Quiz Preview
Q: What is the main benefit of RAG technology?
A) SpeedB) ContextC) CostD) Size
Simple Pricing

Start Free. Scale as You Grow.

No credit card required. Use free AI models and browser transcription, or unlock premium features anytime.

Free

Try before you buy

£0forever

Limited features to explore VerbScribe.

  • Browser-based transcription
  • 2 min/day API transcription
  • 20 AI transforms/month
  • 3 documents with RAG
  • 10 RAG queries/week
  • Export to all formats
  • Chrome extension
Start Free
Most Popular

Premium

Bring Your Own Keys

£9/month

Unlimited usage with your own API keys.

  • Everything in Free, plus:
  • Unlimited API transcription
  • Unlimited AI transforms
  • Unlimited documents
  • Your own Deepgram key
  • Your own OpenRouter key
  • Your own Gemini key
  • Full cost control
Get Premium

Premium+

We handle the keys

Coming Soon

All features with our managed API keys.

  • Everything in Premium
  • Our Deepgram API key
  • Our OpenRouter API key
  • Your Gemini key only
  • Deepgram Nova-3 (99.8%)
  • Gemini Live Voice Agent
  • ElevenLabs TTS
  • Priority support
Get Notified

Premium users bring their own API keys and only pay for their actual API usage.

Premium+ coming soon with managed keys at a fixed monthly rate.

FAQ

Frequently Asked Questions

Yes! VerbScribe offers a generous free tier with browser-based transcription and free AI models via OpenRouter. You can use all core features without paying anything. Premium features like Deepgram, AssemblyAI, and Gemini Voice Agent are available through our Premium plan or by bringing your own API keys (BYOK).

BYOK allows you to use your own API keys for services like Deepgram, OpenRouter, and Gemini. This gives you full control over your usage and costs, with no monthly fees from us. You only pay for what you use directly to the providers. This is perfect for power users and enterprises who want unlimited usage.

Accuracy depends on the speech provider you choose. Our premium providers (Deepgram Nova-3 and AssemblyAI) achieve up to 99.8% accuracy with clear audio. Browser-based transcription uses your device's built-in Web Speech API, which is good for casual use but less accurate in challenging conditions.

RAG (Retrieval-Augmented Generation) allows AI to reference your uploaded documents when generating responses. Upload PDFs, DOCX, or TXT files, and the AI will use that content as context. This means your responses are grounded in your actual documents, not generic knowledge.

Our Chrome extension adds a floating microphone button to any text field on the web. Click it, speak, and your voice is transcribed and optionally processed with your chosen AI style before being inserted. Your styles and document context travel with you to any website.

VerbScribe supports 50+ languages for transcription, including English, Spanish, French, German, Chinese, Japanese, Korean, Arabic, Hindi, and many more. You can also transcribe in one language and have the AI output in a different language for translation workflows.

Our Meeting Assistant mode is designed specifically for meetings. It provides real-time transcription with speaker diarization, automatic action items extraction, proactive AI insights, and even live web search to look up topics discussed during the meeting.

You can export transcriptions to TXT, PDF, DOCX, Markdown, and JSON formats. Study mode also lets you export interactive quizzes as standalone HTML files that work without any internet connection.

Yes. Your transcriptions and documents are stored securely. With BYOK, your audio is sent directly to the provider you choose (Deepgram, AssemblyAI, etc.) without going through our servers for transcription. We use encryption in transit and at rest.

Through OpenRouter integration, you have access to 400+ AI models including GPT-4o, Claude 3.5, Gemini Pro, Llama 3.1, Mistral, and many more. You can choose the best model for each task, and many models have free tiers available.

Still have questions?

Contact Support

Ready to Transform
Your Voice?

Join thousands of users who have already discovered the power of AI-enhanced transcription. Start free, no credit card required.

No credit card requiredFree tier foreverBYOK supportedCancel anytime