Vosk CLI Dictation

Real-time voice dictation that revolutionizes your Linux workflow

Transform your voice into text instantly in any application. Built with Python and Vosk, completely open-source and privacy-respecting.

Get Started Now View on GitHub

v0.0.4-alpha Current Version

2 Languages

100% Offline & Private

Key Features

Real-time transcription

Your voice becomes text instantly in any active application on your system.

Multi-language support

Works out-of-the-box with French and English, with an easily extensible configuration.

Total control

Voice commands, global keyboard shortcuts, and manual commands for a tailored experience.

Advanced configuration

Customize everything via config.yaml: theme, shortcuts, recognition aliases and much more.

Complete punctuation

Dictate all punctuation marks with natural voice commands in each language.

100% offline

No data leaves your machine. Based on Vosk, an open-source recognition engine.

Total Configuration

The config.yaml file gives you unprecedented control over your experience

Snippet of the config.yaml file showing hotkeys and theme color settings.

Theme and shortcuts

Fully customize the interface with your favorite colors and define your own keyboard shortcuts for each action.

Customizable colors for each message type
Configurable global keyboard shortcuts
Personalize start & stop voice commands

Snippet of the config.yaml file showing English language settings.

Advanced settings

Fine-tune voice recognition with custom aliases, adapted voice commands, and optimized audio parameters.

Alias system to correct frequent errors
Customizable voice commands per language
Extensible punctuation dictionary

Getting Started

Set up your voice dictation environment in minutes on Debian/Ubuntu.

System Dependencies

Install the necessary dependencies for audio and keyboard control:

                            sudo apt-get update && sudo apt-get install python3-pip python3-venv portaudio19-dev gettext xdotool pulseaudio-utils
                        

Clone and Setup

Clone the repository and create a virtual environment:

                            git clone https://github.com/RonanDavalan/vosk-cli-dictation.git

                            cd vosk-cli-dictation

                            python3 -m venv venv

                            source venv/bin/activate

Install Dependencies

Install the required Python packages:

pip install -r requirements.txt

Download your language models from the Vosk models page and place them in the vosk-model/ directory.

Launch

Start dictating with your voice:

python3 src/main.py -l fr

Pro tip: Add a shell function to your ~/.bashrc for quick access from anywhere!

Ready to transform your workflow?

Start Your Journey