Real-time voice dictation that revolutionizes your Linux workflow
Transform your voice into text instantly in any application. Built with Python and Vosk, completely open-source and privacy-respecting.
Experience the power of voice dictation directly in your terminal
Your voice becomes text instantly in any active application on your system.
Works out-of-the-box with French and English, with an easily extensible configuration.
Voice commands, global keyboard shortcuts, and manual commands for a tailored experience.
Customize everything via config.yaml: theme, shortcuts, recognition aliases and much more.
Dictate all punctuation marks with natural voice commands in each language.
No data leaves your machine. Based on Vosk, an open-source recognition engine.
The config.yaml file gives you unprecedented control over your experience
Fully customize the interface with your favorite colors and define your own keyboard shortcuts for each action.
Fine-tune voice recognition with custom aliases, adapted voice commands, and optimized audio parameters.
A native experience in your language, with an interface that adapts automatically.
Set up your voice dictation environment in minutes on Debian/Ubuntu.
Install the necessary dependencies for audio and keyboard control:
sudo apt-get update && sudo apt-get install python3-pip python3-venv portaudio19-dev gettext xdotool pulseaudio-utils
Clone the repository and create a virtual environment:
git clone https://github.com/RonanDavalan/vosk-cli-dictation.git
cd vosk-cli-dictation
python3 -m venv venv
source venv/bin/activate
Install the required Python packages:
pip install -r requirements.txt
Download your language models from the Vosk models page and place them in the vosk-model/
directory.
Start dictating with your voice:
python3 src/main.py -l fr
Pro tip: Add a shell function to your ~/.bashrc
for quick access from anywhere!