Command-line Knowledge Base

OpenAI Whisper

sudo apt install python3 python3-pip python3-venv ffmpeg -y

python3 -m venv whisper_env
source whisper_env/bin/activate

pip install -U openai-whisper

Whisper offers different model sizes (tiny, base, small, medium, large). Larger models provide better accuracy but are slower.

whisper audio.mp3 --model medium

whisper audio.mp3 --language English --output_format txt

deactivate

Provides an easy-to-use desktop interface for Linux, Windows, and macOS that runs on your local machine using Whisper.
It supports importing audio and video files and exporting transcripts in TXT, SRT, or VTT formats.
You can install it via Flathub.

Another local, private option available on Flathub, Speech Note lets you transcribe audio and supports multiple languages.
You will need to download the specific language models within the app after installation.