- Supported Bit Versions: 64-bit
- Source Code URL: https://github.com/XWHQSJ/captionflow
- Minimum OBS Studio Version: 31.0.0
- Supported Platforms: Windows, macOS
Independent third-party plugin for OBS Studio that turns local audio into live captions on your machine. CaptionFlow is not developed by, endorsed by, or affiliated with the OBS Project.
- GitHub: https://github.com/XWHQSJ/captionflow
- Latest release: https://github.com/XWHQSJ/captionflow/releases/latest
- Report issues: https://github.com/XWHQSJ/captionflow/issues
- License: GPL-2.0-or-later
Why this plugin?
CaptionFlow keeps speech recognition local during use: your microphone or desktop audio is decoded by sherpa-onnx on your machine, and captions are written to a text file that OBS Studio can read. The first model download contacts the upstream model host; after that, captioning works offline with the cached model. The initial release focuses on low-latency English captions, a bilingual Chinese/English preset, and an optional delay-line mute filter for sensitive words.
Features
First use (60 seconds)
- Real-time captions: low-latency partial results while speech is still in progress, written atomically to a text file that any Text (GDI+) or Text (FreeType 2) source can read.
- One-click model download: pick the English, bilingual, or tiny preset in the filter properties; the plugin downloads and extracts the model on demand, with optional SHA-256 verification.
- Sensitive-word mute: load a hotwords file (word :boost); the plugin delays output audio so it can retroactively beep out matches. Beep frequency and volume adapt to the speaker's fundamental frequency (F0) and RMS level.
- Hardware providers: CPU (default), CUDA (Windows + NVIDIA), DirectML (Windows + any GPU).
- Supply-chain verifiable: every release ships with a Sigstore build provenance attestation, so you can verify that the binary was built by public CI.
- Bilingual UI: English and Simplified Chinese (zh-CN) locales included.
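The sensitive-word mute relies on a delay line: output lags input by a fixed amount, so a match detected slightly after the word was spoken can still be masked before it is heard. The Python sketch below illustrates that idea only; it is not the plugin's code. The class name `DelayLineMuter`, its parameters, and the fixed 1 kHz beep are assumptions made for illustration (the plugin is described as adapting beep frequency and volume to the speaker).

```python
import math
from collections import deque


class DelayLineMuter:
    """Hypothetical delay-line muter: output is the input delayed by
    `delay_samples`, with marked ranges replaced by a sine beep."""

    def __init__(self, delay_samples, sample_rate=48000,
                 beep_hz=1000.0, beep_gain=0.2):
        self.delay = delay_samples
        self.sample_rate = sample_rate
        self.beep_hz = beep_hz        # fixed here; an assumption for the sketch
        self.beep_gain = beep_gain
        self.buffer = deque()         # (absolute_index, sample) not yet emitted
        self.next_index = 0           # absolute index of the next input sample
        self.muted = []               # (start, end) absolute ranges, end exclusive

    def mark(self, start, end):
        """Mark input samples [start, end) to be beeped when they are emitted."""
        self.muted.append((start, end))

    def push(self, samples):
        """Feed input samples; return the same number of delayed output samples."""
        out = []
        for sample in samples:
            self.buffer.append((self.next_index, sample))
            self.next_index += 1
            if len(self.buffer) > self.delay:
                index, delayed = self.buffer.popleft()
                out.append(self._render(index, delayed))
            else:
                out.append(0.0)       # silence while the delay line fills
        return out

    def _render(self, index, sample):
        if any(start <= index < end for start, end in self.muted):
            phase = 2.0 * math.pi * self.beep_hz * index / self.sample_rate
            return self.beep_gain * math.sin(phase)
        return sample
```

A match can only be masked if it is marked before the affected samples leave the buffer, so the delay has to be at least as long as the recognizer's detection latency. A longer delay gives more headroom but adds the same amount of latency to the audio output.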
- Windows 10 / 11 x64 (OBS Studio 31.0+)
- macOS 11+ universal (Apple Silicon + Intel; OBS Studio 31.0+)
- Windows: extract the ZIP, then merge obs-plugins\ and data\obs-plugins\ into %ProgramFiles%\obs-studio\.
- macOS: open the .pkg; it installs into the user OBS Studio plugin directory.
Code:
gh attestation verify captionflow-0.2.0-macos-universal.pkg --repo XWHQSJ/captionflow
- Right-click an audio source → Filters → + → CaptionFlow.
- Click Download Model… and pick a preset.
- Set Caption Output File to somewhere like /tmp/captions.txt.
- Add a Text (GDI+) / Text (FreeType 2) source → enable Read from file → point at the same path.
- Speak. Watch captions.
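Because the Text source re-reads the caption file while the plugin keeps rewriting it, the write needs to be atomic so the reader never sees a half-written caption. A common way to get that is to write a temporary file in the same directory and then rename it over the target. The Python sketch below shows that pattern under stated assumptions: the function name `write_caption_atomically` is hypothetical, and the plugin's actual implementation may differ.

```python
import os
import tempfile


def write_caption_atomically(path: str, text: str) -> None:
    """Replace the file at `path` with `text` in one step (illustrative sketch)."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temp file lives in the target directory so the final rename
    # stays on a single filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".caption-", suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as tmp:
            tmp.write(text)
            tmp.flush()
            os.fsync(tmp.fileno())   # push the data to disk before the rename
        os.replace(tmp_path, path)   # swaps the new file in, overwriting the old one
    except BaseException:
        # Do not leave stray temp files behind if anything failed.
        try:
            os.unlink(tmp_path)
        except FileNotFoundError:
            pass
        raise
```

A reader that opens the path at any moment gets either the previous caption or the new one in full, never a mix of the two.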
- English (20M, fast) — ~70 MB — default streamers
- Chinese + English (bilingual) — ~300 MB — bilingual content
- English (tiny) — ~40 MB — low-end CPUs
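Model archives of this size are worth checking after download. As a rough sketch of what SHA-256 verification involves, the Python below hashes a file in chunks and compares the result with an expected digest. The function names and the idea of passing the expected digest as a hex string are assumptions for illustration; the listing does not say where the plugin gets its reference hashes.

```python
import hashlib
import hmac


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path: str, expected_hex: str) -> bool:
    """Compare the file's digest with the expected one (case-insensitive)."""
    actual = sha256_of_file(path)
    return hmac.compare_digest(actual, expected_hex.strip().lower())
```

Reading in chunks keeps memory use flat even for the ~300 MB bilingual preset. If the check fails, the safe response is to discard the download rather than extract it.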
- sherpa-onnx (Next-gen Kaldi team) — streaming ASR runtime
- obs-plugintemplate — build-system scaffolding
- OBS Studio plugin SDK