GitHub - alphacep vosk-api: Offline speech recognition API for Android . . . Vosk is an offline open source speech recognition toolkit It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, Polish More to come Vosk models are small (50 Mb
vosk-api python at master · alphacep vosk-api · GitHub Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification
GitHub - solyarisoftware voskJs: Vosk ASR offline engine API for NodeJs . . . Vosk is an open source embedded (offline on-prem) speech-to-text engine which can run with very low latencies (< 500 msecs on my PC) Vosk is based on a common DNN-HMM architecture Deep neural network is used for sound scoring (acoustic scoring), HMM and WFST frameworks are used for time models (language models) It's based on Kaldi, but Nikolay V Shmyrev's Vosk offers a smart, simplified
Issues · alphacep vosk-api · GitHub Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node - Issues · alphacep vosk-api
alphacep vosk-android-demo - GitHub This demo implements offline speech recognition and speaker identification for mobile applications using Kaldi and Vosk libraries Check the releases for pre-built binaries
Vosk-Browser - GitHub Vosk-Browser A somewhat opinionated speech recognition library for the browser using a WebAssembly build of Vosk This library picks up the work done by Denis Treskunov and packages an updated Vosk WebAssembly build as an easy-to-use browser library Note: WebAssembly builds can target NodeJS, the browser's main thread or web workers
vosk-api python README. md at master - GitHub Vosk supplies speech recognition for chatbots, smart home appliances, virtual assistants It can also create subtitles for movies, transcription for lectures and interviews Vosk scales from small devices like Raspberry Pi or Android smartphone to big clusters