Vosk API

Vosk API provides offline speech recognition for Android, iOS, Raspberry Pi and servers with bindings for Python, Java, C# and Node.

alphacep · Since 2019-09-03

Loading score...

GitHub

Overview

Vosk API is an open-source offline speech recognition project that supports multiple languages and platforms including Android, iOS, Raspberry Pi, and servers. It aims to provide low-latency, privacy-friendly ASR capabilities suitable for network-limited or offline environments.

Key Features

Offline recognition support across mobile and server platforms.
SDKs and bindings for Python, Java, C#, and Node for easy integration.
Low-resource modes optimized for edge and embedded deployments.

Use Cases

Local speech-to-text processing where privacy or connectivity is a concern.
Transcription services for notes, meetings, or voice-controlled applications.
Embedded and edge device deployments requiring efficient ASR.

Technical Highlights

Uses mature speech recognition models with optimized inference pipelines balancing accuracy and performance.
Multi-language support and modular SDK interfaces for cross-platform portability.
Modular architecture facilitating model swaps and custom post-processing.

Core Content

Core Content

Technology

Technology

More

More

AI Infrastructure

AI Infrastructure

Explore

Explore

Connect

Connect

Quick Links

Quick Links

LinkedIn

LinkedIn

Follow on X

Follow on X

Vosk API

Overview

Key Features

Use Cases

Technical Highlights

Score Breakdown

Related Resources

AutoSubs

Axolotl

Cactus