Vosk API

Vosk API provides offline speech recognition for Android, iOS, Raspberry Pi and servers with bindings for Python, Java, C# and Node.

Author: alphacep

Added Date: 2025-10-11

Open Source Since: 2019-09-03

GitHub

Overview

Vosk API is an open-source offline speech recognition project that supports multiple languages and platforms including Android, iOS, Raspberry Pi, and servers. It aims to provide low-latency, privacy-friendly ASR capabilities suitable for network-limited or offline environments.

Key Features

Offline recognition support across mobile and server platforms.
SDKs and bindings for Python, Java, C#, and Node for easy integration.
Low-resource modes optimized for edge and embedded deployments.

Use Cases

Local speech-to-text processing where privacy or connectivity is a concern.
Transcription services for notes, meetings, or voice-controlled applications.
Embedded and edge device deployments requiring efficient ASR.

Technical Highlights

Uses mature speech recognition models with optimized inference pipelines balancing accuracy and performance.
Multi-language support and modular SDK interfaces for cross-platform portability.
Modular architecture facilitating model swaps and custom post-processing.

Vosk API

Overview

Key Features

Use Cases

Technical Highlights

Resource Info

Related Resources

Nano-vLLM

DeepSeek-OCR

LeRobot