Porcupine wake word engine for web browsers (via WebAssembly)
Universal, cross-platform text-to-speech SDK with multi-provider support.
A TypeScript library for implementing read aloud features with Web technologies, following best practices for digital publishing.
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
A React component to make transcribing audio and video easier and faster.
Declarative Web Speech components for Web Components. Framework-agnostic SpeechSynthesis (TTS) and SpeechRecognition (STT) primitives via wc-bindable-protocol.
React hook for Cheetah Web SDK
VOICEVOX speaking CLI with HTTP and Docker backends
SpeakEasy - Unified text-to-speech service with provider abstraction
Inworld speech-to-text and text-to-speech provider for @effect-uai/core.
MCP server for SaluteSpeech — speech recognition and synthesis (Russia)
OpenAI speech-to-text and text-to-speech provider for @effect-uai/core.
JavaScript Web API for Text-to-Speech and Speech-to-Text.
Text-to-speech for React Native on iOS and Android. Maintained fork in the spirit of ak1394/react-native-tts.
A React component to make transcribing audio and video easier and faster, forked from @BBC/react-transcript-editor
ElevenLabs speech-to-text, text-to-speech, and music-generation provider for @effect-uai/core.
A high-performance React Native library for text-to-speech on iOS and Android
Private, lightweight macOS ASR CLI — local speech-to-text with a tiny footprint.
UltrasafeAI REST API with comprehensive endpoints for AI services
Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities
React Native library for on-device voice processing with Switchboard SDK