音字 — realtime speech ⇄ text. Node/Bun bindings for the otoji Rust crate (SenseVoice ASR + multilingual polish + TTS).
音字 — realtime speech ⇄ text. Node/Bun bindings for the otoji Rust crate (SenseVoice ASR + multilingual polish + TTS).
myflo — local-first developer workbench. CLI + MCP server + Next.js dashboard. Forked from ruflo for the runtime substrate; flo CLI on top is ours.
Official JavaScript SDK for Transcribe API.
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
MCP server for minutes — conversation memory for AI assistants. Works with Claude Desktop, Mistral Vibe, Cursor, Windsurf, and any MCP client.
Official Speak AI MCP Server — capture meetings, search thousands of recordings, run async voice and video surveys, create clips, and automate workflows from your AI assistant.
n8n node for AssemblyAI speech-to-text transcription models.
Sofia SDK - AI-powered medical assistant web component for healthcare applications
Mirador 4 plugin to render a hidden or visible selectable text overlay
A Model Context Protocol server that gives GitHub Copilot the ability to understand and analyze video content
Voice + text MCP channel between a Claude Code session and the Viber UI ( https://viber.dgypx.dev ). Push transcripts to Claude; send_message tool delivers text back to the UI.
Picovoice Cheetah React Native binding
React hook for Cheetah Web SDK
PI extension for push-to-talk speech-to-text using the ElevenLabs Scribe API
Access Talkie voice memos, dictations, captures, and workflows from the command line
MCP server for Spoken — fetch podcast transcripts as clean Markdown with real speaker names. Built for AI agents.
MCP server for Convo — AI meeting assistant. Access your meetings, transcripts, summaries, and action items from any AI assistant.
Install Talkie for macOS — voice-first productivity suite
n8n community node to download, transcribe, trim, cut, resize, and subtitle videos via VideoSailor
Inworld speech-to-text and text-to-speech provider for @effect-uai/core.
MCP server pour intégrer Gilbert (transcriptions et synthèses de réunions) dans Claude Desktop, Cursor, et tous les clients compatibles Model Context Protocol.
OpenAI speech-to-text and text-to-speech provider for @effect-uai/core.