Self-hostable, OpenAI-compatible Whisper speech-to-text API server you can run anywhere with npx. Local inference via whisper.cpp or ONNX (transformers.js).
Local Transformers.js embedder for Kura (bge-m3). Runs ML models in JS with no Python.
FuncX: a super-easy typed AI reasoning-functions library for product code
Persistent local memory for any MCP client. SQLite + FTS5 + sqlite-vec hybrid retrieval, bi-temporal asOf queries, contradiction detection, LLM-free reflection. Multilingual embeddings. Knowledge Graph. No cloud, no API keys.
Semantically create chunks from large texts. Useful for workflows involving large language models (LLMs).
Local transformers.js embedding provider for @vivantel/rag-core
HuggingFace Transformers.js provider for @workglow/ai.
Transformers.js embedding model adapter for Anvia.
The knowledge server for AI agents — index docs, code, Notion, Slack, and Discord into searchable, agent-accessible knowledge via MCP. Supports OpenAI, Ollama, and local transformers.js embeddings.
Complete local model SDK for EASBot - Transformers.js + ONNX implementation supporting both Language Model (text generation) and Embedding (text embedding) with full AI SDK v2 compatibility
LLM toolkit for Patchwork tools: a model picker, a refresh-surviving SharedWorker that runs local (transformers.js) / OpenRouter / Ollama generation, and a streaming API that carries rich telemetry — next-token predictions, temperature, tokens/se
Natural language query parser for the Lemma protocol. Converts natural language into structured attribute queries using on-device LLM inference via Transformers.js .