Fast, lightweight PDF and document parsing with spatial text extraction
Kreuzberg document intelligence - Node.js native bindings
Kreuzberg document intelligence - Node.js native bindings
OCR addon for qvac
An easy way to run AI models in React Native with ExecuTorch
OpenCode-only OCR skills, reviewer personas, and workflow assets
V3 PDF intelligence MCP server with smart Agent Document Twin extraction, focused evidence operations, OCR provenance, trust/accessibility reports, and benchmark-gated release proof.
A React Native Vision Camera plugin for real-time OCR (Optical Character Recognition). This package enables seamless integration of on-device OCR by using Google ML Kit on Android and Apple's Vision Framework on iOS. It provides fast, efficient, and cross
Fast, lightweight PDF parsing with spatial text extraction — WebAssembly build for browsers
PDF to Markdown and DOCX conversion powered by Mistral OCR.
Lightweight, probably the fastest PaddleOCR SDK in TypeScript. Runs anywhere JavaScript runs: Node.js, Bun, Deno, web browsers, and browser extensions. Docker & CLI supported. The official SDK is browser-only. Accurate text detection and recognition for d
Kreuzberg document intelligence - Node.js native bindings
Fast PDF classification and text extraction. Detect text-based vs scanned PDFs, extract text by region with quality checks. Native Rust performance via napi-rs.
Kreuzberg document intelligence - Node.js native bindings
Kreuzberg document intelligence - Node.js native bindings
Gasio Tools 14종 MCP 서버 - 100% 로컬 오프라인 미디어 처리 도구 (배경 제거, 화질 개선, OCR, QR코드, SVG변환, GIF변환 등)
Official Talonic MCP server. Lets AI agents extract structured, schema-validated data from any document via the Model Context Protocol.
Sogni SDK - AI image, video & audio generation plus LLM chat with vision via the Sogni Supernet (Stable Diffusion, Flux, WAN 2.2, LTX-2, Seedance, Qwen VLM)
Consolidated ONNX plugin providing object detection, face detection and recognition, license plate detection with OCR, and CLIP semantic embeddings.
Universal CAPTCHA solver - text, image, audio, reCAPTCHA, hCaptcha, FunCAPTCHA, Turnstile, math, puzzle, slider, and more. No paid services needed. Built-in AI.
OCR image-to-text tool for Pi — extracts text from screenshots, terminal output, and code images using Tesseract + ImageMagick
uniner-cli — RPA automation CLI: browser automation (Playwright/CDP), Excel, OCR, desktop/SAP, crypto/HTTP/email tools. Auto-installs a Claude Code skill on install.
Apple CoreML detection backend for camera.ui. Runs object detection, face detection and recognition, license plate recognition with OCR, and CLIP semantic embeddings on Apple hardware.