Canvas for Node.js with skia backend
A robust, strictly-typed Node.js and Browser library for parsing office files (.docx, .pptx, .xlsx, .odt, .odp, .ods, .pdf, .rtf, .csv, .md, .html) and generating high-fidelity outputs in Markdown, HTML, CSV, RTF, and RAG-focused chunks.
SpreadJS PDF export plugin
Fast, lightweight PDF and document parsing with spatial text extraction
View and annotate PDF files in your web app. Full support for mobile and desktop. Runs in the browser using WASM.
The most comprehensive Angular PDF viewer, powered by Mozilla PDF.js 6 — view, annotate, sign, fill forms, search, and read aloud from one component. 8.3M+ downloads, mobile-first, production-ready.
Feature-rich JavaScript PDF library with built-in support for loading and manipulating PDF document.
This repository provides advanced support for data extraction from PDF documents
Utility to work with Docker version of LibreOffice in Lambda
Kreuzberg document intelligence - Node.js native bindings
Kreuzberg document intelligence - Node.js native bindings
Universal document viewer for the web — open-source, framework-agnostic viewer powered by a built-from-scratch WebAssembly engine for high-fidelity rendering across PDF, DOCX, PPTX, XLSX, CSV, SVG, and images.
V3 PDF intelligence MCP server with smart Agent Document Twin extraction, focused evidence operations, OCR provenance, trust/accessibility reports, and benchmark-gated release proof.
Fast, lightweight PDF parsing with spatial text extraction — WebAssembly build for browsers
A web SDK for word processing and rich text capabilities.
PDF to Markdown and DOCX conversion powered by Mistral OCR.
Export an Avodado Document to HTML or PDF.
Kreuzberg document intelligence - Node.js native bindings
Media Viewer
Fast PDF classification and text extraction. Detect text-based vs scanned PDFs, extract text by region with quality checks. Native Rust performance via napi-rs.