Ultra-lightweight Korean morphological analyzer for the web (1MB model, WASM, F1 93.7% NIKL MP)
Single-pass recursive rich-text DSL parser without regex, with pluggable tag handlers. Markdown alternative.
The shared PromptWise engine: heuristic prompt rewriting, context-flood detection, global memory, and persona inference. Zero dependencies; runs in Node and the browser.
POS Tagger and lemmatizer for javascript
Node.js bindings for Lindera morphological analysis engine
Korean tokenizer for MiniSearch, powered by garu-ko (1MB browser-native Korean morphological analyzer)
Korean tokenizer for Orama search, powered by garu-ko (1MB browser-native Korean morphological analyzer)
Node.js bindings for Lindera morphological analysis engine
Node.js bindings for Lindera morphological analysis engine
Wix Restaurants credit-cards tokenizer
lexer for recursive descent parsers
An expression tokenizer, parser and evaluator.
Opt-in RFC 8259 number/string content validation layer over qb-json-next
SQL parser, transpiler, optimizer, and engine for JavaScript/TypeScript - 33+ dialects (Postgres, MySQL, BigQuery, Snowflake, DuckDB, etc). Port of Python SQLGlot.
Small library that provides functions to tokenize a string into an array of words with or without punctuation
Vietnamese word segmentation in JS/TS — an unofficial in-process WebAssembly port of underthesea's word_tokenize. Exact parity, no Python.