Keyword: benchmark

@codspeed/core
Released
5d ago
Version
5.7.1
The core Node library used to integrate with Codspeed runners
codspeed benchmark performance
@codspeed/vitest-plugin
Released
5d ago
Version
5.7.1
vitest plugin for CodSpeed
codspeed benchmark vitest performance
@codspeed/tinybench-plugin
Released
5d ago
Version
5.7.1
tinybench compatibility layer for CodSpeed
codspeed benchmark tinybench performance
@nestia/benchmark
Released
4d ago
Version
11.3.4
NestJS Performance Benchmark Program
e2e nestia nestjs Performance benchmark
@camstack/addon-benchmark
Released
14h ago
Version
1.1.5
Detection addon benchmarking with reference images, accuracy metrics, and distributed execution
camstack addon camstack-addon benchmark detection accuracy
@shopgate/pwa-benchmark
Released
5d ago
Version
7.31.0
Benchmark suite for PWA
shopgate pwa benchmark redux
@remnic/bench
Released
2d ago
Version
9.3.643
Retrieval latency ladder benchmarks + CI regression gates for @remnic/core
remnic benchmark retrieval evaluation ai-memory
free-coding-models
Released
yesterday
Version
0.5.39
Find the fastest coding LLM models in seconds — ping free models from multiple providers, pick the best one for OpenCode, Cursor, or any AI coding assistant.
nvidia nim llm cli ai models +13
oh-my-knowledge
Released
5d ago
Version
0.45.0
Evaluation framework for LLM knowledge inputs — prompts, RAG corpora, skills, agent workflows. Fix the model, vary the artifact. Built-in statistical rigor: bootstrap CI, Krippendorff α, length-debias, saturation curves.
llm-evaluation evaluation-framework prompt-evaluation prompt-testing prompt-regression-testing rag-evaluation +14
benchforge
Released
1 week ago
Version
0.2.12
GC aware benchmarking/profiling, with an interactive viewer. For Node and browser.
benchmark performance profiling bootstrap gc v8
swarmkit-eval
Released
5d ago
Version
0.0.7
Evaluation infrastructure for the swarmkit ecosystem — (harness x model x task x arm x seed) agent evals with ground-truth scoring, cost-matched Pareto reporting, and scalable parallel execution.
eval benchmark agent swarmkit
@grest-ts/schema-benchmark
Released
yesterday
Version
0.0.68
Validation library benchmark suite
typescript framework contract api microservices testing +3
belt-charts
Released
3d ago
Version
1.3.0
A CLI tool for generating charts from verbose data from belt
cli charts belt benchmark visualization metrics
@n8n/n8n-benchmark
Released
5d ago
Version
2.11.0
Cli for running benchmark tests for n8n
automate automation IaaS iPaaS n8n workflow +2
@general-liquidity/sharpebench-mcp
Released
3d ago
Version
0.0.6
MCP server exposing the SharpeBench luck-robust scoring kernel as agent-callable tools (deflated Sharpe, pass^k, process discipline, briefing audit, options Greeks).
mcp model-context-protocol trading sharpebench benchmark ai-agents
@real-router/memory-plugin
Released
3m ago
Version
0.4.12
In-memory history engine for Real-Router — non-browser environments and benchmarks
real-router memory history in-memory react-native benchmark
@castorini/piika
Released
5d ago
Version
0.3.0
A reusable, reproducible pi search-agent workspace
agent benchmark bm25 pi search
ts-timeframe
Released
5d ago
Version
1.0.0
Benchmark framework to collect code time metrics and excel in precision.
benchmark backend typescript
@ykstormsorg/quickdraw
Released
5d ago
Version
1.0.3
Benchmark LLM streaming — TTFT, TPS, p50/p95/p99, cost ceiling, and regression diffing across OpenAI and Anthropic.
llm benchmark streaming ttft tps openai +3
openmodelmap
Released
6d ago
Version
1.0.2
OpenModelMap CLI — discover Chinese open-source AI models. Query OMS scores, benchmarks, hardware requirements, and deploy commands from your terminal.
ai llm open-source chinese-ai-models model-discovery benchmark +6
tryaii
Released
12h ago
Version
0.4.0
AI model router for Node.js and TypeScript with benchmark, cost, and speed-based ranking
ai model-routing embeddings llm benchmark
@hlido/cli
Released
6d ago
Version
0.2.0
Hlido CLI — independent, evidence-backed scorecards for AI agents. Inline scorecard, search, compare, and tier rankings, fetched live from hlido.eu.
ai agent agents evaluation benchmark scorecard +3
as-bench
Released
5d ago
Version
0.1.0
Runtime-agnostic and statistically-aware benchmarking framework for AssemblyScript
assemblyscript benchmark benchmarking criterion statistics
@lzwme/feps-webpack-plugin
Released
4 years ago
Version
1.3.0
This plug-in is used for function execution performance statistics. It calculates the execution time by injecting statistical code and finds slow functions.
webpack plugin function time-consuming performance statistics +1
tinybench
Released
1 month ago
Version
6.0.2
🔎 A simple, tiny and lightweight benchmarking library!
benchmark tinylibs tiny