npm.io
3.29.31 • Published 8h agoCLI

aidevops

Licence
MIT
Version
3.29.31
Deps
4
Size
834 kB
Vulns
0
Weekly
13.4K
Install scriptsThis package runs scripts during installation (preinstall/install/postinstall)

AI DevOps Framework

aidevops.sh is an OpenCode plugin and AI DevOps framework for people who want AI to do useful work across code, infrastructure, business, marketing, content, and creative projects without turning every job into another long, fragile chat.

Most AI tools still leave you doing the hard coordination yourself: finding the right context, choosing a model, protecting secrets, managing branches, watching CI, spotting stuck work, and remembering what went wrong last time. aidevops puts structure around that work so agents can share context, work safely in parallel, spend model budget where it matters, and leave the system better than they found it.

Recommended setup: OpenCode + OpenAI models. GPT-5.5 is the preferred high-capability model for complex agent work; GPT-5.4 mini is the preferred fast, lower-cost model for triage and routine implementation. Claude models (Anthropic) remain fully supported, and other model providers are evaluated from time to time as their quality, latency, and cost profiles change.

"Scope a mission to redesign the landing pages — break it into milestones, dispatch workers in parallel, validate each milestone, and track budget across the whole project."

One conversation, autonomous project delivery, with security, teamwork, token efficiency, and quality control built in.

Founded by Marcus Quinn on 9th November 2025 to help anyone level-up their AI & Open-Source game.

The Aim

Maximum value for your time and money. aidevops is built for the gap between “the model can probably do this” and “the work is actually done, verified, safe, and worth the cost.”

  • Load the right context when it is needed, instead of stuffing every agent, skill, and tool into the prompt.
  • Spend tokens and model budget deliberately. Cheap and fast models should handle routine work; stronger models should handle judgement, architecture, review, and risk.
  • Keep secrets out of chat. Credentials, tenants, scans, confirmations, and audit logs are part of the workflow, not an afterthought.
  • Let people and agents work across machines without trampling each other. Worktrees, branches, PRs, task IDs, mailbox state, and memory keep the work separated and traceable.
  • Notice when the system is struggling. Stuck workers, orphaned PRs, stale assignments, CI failures, review-bot traps, and repeated mistakes should become visible signals.
  • Improve the framework from real use. Imported skills, session learnings, quality findings, and better patterns should become better agents, hooks, scripts, and docs.
  • Optimise for profitable outcomes: useful work shipped, lower supervision cost, safer operations, and decisions that make sense beyond the next prompt.

The result: an AI operations platform that manages projects across every business domain — absorbing everything automatable so you can focus on what matters.

Built on proven patterns: aidevops implements industry-standard agent design patterns - including multi-layer action spaces, context isolation, and iterative execution loops.

Why This Framework?

Beyond single-task AI. A normal AI harness can be brilliant for one job and still weak at the work around the job. aidevops is for the surrounding discipline: context, routing, safety, git hygiene, collaboration, verification, memory, and follow-through.

What makes it different:

  • Pulse supervision - scheduled checks can dispatch workers, merge ready PRs, close loops, and surface stuck work.
  • Domain agents - code, automation, product, business, marketing, legal, content, research, SEO, health, reports, and framework work each get their own guidance.
  • Cross-model checks - risky operations can be reviewed by a second provider to reduce shared failure modes.
  • Service coverage - hosting, Git platforms, DNS, security, monitoring, deployment, payments, communications, and more are handled through repeatable helpers.
  • Mission work - larger goals can be split into milestones with validation, budget tracking, and automatic advancement.

GitHub Actions Quality Gate Status CodeFactor Maintainability Codacy Badge CodeRabbit

License: MIT Copyright

Version npm version Homebrew GitHub repository

Services Supported AGENTS.md AI Optimized MCP Servers API Integrations

Quick Reference

  • Purpose: AI-assisted DevOps automation framework
  • Install: npm install -g aidevops && aidevops update
  • Recommended runtime/models: OpenCode + OpenAI GPT-5.5 / GPT-5.4 mini
  • Entry: aidevops CLI, ~/.aidevops/agents/AGENTS.md
  • Stack: Bash scripts, TypeScript (Bun), MCP servers
  • Recent focus: OpenCode control-plane safety, mobile simulator testing, self-hosted runner operations, and pulse/worker diagnostics
Key Commands
  • aidevops init - Initialize in any project
  • aidevops update - Update framework
  • aidevops auto-update - Automatic update polling (enable/disable/status)
  • aidevops secret - Manage secrets (gopass encrypted, AI-safe)
  • aidevops security - Full security assessment (posture, secrets, supply chain)
  • /onboarding - Interactive setup wizard (in AI assistant)
  • /design-artifact - Route artifact-first UI, deck, email, poster, and mobile mockup work
  • /open-design - Manage the optional Open Design companion studio
  • /auto-browse - Learn, optimize, and graduate repeatable browser operations and web data-mining workflows
  • /report-render - Render report-ready Markdown or JSON to HTML with sticky TOC, print CSS, evidence badges, and source cards for PDF export
  • /report-token-use - Generate a local per-session token, model, compaction, and MCP-use report
  • /pulse - Run the autonomous supervisor loop for dispatch, merge, diagnostics, and stuck-work recovery
  • /serve-sim / serve-sim-helper.sh - Exercise mobile web flows in simulator-backed local previews
Agent Structure
  • 12 primary agents (Build+, Automate, Product, SEO, Marketing-Sales, etc.) with specialist @subagents on demand
  • 2,200+ agent and subagent markdown files organized by domain
  • 1,800+ helper scripts in .agents/scripts/
  • 185+ slash commands and workflow guides for common operations
What You Can Ask aidevops To Do
  • Build, fix, review, release, and maintain software with worktrees, PRs, tests, and quality gates.
  • Run infrastructure, hosting, DNS, monitoring, security, and deployment workflows.
  • Plan products, PRDs, onboarding, monetisation, growth, analytics, and UI direction.
  • Operate business, finance, receipts, invoices, marketing, outreach, SEO, content, video, and personal-productivity routines.
  • Discover the right capability with /skills recommend "TASK", /onboarding, or the OpenCode agent picker.
Recent framework upgrades

Since the last README feature refresh, aidevops has added or expanded:

  • OpenCode GUI/control-plane planning: ADRs, threat model, trust-boundary guidance, and containment rules for a future GUI that stays local-first, auditable, and explicitly separated from secret-bearing helpers (docs/gui/).
  • Mobile and simulator workflows: app-development guidance, App Store Connect support, Expo/Xcode/Swift workflows, Maestro/minisim/iOS Simulator MCP references, and serve-sim mobile web testing support (.agents/tools/mobile/).
  • Self-hosted runner operations: lifecycle and storage runbooks, Docker foreground-mode guidance, systemd timer freshness triage, and ExecStop race-guard documentation (.agents/reference/github-self-hosted-runners.md).
  • Pulse diagnostics and reliability: compact API-budget diagnostics, pulse cadence/API diagnostics, GitHub read ramp pacing, configurable worker floors, Renovate dashboard skipping, and more defensive duplicate/blocked-by/rate-limit handling.
  • Worker and PR observability: worker diagnostic failure families, runtime observability signals, review-thread response scanning, required-check validation, orphan-recovery base handling, and safer automated GitHub write guards.
  • OpenCode runtime polish: versioned session title suffixes, session archive retention, OAuth pool hardening, debug-error preservation, and reusable shell-env version lookup in the OpenCode plugin.

Enterprise-Grade Quality & Security

Comprehensive DevOps framework with tried & tested services integrations, popular and trusted MCP servers, and enterprise-grade infrastructure quality assurance code monitoring and recommendations.

Vault security model: aidevops defines protected data classes, provider routing labels, trust boundaries, and phased encrypted sync architecture in .agents/reference/vault.md. Vault guidance is local-first: third-party AI providers can only reason over data that has been decrypted into their prompt or tool context, so provider-side logs/retention remain outside Vault's technical control. Future local LLM mode reduces provider exposure but not local host compromise risk.

Vault passphrase warning: aidevops cannot recover a lost Vault passphrase. Save it in a trusted password manager with backups; never paste it into AI chat, CLI arguments, environment variables, logs, issue comments, or test fixtures.

Report creation, previews, and PDF exports

Use aidevops to turn evidence bundles into decision-ready reports while keeping Markdown or JSON as the canonical source. Report agents can produce AI-search audits, SEO/GEO scorecards, delivery reviews, campaign reports, board packs, incident summaries, recurring client handoffs, and before/after remediation evidence.

New report capabilities include:

  • Markdown-first report anatomy with cover pages, executive summaries, evidence ledgers, source cards, action prompts, appendix links, charts, Mermaid/LaTeX fallbacks, and verified, partial, inferred, or missing evidence badges.
  • DESIGN.md-backed visual templates plus basic no-CSS output for lightweight handoff.
  • Browser preview HTML with sticky contents, source-card links, copy buttons, and light/dark theme variants where a style supports them.
  • PDF-ready profiles for A4, US Letter, and 16:9 slides. Generated PDF links use *-a4.pdf, *-usletter.pdf, and *-slides.pdf names.
  • Versioned examples under _reports/examples/; open _reports/examples/index.html locally to browse the example reports, rendered styles, and PDF exports.

Create a report:

  1. Load reports/general.md for structure, then the matching domain report doc such as reports/seo-geo.md, reports/development.md, reports/marketing.md, or reports/business.md.
  2. Gather source evidence first. Use deterministic run: steps or service helpers for collection, then ask the domain agent plus agent:Reports to interpret and prioritise.
  3. Save canonical source as report.md or report.json in _reports/drafts/<report-name>/ while working, or in _reports/examples/<example-name>/ only after privacy review.
  4. Render with /report-render report.md or .agents/scripts/report-render-helper.sh render report.md --template <style> --theme auto --pdf-profile a4 --output report.html.
  5. Export PDFs from Chrome/Chromium using the generated HTML and the A4, US Letter, or slides profiles. Regenerate derived HTML/PDF files instead of hand-editing them.

Create a repeatable report agent:

  1. Read reports/routine-handoff.md and tools/build-agent/build-agent.md.
  2. Define the report cadence, evidence collection commands, source IDs, privacy rules, target template/style, and verification gates.
  3. Put deterministic collection in run: steps and reserve agent:Reports for narrative, evidence interpretation, recommendations, and handoff tasks.
  4. Store reusable agent instructions in the appropriate agent tier (custom/ for local/client-specific agents; shared .agents/ only for broadly reusable framework agents).

Security Notice

This framework provides agentic AI assistants with powerful infrastructure access. Use responsibly.

Capabilities: Execute commands, access credentials, modify infrastructure, interact with APIs Your responsibility: Use trusted AI providers, rotate credentials regularly, monitor activity

Security Commands
aidevops security              # Run ALL checks (posture + hygiene + supply chain)
aidevops security posture      # Interactive security posture setup (gopass, gh auth, SSH)
aidevops security status       # Combined posture + hygiene summary
aidevops security scan         # Secret hygiene & supply chain scan only
aidevops security scan-pth     # Python .pth file audit (supply chain attack vector)
aidevops security scan-secrets # Plaintext credential locations only
aidevops security scan-deps    # Unpinned dependency check
aidevops security check        # Per-repo security posture assessment
aidevops security dismiss <id> # Dismiss a security advisory after taking action

Running aidevops security with no arguments is the single command that covers everything — user security posture, plaintext secret detection, supply chain IoC scanning, and active advisories.

Security advisories are delivered via aidevops update and shown in the session greeting until dismissed. The scanner never exposes secret values — only file locations and key names. All remediation commands should be run in a separate terminal, not inside AI chat sessions.

Supply chain hardening: All Python dependencies are pinned to exact versions (==) to prevent malicious package upgrades. The .pth file auditor detects known supply chain attack indicators (e.g., the LiteLLM March 2026 PyPI compromise).

Quick Start

Installation Options

npm (recommended - verified provenance):

npm install -g aidevops && aidevops update

Note: npm suppresses postinstall output. The && aidevops update deploys agents to ~/.aidevops/agents/. The CLI will remind you if agents need updating.

Bun (fast alternative):

bun install -g aidevops && aidevops update

Homebrew (macOS/Linux):

brew install marcusquinn/tap/aidevops && aidevops update

Direct from source (aidevops.sh):

bash <(curl -fsSL https://aidevops.sh/install)

Manual (git clone):

git clone https://github.com/marcusquinn/aidevops.git ~/Git/aidevops
~/Git/aidevops/setup.sh

That's it! The setup script will:

  • Clone/update the repo to ~/Git/aidevops
  • Deploy agents to ~/.aidevops/agents/
  • Install the aidevops CLI command
  • Configure your AI assistants automatically
  • Offer to install Oh My Zsh (optional, opt-in) for enhanced shell experience
  • Install recommended token-efficiency tooling by default, including RTK for compact git/gh/test/lint command summaries before output reaches AI context
  • Guide you through recommended tools (Tabby, Zed, Git CLIs)
  • Ensure all PATH and alias changes work in both bash, zsh, and fish
  • When Claude Code is installed, add a claude alias that runs claude --dangerously-skip-permissions (skips per-tool permission prompts). Re-running setup updates the alias automatically. To grant permissions per-session instead, press Shift-Tab inside Claude Code to cycle through permission modes (default → skip permissions → auto-approve).

New users: Start OpenCode and type /onboarding to configure your services interactively. OpenCode is the recommended tool for aidevops; pair it with OpenAI GPT-5.5 and GPT-5.4 mini for the best current results across agent tiers. The onboarding wizard will:

  • Explain what aidevops can do
  • Ask about your work to give personalized recommendations
  • Show which services are configured vs need setup
  • Guide you through setting up each service with links and commands

After installation, use the CLI:

aidevops status           # Check what's installed
aidevops doctor           # Detect duplicate installs and PATH conflicts
aidevops update           # Update framework + check registered projects
aidevops auto-update      # Manage automatic update polling (every 10 min)
aidevops init             # Initialize aidevops in any project
aidevops features         # List available features
aidevops repos            # List/add/remove registered projects
aidevops design           # DESIGN.md detection + brand guideline HTML/PDF exports
aidevops detect           # Scan for unregistered aidevops projects
aidevops upgrade-planning # Upgrade TODO.md/PLANS.md to latest templates
aidevops update-tools     # Check and update installed tools
aidevops uninstall        # Remove aidevops
Optional Design Artifact Studio

aidevops now treats design as a self-contained stack with optional peripherals:

  • Google DESIGN.md standard: AI-readable design systems with YAML tokens, linting, previews, and brand/style libraries (.agents/tools/design/design-md.md).
  • Init-aware design files: aidevops init records has_interface and seeds root DESIGN.md for standard repos plus minimal-scope repos with detected GUI/interface markers.
  • Brand guideline exports: aidevops design guidelines . --pdf generates _reports/brand-guidelines/brand-guidelines.md, HTML, and A4/US Letter/slides PDFs from DESIGN.md.
  • Repo rollout: aidevops design survey --json audits owned initialized GUI repos; aidevops design issues --apply files worker-ready auto-dispatch issues for missing DESIGN.md/brand-guideline artifacts.
  • Design agents and skills: brand identity, palettes, UI inspiration, product UI rules, shadcn/Tailwind/UI skills, Nothing-style design, email rendering, Remotion/video, and browser-based UI verification.
  • Artifact routing commands: /design-artifact decides whether to use aidevops-native implementation or a companion artifact studio; /open-design manages optional Open Design workflows.
  • Verification gates: Playwright screenshots, accessibility/contrast checks, email rendering, deck export/fidelity checks, and media smoke tests before generated artifacts are accepted.

Optional companion: Open Design by nexu-io (Apache-2.0) is supported as a peripheral for live sandboxed previews, design-skill pickers, .od/ artifact workspaces, and HTML/PDF/PPTX/ZIP-style exports. aidevops remains canonical for agents, skill ingestion, Google DESIGN.md, local hosting, and verification.

# Inspect optional companion status
open-design-helper.sh status

# Print safe install plan only
open-design-helper.sh install

# Install alongside aidevops only after opting in
open-design-helper.sh install --execute

# Start through aidevops local HTTPS if Open Design only prints localhost
open-design-helper.sh start --https-local open-design
# → https://open-design.local when localdev is configured

Imported Open Design skills are not copied verbatim. They are evaluated through aidevops build-agent methodology, deduplicated against existing agents, flattened into aidevops *-skill.md structure, attributed to upstream, and given verification commands. See .agents/tools/design/open-design-ingestion.md for the full skill-value matrix.

Project tracking: When you run aidevops init, the project is automatically registered in ~/.config/aidevops/repos.json. Running aidevops update checks all registered projects for version updates.

Use aidevops in Any Project

Initialize aidevops features in any git repository:

cd ~/your-project
aidevops init                         # Enable all features
aidevops init planning                # Enable only planning
aidevops init planning,time-tracking  # Enable specific features

This creates:

  • .aidevops.json - Configuration with enabled features
  • .agents symlink → ~/.aidevops/agents/
  • TODO.md - Quick task tracking with time estimates
  • todo/PLANS.md - Complex execution plans
  • .beads/ - Task graph database (if beads enabled)

Available features: planning, git-workflow, code-quality, time-tracking, beads

Per-repo platform setup

After aidevops init registers a new repo, run /setup-git in your AI assistant to apply per-repo platform secrets. Most notably, this sets SYNC_PAT — a GitHub Actions secret that lets issue-sync.yml push TODO.md auto-completion past branch protection.

This is distinct from /onboarding (per-account credentials like gh auth login): GitHub Actions secrets are scoped per-repo, so each repo needs its own. You need gh auth login to succeed before any per-repo helper can run, so /onboarding comes first, /setup-git second.

Run /setup-git again whenever you register a new repo with aidevops repos add or when a SYNC_PAT advisory appears in the session greeting toast. If you skip this step, issue-sync.yml will post a remediation comment when it hits branch protection — /setup-git walks through the fix.

Upgrade Planning Files

When aidevops templates evolve, upgrade existing projects to the latest format:

aidevops upgrade-planning           # Interactive upgrade with backup
aidevops upgrade-planning --dry-run # Preview changes without modifying
aidevops upgrade-planning --force   # Skip confirmation prompt

This preserves your existing tasks while adding TOON-enhanced parsing, dependency tracking, and better structure.

Automatic detection: aidevops update now scans all registered projects for outdated planning templates (comparing TOON meta version numbers) and offers to upgrade them in-place with backups.

Task Graph Visualization with Beads

Beads provides task dependency tracking and graph visualization:

aidevops init beads              # Enable beads (includes planning)

Task Dependencies:

- [ ] t001 First task
- [ ] t002 Second task blocked-by:t001
- [ ] t001.1 Subtask of t001
Syntax Meaning
blocked-by:t001 Task waits for t001 to complete
blocks:t002 This task blocks t002
t001.1 Subtask of t001 (hierarchical)

Commands:

Command Purpose
/ready Show tasks with no open blockers
/list-verify List verification queue (pending, passed, failed)
/sync-beads Sync TODO.md/PLANS.md with Beads graph
bd list List all tasks in Beads
bd ready Show ready tasks (Beads CLI)
bd graph <id> Show dependency graph for an issue

Architecture: aidevops markdown files (TODO.md, PLANS.md) are the source of truth. Beads syncs from them for visualization.

Optional Viewers: Beyond the bd CLI, there are community viewers for richer visualization:

  • beads_viewer (Python TUI) - PageRank, critical path analysis
  • beads-ui (Web) - Live updates in browser
  • bdui (React/Ink TUI) - Modern terminal UI
  • perles (Rust TUI) - BQL query language

See .agents/tools/task-management/beads.md for complete documentation and installation commands.

Your AI assistant now has agentic access to 30+ service integrations.

OpenCode with OpenAI is the current recommended aidevops setup. Use GPT-5.5 for complex reasoning, architecture, security-sensitive review, and hard agent tiers; use GPT-5.4 mini for fast triage, routine implementation, retries, and lower-cost worker throughput.

Authenticate via the pool:

aidevops model-accounts-pool add openai
# Restart OpenCode after adding

Why this is the default:

  • Best current cross-tier results — strongest observed balance across interactive Build+, workers, review, and dispatch tiers
  • Good cost/latency split — GPT-5.5 for depth, GPT-5.4 mini for high-volume routine work
  • Provider isolation — OpenAI accounts rotate independently from Anthropic, Google, Cursor, and local providers
  • Fallback-friendly — Claude, Gemini, Cursor, and local models remain available when a task or rate-limit profile calls for them
OpenCode Anthropic OAuth (Supported)

OpenCode includes Anthropic OAuth authentication natively — no API key needed. OAuth is covered by your Claude Pro/Max subscription at zero additional cost.

Authenticate via the pool (recommended):

aidevops model-accounts-pool add anthropic
# Opens browser OAuth flow — no API key required
# Restart OpenCode after adding

Or via the OpenCode TUI:

Open OpenCode → Ctrl+A → Select AnthropicLogin with Claude.ai → follow browser OAuth flow.

Note: opencode auth login prompts for an API key, not OAuth. Use the commands above for subscription-based OAuth access.

Benefits:

  • Still fully supported for users who prefer Claude models or already have Claude Pro/Max
  • Zero marginal cost for Claude Pro/Max subscribers (covered by subscription)
  • Automatic token refresh — no manual re-authentication needed
  • Multiple accounts — add more accounts to the pool for automatic rotation when one hits rate limits
  • Beta features enabled — extended thinking modes and latest features
Cursor Models via Pool Proxy

Access Cursor Pro models (Composer 2, Claude 4.6 Opus/Sonnet, GPT-5.x, Gemini 3.1 Pro) in OpenCode through a local gRPC proxy that translates OpenAI-compatible requests to Cursor's protobuf/HTTP2 protocol.

Setup:

# Add your Cursor account to the pool (reads from local Cursor IDE)
oauth-pool-helper.sh add cursor

# Restart OpenCode — Cursor models appear in Ctrl+T model picker

How it works:

  • Reads Cursor credentials from the local IDE state database
  • Starts a gRPC proxy that speaks Cursor's native protocol (not the cursor-agent CLI)
  • Discovers available models via gRPC and registers them as an OpenCode provider
  • Supports true streaming, tool calling, and automatic token refresh
  • Falls back gracefully if no Cursor accounts are in the pool

Benefits:

  • Zero additional cost for Cursor Pro subscribers
  • True streaming — responses stream as they arrive (not buffered)
  • Tool calling — Cursor's native MCP tool protocol works through the proxy
  • Model discovery — automatically detects all models available to your account
  • Pool rotation — multiple accounts with LRU rotation and 429 failover
Google AI Pool (Gemini CLI / Vertex AI)

Use your Google AI Pro, AI Ultra, or Workspace subscription for Gemini models. Tokens are injected as ADC bearer tokens that Gemini CLI, Vertex AI SDK, and the Gemini API pick up automatically.

Setup:

# Add your Google account to the pool (browser OAuth flow)
aidevops model-accounts-pool add google

# Restart OpenCode — token is injected as GOOGLE_OAUTH_ACCESS_TOKEN

Supported plans:

  • Google AI Pro (~$25/mo) — daily Gemini CLI limits
  • Google AI Ultra (~$65/mo) — higher daily limits
  • Google Workspace with Gemini add-on — enterprise daily limits

Isolation guarantee: Google auth failures never affect Anthropic/OpenAI/Cursor providers. A Google 429 or auth error only puts the Google pool into cooldown.

GitHub AI Agent Integration

Enable AI-powered issue resolution directly from GitHub. Comment /oc fix this on any issue and the AI creates a branch, implements the fix, and opens a PR.

Security-first design - The workflow includes:

  • Trusted users only (OWNER/MEMBER/COLLABORATOR)
  • ai-approved label required on issues before AI processing
  • Prompt injection pattern detection
  • Audit logging of all invocations
  • 15-minute timeout and rate limiting

Quick setup:

# 1. Install the OpenCode GitHub App
# Visit: https://github.com/apps/opencode-agent

# 2. Add API key secret for your chosen provider
# Repository → Settings → Secrets → OPENAI_API_KEY or ANTHROPIC_API_KEY

# 3. Create required labels
gh label create "ai-approved" --color "0E8A16" --description "Issue approved for AI agent"
gh label create "security-review" --color "D93F0B" --description "Requires security review"

The secure workflow is included at .github/workflows/opencode-agent.yml.

Usage:

Context Command Result
Issue (with ai-approved label) /oc fix this Creates branch + PR
Issue /oc explain this AI analyzes and replies
PR /oc review this PR Code review feedback
PR Files tab /oc add error handling here Line-specific fix

See .agents/tools/git/opencode-github-security.md for the full security documentation.

Supported AI tool: OpenCode is the recommended and tested AI coding tool for aidevops. All features, agents, and workflows are designed and tested for OpenCode first. We recommend OpenAI models for the best current results across all agent tiers: GPT-5.4 mini for fast triage/routine work and GPT-5.5 for complex implementation, review, and reasoning. Claude models (Anthropic) remain fully supported, and other providers are tested as their capabilities change.

Recommended stack:

  • OpenCode - The recommended AI coding agent. Powerful agentic TUI/CLI with native MCP support, Tab-based agent switching, LSP integration, plugin ecosystem, and excellent DX. All aidevops features are designed and tested for OpenCode first.
  • OpenCode Zen - Free tier of OpenCode with included models. Start working with AI straight away at no cost -- no API keys or subscriptions required.
  • OpenAI GPT-5.5 / GPT-5.4 mini - Recommended model pair for aidevops today. Use GPT-5.5 for complex reasoning and high-impact agent tiers; use GPT-5.4 mini for triage, routine implementation, and cost-efficient parallel workers.
  • Claude (Anthropic) - Fully supported alternative provider. Claude models remain useful for fallback, cross-provider verification, and users with Claude Pro/Max OAuth access.
  • Tabby - Recommended terminal. Colour-coded Profiles per project/repo, auto-syncs tab title with git repo/branch.
  • Zed - Recommended editor. High-performance with AI integration (use with the OpenCode Agent Extension).
Troubleshooting Auth

If you see "Anthropic Key Missing", "OpenAI Key Missing", or the model stops responding, run these commands from any terminal — no working model session required.

Step 1 — Check pool health

aidevops model-accounts-pool status       # counts: available / rate-limited / auth-error
aidevops model-accounts-pool check        # live token validity test per account

Step 2 — Fix based on what you see

Symptom Command
OpenAI account shows rate-limited aidevops model-accounts-pool rotate openai
Anthropic account shows rate-limited aidevops model-accounts-pool rotate anthropic
All accounts in cooldown aidevops model-accounts-pool reset-cooldowns
OpenAI account shows auth-error aidevops model-accounts-pool add openai (re-auth)
Anthropic account shows auth-error aidevops model-accounts-pool add anthropic (re-auth)
Pool is empty (no accounts) aidevops model-accounts-pool add openai
Recently re-authenticated, still broken aidevops model-accounts-pool assign-pending openai
Google Gemini CLI rate-limited aidevops model-accounts-pool rotate google
Google token expired aidevops model-accounts-pool add google (re-auth)

Step 3 — If still broken, re-add the account

aidevops model-accounts-pool add openai        # ChatGPT Plus/Pro
aidevops model-accounts-pool add anthropic     # Claude Pro/Max — opens browser OAuth
aidevops model-accounts-pool add cursor        # Cursor Pro (reads from local IDE)
aidevops model-accounts-pool add google        # Google AI Pro/Ultra/Workspace — browser OAuth
aidevops model-accounts-pool import claude-cli # Import from existing Claude CLI auth

Restart OpenCode after any add, rotate, or reset-cooldowns to pick up the new credentials.

Full command reference

aidevops model-accounts-pool status            # Pool health at a glance
aidevops model-accounts-pool list              # Per-account detail + expiry
aidevops model-accounts-pool check             # Live API validity test
aidevops model-accounts-pool rotate [provider] # Switch to next available account NOW
aidevops model-accounts-pool reset-cooldowns   # Clear all rate-limit cooldowns
aidevops model-accounts-pool assign-pending <p># Assign stranded pending token
aidevops model-accounts-pool remove <p> <email># Remove an account

Note: reset-cooldowns clears cooldowns in the pool file. If OpenCode is already running, the in-memory token endpoint cooldown is only cleared when OpenCode restarts or when you use the /model-accounts-pool reset-cooldowns slash command inside an active session.

If you prefer guided help: Open OpenCode with a free model (OpenCode Zen includes free models that don't require any API key or subscription) and run the auth troubleshooting agent by typing:

@auth-troubleshooting

The agent contains the full recovery flow and symptom table. Free models work fine for this — no paid subscription needed.

Terminal Tab Title Sync

Your terminal tab/window title automatically shows repo/branch context when working in git repositories. This helps identify which codebase and branch you're working on across multiple terminal sessions.

Supported terminals: Tabby, cmux, iTerm2, Kitty, Alacritty, WezTerm, Hyper, and most xterm-compatible terminals.

How it works: The pre-edit-check.sh script's primary role is enforcing git workflow protection (blocking edits on main/master branches). As a secondary, non-blocking action, it updates the terminal title via escape sequences. No configuration needed - it's automatic.

Example format: {repo}/{branch-type}/{description}

See .agents/tools/terminal/terminal-title.md for customization options.

Companion tool:

  • claude-code CLI - Called from within OpenCode for sub-tasks and headless dispatch

Collaborator compatibility: Projects initialized with aidevops init include pointer files (.cursorrules, .windsurfrules, etc.) that reference AGENTS.md, helping collaborators using other editors find project context. aidevops does not install into or configure those tools.

Repo courtesy files: aidevops init scaffolds standard repo files if they don't exist: DESIGN.md for GUI/interface repos, README.md, LICENCE (MIT), CHANGELOG.md, CONTRIBUTING.md, SECURITY.md, CODE_OF_CONDUCT.md. Author name and email are auto-detected from git config. Existing files are never overwritten.

Core Capabilities

AI-First Infrastructure Management:

  • SSH server access, remote command execution, API integrations
  • DNS management, application deployment, email monitoring
  • Git platform management, domain purchasing, setup automation
  • WordPress management, credential security, code auditing

Autonomous Orchestration:

  • Pulse supervisor - Autonomous AI supervisor runs every 2 minutes via launchd — merges ready PRs, dispatches workers, kills stuck processes, detects orphaned PRs, syncs TODO state with GitHub, triages quality findings, and advances missions. No human in the loop
  • Missions - Multi-day autonomous projects: /mission scopes a high-level goal into milestones and features. The pulse dispatches workers, validates milestones, tracks budget, and advances through the project automatically (mission-dashboard-helper.sh)
  • Multi-model verification - Destructive operations (force push, production deploy, data migration) are verified by a second AI model from a different provider before execution. Different providers have different failure modes, so correlated hallucinations are rare
  • Supervisor - SQLite state machine dispatches tasks to parallel AI agents with retry cycles, batch management, and cron scheduling
  • Runners - Named headless agent instances with persistent identity, instructions, and memory namespaces
  • /runners command - Batch dispatch from task IDs, PR URLs, or descriptions with concurrency control and progress monitoring
  • Self-hosted runner operations - GitHub runner lifecycle, storage, Docker foreground mode, systemd timers, and cleanup race guidance for reliable local/hosted worker capacity
  • Mailbox - SQLite-backed inter-agent messaging for coordination across parallel sessions
  • Worktree isolation - Each agent works on its own branch in a separate directory, no merge conflicts
  • Budget tracking - Append-only cost log (budget-tracker-helper.sh) with burn-rate analysis and /budget-analysis command for model routing decisions
  • Observability - LLM request capture plugin (observability.mjs) for cost tracking, performance analysis, and debugging
  • Rate limits and API budget diagnostics - Per-provider rate-limit configuration, secondary cooldown capture, reset-aware pacing, and compact API budget summaries for pulse and worker decisions

Project Intelligence:

  • Bundles - Project-type presets that auto-configure model tiers, quality gates, and agent routing per repo. 7 built-in bundles (web-app, library, cli-tool, content-site, infrastructure, agent, schema) with auto-detection from marker files (bundle-helper.sh)
  • TTSR rules - Soft rule engine (ttsr-rule-loader.sh) with .agents/rules/ directory for AI output correction (e.g., no-edit-on-main, no-glob-for-discovery)
  • Cross-review - /cross-review dispatches the same prompt to multiple AI models in parallel, diffs results, and optionally auto-scores via a judge model
  • Local models - Run AI models locally via llama.cpp for free, private, offline inference (local-model-helper.sh) with HuggingFace GGUF model management
  • Tech stack lookup - /tech-stack detects technology stacks of URLs or finds sites using specific technologies (Wappalyzer, httpx, nuclei, BuiltWith)
  • IP reputation - ip-reputation-helper.sh checks IP addresses against multiple reputation databases (Spamhaus, ProxyCheck, AbuseIPDB) before VPS purchase or deployment
  • Mobile app guidance - Expo, Swift/Xcode, App Store Connect, simulator automation, push/onboarding/monetisation/testing, and mobile web previews through serve-sim
  • GUI control plane planning - Local-first product scope, stack and repo-layout ADRs, GUI trust boundaries, and threat model for future aidevops UI surfaces

Conversational Memory & Entity System:

  • Entity memory - Cross-channel relationship continuity (entity-helper.sh): people, agents, and services tracked across Matrix, SimpleX, email, and CLI with versioned profiles
  • Conversational memory - Per-conversation context management (conversation-helper.sh): idle detection, immutable summaries, tone profile extraction
  • Three-layer architecture - Layer 0 (immutable raw log), Layer 1 (tactical summaries), Layer 2 (strategic entity profiles) in shared SQLite

Communications:

  • SimpleX bot - Channel-agnostic gateway with SimpleX Chat as first adapter for AI agent dispatch (simplex-bot/)
  • Matterbridge - Multi-platform chat bridge connecting 20+ platforms including Matrix, Discord, Telegram, Slack, IRC, WhatsApp, XMPP (matterbridge-helper.sh)
  • X API via xurl - Official X/Twitter API operations through guarded xurl workflows for search, timelines, bookmarks, posting, replies, DMs, media, and raw API reads. Supports multiple X developer apps/subscription tiers with --app and multiple authenticated accounts with --username; model-provider auth such as OpenCode xAI/Grok remains separate from X API OAuth (content/social-xurl.md, xurl-helper.sh)
  • Localdev - Local development environment manager with dnsmasq, Traefik, mkcert for production-like .local domains with HTTPS (localdev-helper.sh)

MCP Toolkit:

  • MCPorter - Discover, call, compose, and generate CLIs/typed clients for MCP servers (mcporter npm package)
  • OpenAPI Search - Search and explore any OpenAPI specification via MCP (zero install, Cloudflare Worker)
  • Cloudflare Code Mode - Full Cloudflare API (2,500+ endpoints) via 2 tools in ~1,000 tokens

Unified Interface:

  • Standardized commands across all providers
  • Automated SSH configuration and multi-account support for all services
  • Security-first design with comprehensive logging, code quality reviews, and continual feedback-based improvement

Quality Control & Monitoring:

  • Multi-Platform Analysis: SonarCloud, CodeFactor, Codacy, CodeRabbit, Qlty, Gemini Code Assist, Snyk
  • Review gate preferences: choose whether true review-bot rate limits block merges (aidevops review-gate owner/repo wait) or allow merge with follow-up quality coverage (aidevops review-gate owner/repo pass, the default). Per-tool overrides are supported, for example aidevops review-gate owner/repo --tool coderabbitai wait. Failed, skipped, or placeholder bot states are not treated as rate limits and continue to block until a real review/status appears or a human resolves them.
  • Performance Auditing: PageSpeed Insights, Lighthouse, WebPageTest, Core Web Vitals (/performance command)
  • SEO Toolchain: 40+ SEO subagents including Semrush, Ahrefs, ContentKing, Screaming Frog, Bing Webmaster Tools, Rich Results Test, programmatic SEO, analytics tracking, schema validation, content analysis, keyword mapping, and AI readiness
  • SEO Debugging: Open Graph validation, favicon checker, social preview testing
  • Email Deliverability: SPF/DKIM/DMARC/MX validation, blacklist checking
  • Uptime Monitoring: Updown.io integration for website and SSL monitoring

Imported Skills

aidevops includes curated skills imported from external sources. Skills support automatic update tracking:

Skill Source Description
cloudflare-platform dmmulroy/cloudflare-skill 60 Cloudflare products: Workers, Pages, D1, R2, KV, Durable Objects, AI, networking, security
heygen heygen-com/skills AI avatar video creation API: avatars, voices, video generation, streaming, webhooks
remotion remotion-dev/skills Programmatic video creation with React, animations, rendering
video-prompt-design snubroot/Veo-3-Meta-Framework AI video prompt engineering - 7-component meta prompt framework for Veo 3
animejs animejs.com JavaScript animation library patterns and API (via Context7)
caldav-calendar ClawdHub CalDAV calendar sync via vdirsyncer + khal (iCloud, Google, Fastmail, Nextcloud)
proxmox-full ClawdHub Complete Proxmox VE hypervisor management via REST API

CLI Commands:

aidevops skill add <owner/repo>    # Import a skill from GitHub
aidevops skill add clawdhub:<slug> # Import a skill from ClawdHub
aidevops skill list                # List imported skills
aidevops skill check               # Check for upstream updates
aidevops skill update [name]       # Update specific or all skills
aidevops skill scan [name]         # Security scan skills (Cisco Skill Scanner)
aidevops skill remove <name>       # Remove an imported skill

Skills are registered in ~/.aidevops/agents/configs/skill-sources.json with upstream tracking for update detection.

Security Scanning:

Imported skills are automatically security-scanned using Cisco Skill Scanner when installed. Scanning runs on both initial import and updates -- pulling a new version of a skill triggers the same security checks as the first import. CRITICAL/HIGH findings block the operation; MEDIUM/LOW findings warn but allow. Telemetry is disabled - no data is sent to third parties.

When a VirusTotal API key is configured (aidevops secret set VIRUSTOTAL_MARCUSQUINN), an advisory second layer scans file hashes against 70+ AV engines and checks domains/URLs referenced in skill content. VT scans are non-blocking -- the Cisco scanner remains the security gate.

Scenario Security scan runs? CRITICAL/HIGH blocks?
aidevops skill add <source> Yes Yes
aidevops skill update [name] Yes Yes
aidevops skill add <source> --force Yes Yes
aidevops skill add <source> --skip-security Yes (reports only) No (warns)
aidevops skill scan [name] Yes (standalone) Report only

The --force flag only controls file overwrite behavior (replacing an existing skill without prompting). To bypass security blocking, use --skip-security explicitly -- this separation ensures that routine updates and re-imports never silently skip security checks.

Scan results are logged to .agents/SKILL-SCAN-RESULTS.md automatically on each batch scan and skill import, providing a transparent audit trail of security posture over time.

Browse community skills: skills.sh | ClawdHub | Specification: agentskills.io

Reference:

Agent Sources (Private Repos)

Sync agents from private Git repositories into the framework. Private repos keep their own agents, helper scripts, and slash commands — aidevops sources sync deploys them alongside the core agents.

aidevops sources add ~/Git/my-private-agents     # Register a local repo
aidevops sources add-remote git@github.com:u/r.git  # Clone and register a remote repo
aidevops sources list                             # List configured sources
aidevops sources sync                             # Sync all sources
aidevops sources remove my-private-agents         # Remove a source

How it works: Private repos contain a .agents/ directory with agent subdirectories. Agents with mode: primary in their frontmatter are symlinked to the agents root for auto-discovery as primary agent tabs. Markdown files with agent: frontmatter are deployed as /slash commands. All sources sync automatically during aidevops update.

Reference: .agents/aidevops/agent-sources.md

Agent Design Patterns

aidevops implements proven agent design patterns identified by Lance Martin (LangChain).

Pattern Description aidevops Implementation
Give Agents a Computer Filesystem + shell for persistent context ~/.aidevops/.agent-workspace/, 1,480+ helper scripts
Multi-Layer Action Space Few tools, push actions to computer Per-agent MCP filtering (~12-20 tools each)
Knowledge Graph Routing Indexed, cross-referenced agents instead of isolated skills subagent-index.toon maps 2,050+ agents by domain, purpose, and dependency — agents discover related context through the graph, not just their own file
Progressive Disclosure Load context on-demand Subagent routing with content summaries, YAML frontmatter, read-on-demand
Offload Context Write results to filesystem .agent-workspace/work/[project]/ for persistence
Cache Context Prompt caching for cost Stable instruction prefixes
Isolate Context Sub-agents with separate windows Subagent files with specific tool permissions
Multi-Agent Orchestration Coordinate parallel agents TOON mailbox, agent registry, supervisor dispatch
Compaction Resilience Preserve context across compaction OpenCode plugin injects dynamic state at compaction time
Ralph Loop Iterative execution until complete /full-loop, full-loop-helper.sh
Evolve Context Learn from sessions /remember, /recall with SQLite FTS5 + opt-in semantic search
Pattern Tracking Learn what works/fails /patterns command, memory-helper.sh
Token-Efficient Serialisation Minimise context overhead for structured data TOON format — 20-60% token reduction vs JSON/YAML for agent indexes, registries, and data exchange
Token-Efficient Tool Output Summarise noisy terminal output without hiding evidence RTK is installed by default during setup; start with rtk-helper.sh for compact supported summaries, then rerun raw/direct commands when filtered output is insufficient; bypass compression for file reads, JSON assertions, exact diffs, security scans, and other verbatim evidence
Cost-Aware Routing Match model to task complexity model-routing.md with provider-aware tier guidance, /route command
Model Comparison Compare models side-by-side /compare-models (live data), /compare-models-free (offline)
Response Scoring Evaluate actual model outputs /score-responses with structured criteria

Key insight: Context is a finite resource with diminishing returns. aidevops treats every token as precious - loading only what's needed, when it's needed.

See .agents/aidevops/architecture.md for detailed implementation notes and references.

Multi-Agent Orchestration

Run multiple AI agents in parallel on separate branches, coordinated through a lightweight mailbox system. Each agent works independently in its own git worktree while the supervisor manages task distribution and status reporting.

Architecture:

Supervisor (pulse loop)
├── Agent Registry (TOON format - who's active, what branch, idle/busy)
├── Mailbox System (SQLite WAL-mode, indexed queries)
│   ├── task_assignment → worker inbox
│   ├── status_report → coordinator outbox
│   └── broadcast → all agents
└── Model Routing (tier-based: GPT-5.4 mini / GPT-5.5 / provider fallbacks)

Key components:

Component Script Purpose
Mailbox mail-helper.sh SQLite-backed inter-agent messaging (send, check, broadcast, archive)
Supervisor supervisor-helper.sh Autonomous multi-task orchestration with SQLite state machine, batches, retry cycles, cron scheduling, auto-pickup from TODO.md
Registry mail-helper.sh register Agent registration with role, branch, worktree, heartbeat
Model routing model-routing.md, /route Cost-aware routing across OpenAI, Anthropic, Gemini, Cursor, Grok, and local providers
Budget tracking budget-tracker-helper.sh Append-only cost log for model routing decisions
Observability observability.mjs plugin LLM request capture for cost tracking and performance analysis

How it works:

  1. Each agent registers on startup (mail-helper.sh register --role worker)
  2. Supervisor runs periodic pulses (supervisor-helper.sh pulse)
  3. Pulse collects status reports, dispatches queued tasks to idle workers
  4. Agents send completion reports back via mailbox
  5. SQLite WAL mode + busy_timeout handles concurrent access (79x faster than previous file-based system)

Compaction plugin (.agents/plugins/opencode-aidevops/): When OpenCode compacts context (at ~200K tokens), the plugin injects current session state - agent registry, pending mailbox messages, git context, and relevant memories - ensuring continuity across compaction boundaries.

Custom system prompt (.agents/prompts/build.txt): Based on upstream OpenCode with aidevops-specific overrides for tool preferences, professional objectivity, and per-model reinforcements for weaker models.

Subagent index (.agents/subagent-index.toon): Compressed TOON routing table listing all agents, subagents, workflows, and scripts with model tier assignments - enables fast agent discovery without loading full markdown files.

Autonomous Orchestration & Parallel Agents

Why this matters: Long-running tasks -- batch PR reviews, multi-site audits, large refactors, multi-day feature projects -- are where AI agents deliver the most value. Instead of babysitting one task at a time, the supervisor dispatches work to parallel agents, each in its own git worktree, with automatic retry, progress tracking, and batch completion reporting.

Pulse Supervisor: Autonomous AI Operations

The pulse is the heartbeat of aidevops — an autonomous AI supervisor that runs every 2 minutes via launchd. There is no human at the terminal. It manages the entire development pipeline across all repos registered with pulse: true.

What it does each cycle:

Phase Action
Capacity check Circuit breaker, dynamic worker slots calculated from available RAM
Merge ready PRs Green CI + no blocking reviews → squash merge (free — no worker slot needed)
Fix failing PRs Dispatch a worker to fix CI failures or address review feedback
Detect stuck work PRs open 6+ hours with no activity → flag or close and re-file
Dispatch workers Route open issues to available worker slots, respecting priority and blocked-by: dependencies
Advance missions Check active multi-day missions, dispatch features, validate milestones, track budget
Triage quality Read daily quality sweep findings (ShellCheck, SonarCloud, Codacy, CodeRabbit), create issues for actionable findings
Sync TODOs Create GitHub issues for unsynced TODO entries, commit ref changes
Respect API budget Use cached/prefetched GitHub metadata, cooldown headers, and ramp pacing before spending more API calls
Kill stuck workers Workers running 3+ hours with no PR are killed to free slots
Detect orphaned PRs Open PRs with no active worker and no activity for 6+ hours are flagged for re-dispatch

Operational intelligence:

  • Struggle-ratio — computes messages / max(1, commits) for each active worker. High ratio (>30) with >30 min elapsed and zero commits flags the worker as "struggling". Ratio >50 after 1 hour flags "thrashing". Informational signal — the supervisor LLM decides the action (kill, wait, re-dispatch with more context)
  • Circuit breaker — prevents cascading failures by tracking success/failure rates and tripping when error rate exceeds threshold
  • Dynamic concurrency — worker slot count adapts to available RAM, not a hardcoded constant
  • API budget diagnostics — compact reports show GitHub core/search usage, cooldown provenance, cached PR metadata freshness, and pacing decisions before pulse burns through API quota
  • Worker failure families — headless runtime errors, local runtime diagnostics, blocked-by lookup gaps, provider quota/credit exhaustion, and review-thread remediation issues are classified for targeted redispatch
  • Stale assignment recovery — tasks assigned to workers that died (no active process, no PR, 3+ hours stale) are automatically unassigned and made available for re-dispatch
  • Priority ordering — green PRs (free merge) > failing PRs (closer to done) > high-priority/bug issues > active mission features > product repos > smaller tasks > oldest

The pulse is an LLM, not a script. It reads issue bodies, assesses context, and uses judgment. When it encounters something unexpected — an issue body that says "completed", a task with no clear description, a label that doesn't match reality — it handles it the way a competent human manager would.

# Pulse runs automatically via launchd (every 2 minutes)
# Manual trigger:
opencode run "/pulse"

See: .agents/scripts/commands/pulse.md for the full supervisor specification.

Missions: Multi-Day Autonomous Projects

Missions are the highest-level orchestration primitive — autonomous multi-day projects that break a high-level goal into milestones, features, and validation criteria. The pulse supervisor advances them automatically.

# Scope a mission interactively
/mission "Redesign the landing pages for mobile-first with A/B testing"

How missions work:

  1. /mission scopes the goal into milestones with features and acceptance criteria
  2. Each feature becomes a TODO entry tagged mission:mNNN with a GitHub issue
  3. The pulse dispatches features as regular workers (respecting MAX_WORKERS)
  4. When all features in a milestone complete, the pulse dispatches a validation worker to verify integration
  5. Passed milestones advance automatically — the next milestone's features are dispatched
  6. Budget tracking pauses the mission if any category exceeds the alert threshold (default 80%)

Two execution modes:

Mode Workflow Best for
Full Worktree + PR per feature, standard review flow Production code, collaborative projects
POC Direct commits, skip ceremony Prototypes, experiments, proof-of-concept

Mission state is tracked in a JSON file committed to the repo. Each pulse cycle reads the state, acts on it, and commits updates — so any session (or the next pulse) can pick up where the last one left off.

See: .agents/workflows/mission-orchestrator.md for the full orchestrator specification, .agents/scripts/commands/dashboard.md for the mission progress dashboard.

Multi-Model Verification: Cross-Provider Safety

High-stakes operations are verified by a second AI model from a different provider before execution. This catches single-model hallucinations before destructive operations cause irreversible damage.

When verification triggers:

Risk Level Examples Action
Critical git push --force to main, DROP DATABASE, production deploy Blocked unless second model agrees
High Force push to task ref, data migration, secret exposure Warned, verification recommended
Medium Bulk file deletion, config changes Logged
Low Normal edits, test runs No verification

How it works:

  1. pre-edit-check.sh screens operations against the high-stakes taxonomy
  2. For critical/high operations, verify-operation-helper.sh sends the operation context to a second model (different provider than the primary)
  3. The verifier independently assesses whether the operation is safe
  4. On disagreement, the operation is blocked (critical) or warned (high)
  5. All verification decisions are logged for audit

Why cross-provider? Same-provider models share training data and failure modes. A GPT hallucination is unlikely to be reproduced by Claude or Gemini, and vice versa. The verifier uses the cheapest suitable model tier, so cost is minimal per check.

Configuration: Per-repo via .agents/reference/high-stakes-operations.md. Opt-out with VERIFY_ENABLED=false (not recommended).

See: .agents/tools/verification/parallel-verify.md for the verification agent specification.

Project Bundles: Auto-Configuration

Bundles are project-type presets that auto-configure model tiers, quality gates, and agent routing per repo. Instead of manually configuring each project, bundles detect what kind of project you're working on and apply sensible defaults.

Built-in bundles:

Bundle Auto-detected by Model default Quality gates Agent routing
web-app package.json + framework markers standard Full (lint, test, build, a11y) Build+ default
library package.json with main/exports standard Full + API docs check Build+ default
cli-tool bin field in package.json standard ShellCheck, test Build+ default
content-site CMS markers, wp-config.php fast Lighthouse, SEO Marketing for content tasks
infrastructure Dockerfile, terraform/, ansible/ standard ShellCheck, security scan Build+ default
agent AGENTS.md, .agents/ thinking Agent review, prompt quality Build+ default

Resolution priority: Explicit bundle field in repos.json > .aidevops.json project config > auto-detection from marker files.

CLI:

bundle-helper.sh detect <repo-path>    # Auto-detect bundle type
bundle-helper.sh resolve <repo-path>   # Show resolved config (with overrides)
bundle-helper.sh show <bundle-name>    # Show bundle defaults
bundle-helper.sh list                  # List all available bundles

See: .agents/bundles/ for bundle definitions, .agents/scripts/bundle-helper.sh for the CLI.

Parallel Agents & Headless Dispatch

Run multiple AI sessions concurrently with isolated contexts. Named runners provide persistent agent identities with their own instructions and memory.

Feature Description
Headless dispatch opencode run for one-shot tasks, opencode serve + --attach for warm server
Runners Named agent instances with per-runner AGENTS.md, config, and run logs (runner-helper.sh)
Self-hosted runner runbooks GitHub runner storage, lifecycle, Docker foreground mode, timer freshness, and cleanup-race guidance (.agents/reference/github-self-hosted-runners.md)
Session management Resume sessions with -s <id> or -c, fork with SDK
Memory namespaces Per-runner memory isolation with shared access when needed
SDK orchestration @opencode-ai/sdk for TypeScript parallel dispatch via Promise.all
Matrix integration Chat-triggered dispatch via self-hosted Matrix (optional)
# Create a named runner
runner-helper.sh create code-reviewer --description "Reviews code for s

Keywords