Skills Overview
Hermes Agent complete skill library · covering all domains, including AI, development, operations, and creative work
api-and-interface-design
Guides stable API and interface design. Use when designing APIs, module boundaries, or any public interface. Use when creating REST or GraphQL endpoints, defining type contracts between modules, or establishing boundaries between frontend and backend.
browser-testing-with-devtools
Tests in real browsers. Use when building or debugging anything that runs in a browser. Use when you need to inspect the DOM, capture console errors, analyze network requests, profile performance, or verify visual output with real runtime data via Chrome DevTools MCP.
ci-cd-and-automation
Automates CI/CD pipeline setup. Use when setting up or modifying build and deployment pipelines. Use when you need to automate quality gates, configure test runners in CI, or establish deployment strategies.
code-review-and-quality
Conducts multi-axis code review. Use before merging any change. Use when reviewing code written by yourself, another agent, or a human. Use when you need to assess code quality across multiple dimensions before it enters the main branch.
code-simplification
Simplifies code for clarity. Use when refactoring code for clarity without changing behavior. Use when code works but is harder to read, maintain, or extend than it should be. Use when reviewing code that has accumulated unnecessary complexity.
context-engineering
Optimizes agent context setup. Use when starting a new session, when agent output quality degrades, when switching between tasks, or when you need to configure rules files and context for a project.
debugging-and-error-recovery
Guides systematic root-cause debugging. Use when tests fail, builds break, behavior doesn't match expectations, or you encounter any unexpected error. Use when you need a systematic approach to finding and fixing the root cause rather than guessing.
deprecation-and-migration
Manages deprecation and migration. Use when removing old systems, APIs, or features. Use when migrating users from one implementation to another. Use when deciding whether to maintain or sunset existing code.
documentation-and-adrs
Records decisions and documentation. Use when making architectural decisions, changing public APIs, shipping features, or when you need to record context that future engineers and agents will need to understand the codebase.
doubt-driven-development
Subjects every non-trivial decision to a fresh-context adversarial review before it stands. Use when correctness matters more than speed, when working in unfamiliar code, when stakes are high (production, security-sensitive logic, irreversible operations), or any time a confident output would be cheaper to verify now than to debug later.
frontend-ui-engineering
Builds production-quality UIs. Use when building or modifying user-facing interfaces. Use when creating components, implementing layouts, managing state, or when the output needs to look and feel production-quality rather than AI-generated.
git-workflow-and-versioning
Structures git workflow practices. Use when making any code change. Use when committing, branching, resolving conflicts, or when you need to organize work across multiple parallel streams.
idea-refine
Refines ideas iteratively through structured divergent and convergent thinking. Use "idea-refine" or "ideate" to trigger.
incremental-implementation
Delivers changes incrementally. Use when implementing any feature or change that touches more than one file. Use when you're about to write a large amount of code at once, or when a task feels too big to land in one step.
performance-optimization
Optimizes application performance. Use when performance requirements exist, when you suspect performance regressions, or when Core Web Vitals or load times need improvement. Use when profiling reveals bottlenecks that need fixing.
planning-and-task-breakdown
Breaks work into ordered tasks. Use when you have a spec or clear requirements and need to break work into implementable tasks. Use when a task feels too large to start, when you need to estimate scope, or when parallel work is possible.
security-and-hardening
Hardens code against vulnerabilities. Use when handling user input, authentication, data storage, or external integrations. Use when building any feature that accepts untrusted data, manages user sessions, or interacts with third-party services.
shipping-and-launch
Prepares production launches. Use when preparing to deploy to production. Use when you need a pre-launch checklist, when setting up monitoring, when planning a staged rollout, or when you need a rollback strategy.
source-driven-development
Grounds every implementation decision in official documentation. Use when you want authoritative, source-cited code free from outdated patterns. Use when building with any framework or library where correctness matters.
spec-driven-development
Creates specs before coding. Use when starting a new project, feature, or significant change and no specification exists yet. Use when requirements are unclear, ambiguous, or only exist as a vague idea.
using-agent-skills
Discovers and invokes agent skills. Use when starting a session or when you need to discover which skill applies to the current task. This is the meta-skill that governs how all other skills are discovered and invoked.
claude-code
Delegate coding to Claude Code CLI (features, PRs).
codex
Delegate coding to OpenAI Codex CLI (features, PRs).
hermes-agent
Configure, extend, or contribute to Hermes Agent.
opencode
Delegate coding to OpenCode CLI (features, PR review).
architecture-diagram
Dark-themed SVG architecture/cloud/infra diagrams as HTML.
ascii-art
ASCII art: pyfiglet, cowsay, boxes, image-to-ascii.
ascii-video
ASCII video: convert video/audio to colored ASCII MP4/GIF.
baoyu-comic
Knowledge comics (知识漫画): educational, biography, tutorial.
baoyu-infographic
Infographics: 21 layouts x 21 styles (信息图, 可视化).
claude-design
Design one-off HTML artifacts (landing, deck, prototype).
comfyui
Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution.
creative-ideation
Generate project ideas via creative constraints.
design-md
Author/validate/export Google's DESIGN.md token spec files.
drawio-headless
Generate architecture diagrams with draw.io in headless/server environments (WSL2, VPS, Docker)
excalidraw
Hand-drawn Excalidraw JSON diagrams (arch, flow, seq).
humanizer
Humanize text: strip AI-isms and add real voice.
manim-video
Manim CE animations: 3Blue1Brown math/algo videos.
p5js
p5.js sketches: gen art, shaders, interactive, 3D.
pixel-art
Pixel art w/ era palettes (NES, Game Boy, PICO-8).
popular-web-designs
54 real design systems (Stripe, Linear, Vercel) as HTML/CSS.
pretext
Use when building creative browser demos with @chenglou/pretext — DOM-free text layout for ASCII art, typographic flow around obstacles, text-as-geometry games, kinetic typography, and text-powered generative art. Produces single-file HTML demos by default.
sketch
Throwaway HTML mockups: 2-3 design variants to compare.
social-media-slideshow-video
PIL + ffmpeg slideshow videos: product reviews, promos, TikTok/Reels/Shorts.
songwriting-and-ai-music
Songwriting craft and Suno AI music prompts.
touchdesigner-mcp
Control a running TouchDesigner instance via twozero MCP — create operators, set parameters, wire connections, execute Python, build real-time visuals. 36 native tools.
visual-assets-generation
Generate visual assets (ASCII art, diagrams, banners) for repos and documentation
jupyter-live-kernel
Iterative Python via live Jupyter kernel (hamelnb).
api-monitoring-bots
Build monitoring bots that poll APIs and send notifications on state changes (new listings, price alerts, status updates)
cloud-browser-automation
Use cloud browser services (Browserbase) for Cloudflare bypass, JavaScript rendering, and stealth scraping when local tools fail
docker-compose
Multi-container Docker applications with docker-compose — define services, networks, volumes, and orchestrate local development environments
kanban-orchestrator
Decomposition playbook + specialist-roster conventions + anti-temptation rules for an orchestrator profile routing work through Kanban. The "don't do the work yourself" rule and the basic lifecycle are auto-injected into every kanban worker's system prompt; this skill is the deeper playbook when you're specifically playing the orchestrator role.
kanban-worker
Pitfalls, examples, and edge cases for Hermes Kanban workers. The lifecycle itself is auto-injected into every worker's system prompt as KANBAN_GUIDANCE (from agent/prompt_builder.py); this skill is what you load when you want deeper detail on specific scenarios.
tinyfish-integration
Integrate TinyFish web toolkit (search, fetch, browser automation) into Hermes Agent
vps-cleanup
Systematic VPS cleanup — analyze files, categorize by importance, safely delete temporary/old data to free disk space
vps-security-hardening
Audit and harden VPS security — fail2ban, SSH hardening, firewall setup
webhook-subscriptions
Webhook subscriptions: event-driven agent runs.
domain-intel
Passive domain reconnaissance using Python stdlib. Use this skill for subdomain discovery, SSL certificate inspection, WHOIS lookups, DNS records, domain availability checks, and bulk multi-domain analysis. No API keys required. Triggers on requests like "find subdomains", "check ssl cert", "whois lookup", "is this domain available", "bulk check these domains".
himalaya
Himalaya CLI: IMAP/SMTP email from terminal.
minecraft-modpack-server
Host modded Minecraft servers (CurseForge, Modrinth).
pokemon-player
Play Pokemon via headless emulator + RAM reads.
codebase-inspection
Inspect codebases w/ pygount: LOC, languages, ratios.
comprehensive-public-repo-setup
Create production-ready public repos with complete documentation, automated setup, bundled dependencies, and user-friendly installation.
github-auth
GitHub auth setup: HTTPS tokens, SSH keys, gh CLI login.
github-code-review
Review PRs: diffs, inline comments via gh or REST.
github-issues
Create, triage, label, assign GitHub issues via gh or REST.
github-pr-workflow
GitHub PR lifecycle: branch, commit, open, CI, merge.
github-repo-management
Clone/create/fork repos; manage remotes, releases.
github-repo-visual-assets
Create professional visual assets for GitHub repositories — architecture diagrams, social cards, and README banners.
public-repo-creation
Create production-ready public GitHub repositories with comprehensive documentation, automated setup, and quality assurance
repo-quality-maksimalisasi
Evaluate GitHub repo quality and raise it from 7/10 to a perfect 10/10
inference-sh
Run 150+ AI applications in the cloud via the inference.sh platform. Triggers on "generate image with FLUX", "create video", "use Veo/Seedance", "run inference.sh", "infsh CLI", "single key for image+video+LLM". One API key covers image generation (FLUX, Reve, Seedream, Grok, Gemini), video (Veo, Wan, Seedance, OmniHuman, HunyuanVideo), LLMs (Claude, Gemini, Kimi, GLM-4), search (Tavily, Exa), 3D (Rodin), social (Twitter/X), and audio (TTS, voice cloning).
native-mcp
MCP client: connect servers, register tools (stdio/HTTP).
gif-search
Search/download GIFs from Tenor via curl + jq.
heartmula
HeartMuLa: Suno-like song generation from lyrics + tags.
songsee
Audio spectrograms/features (mel, chroma, MFCC) via CLI.
spotify
Spotify: play, search, queue, manage playlists and devices.
youtube-content
YouTube transcripts to summaries, threads, blogs.
crypto-mining-setup
Set up and optimize cryptocurrency mining operations — AI-powered mining (soul.md protocol), parallel agent deployment, accumulation strategies, and performance optimization.
huggingface-hub
HuggingFace hf CLI: search/download/upload models, datasets.
windows-local-ai-services
Run local AI services on Windows — port binding, firewall, WSL2 networking, and common pitfalls.
obsidian
Read, search, create, and edit notes in the Obsidian vault.
obsidian-mobile-sync
Set up Obsidian mobile sync via GitHub (free) or Obsidian Sync (paid). Includes GitHub CLI automation, MGit/Working Copy setup, and Mnemosyne integration patterns.
airtable
Airtable REST API via curl. Records CRUD, filters, upserts.
google-workspace
Gmail, Calendar, Drive, Docs, Sheets via gws CLI or Python.
linear
Linear: manage issues, projects, teams via GraphQL + curl.
maps
Geocode, POIs, routes, timezones via OpenStreetMap/OSRM.
nano-pdf
Edit PDF text/typos/titles via nano-pdf CLI (NL prompts).
notion
Notion API via curl: pages, databases, blocks, search.
ocr-and-documents
Extract text from PDFs/scans (pymupdf, marker-pdf).
powerpoint
Create, read, edit .pptx decks, slides, notes, templates.
godmode
Jailbreak LLMs: Parseltongue, GODMODE, ULTRAPLINIAN.
arxiv
Search arXiv papers by keyword, author, category, or ID.
blogwatcher
Monitor blogs and RSS/Atom feeds via blogwatcher-cli tool.
credential-pooling-analysis
Analyze credential pooling operations and API reseller business models — economics, risks, detection patterns, and sustainability
crypto-token-analysis
Deep-dive framework for analyzing crypto tokens — market data, liquidity health, tokenomics, unlock schedules, and risk assessment. Combines on-chain data, API queries, and multi-source validation to generate actionable investment verdicts.
llm-wiki
Karpathy's LLM Wiki: build/query interlinked markdown KB.
nft-analysis
Analyze NFT projects for investment decisions — evaluate fundamentals, identify red flags, assess risk, and provide buy/hold/avoid recommendations
polymarket
Query Polymarket: markets, prices, orderbooks, history.
research-paper-writing
Write ML papers for NeurIPS/ICML/ICLR: design→submit.
telegram-bot-security-analysis
Reverse-engineer and security-test Telegram bots — API analysis, callback interception, exploit discovery, and vulnerability documentation
trending-repos-discovery
Discover and analyze trending GitHub repositories from TrendShift and other sources — evaluate usefulness, extract learnings, and identify valuable skills/tools
web-scraping
Extract data from websites, including JavaScript-rendered SPAs and dynamic content
openhue
Control Philips Hue lights, scenes, rooms via OpenHue CLI.
social-media-account-audit
Audit social media accounts (TikTok, IG, etc.): scrape profiles, calculate engagement metrics, diagnose performance drops, interpret analytics screenshots.
xurl
X/Twitter via xurl CLI: post, search, DM, media, v2 API.
api-testing
REST/GraphQL API testing with automated validation — test endpoints, validate responses, check status codes, and ensure API contracts
contributing-to-ide-projects
Comprehensive workflow for contributing features to open-source IDE and coding tool projects
debugging-hermes-tui-commands
Debug Hermes TUI slash commands: Python, gateway, Ink UI.
ecosystem-tool-evaluation
Evaluate and install ecosystem tools (Hermes plugins, skills, integrations) — assess complexity vs. benefit before installation, show examples for visual tools, and avoid "cloned but not configured" states.
hermes-agent-skill-authoring
Author in-repo SKILL.md: frontmatter, validator, structure.
node-inspect-debugger
Debug Node.js via --inspect + Chrome DevTools Protocol CLI.
open-source-contribution
Comprehensive workflow for contributing to open-source projects with quality checks
plan
Plan mode: write markdown plan to .hermes/plans/, no exec.
python-debugpy
Debug Python: pdb REPL + debugpy remote (DAP).
spike
Throwaway experiments to validate an idea before build.
user-ryzen-preferences
User ryzen's workflow preferences, communication style, and anti-patterns to avoid
brainstorming
You MUST use this before any creative work - creating features, building components, adding functionality, or modifying behavior. Explores user intent, requirements and design before implementation.
dispatching-parallel-agents
Use when facing 2+ independent tasks that can be worked on without shared state or sequential dependencies
executing-plans
Use when you have a written implementation plan to execute in a separate session with review checkpoints
finishing-a-development-branch
Use when implementation is complete, all tests pass, and you need to decide how to integrate the work - guides completion of development work by presenting structured options for merge, PR, or cleanup
receiving-code-review
Use when receiving code review feedback, before implementing suggestions, especially if feedback seems unclear or technically questionable - requires technical rigor and verification, not performative agreement or blind implementation
superpowers-requesting-code-review
Use when completing tasks, implementing major features, or before merging to verify work meets requirements
superpowers-subagent-driven-development
Use when executing implementation plans with independent tasks in the current session
superpowers-systematic-debugging
Use when encountering any bug, test failure, or unexpected behavior, before proposing fixes
superpowers-tdd
Use when implementing any feature or bugfix, before writing implementation code
using-git-worktrees
Use when starting feature work that needs isolation from current workspace or before executing implementation plans - ensures an isolated workspace exists via native tools or git worktree fallback
using-superpowers
Use when starting any conversation - establishes how to find and use skills, requiring Skill tool invocation before ANY response including clarifying questions
verification-before-completion
Use when about to claim work is complete, fixed, or passing, before committing or creating PRs - requires running verification commands and confirming output before making any success claims; evidence before assertions always
superpowers-writing-plans
Use when you have a spec or requirements for a multi-step task, before touching code
writing-skills
Use when creating new skills, editing existing skills, or verifying skills work before deployment