Kubernetes Agent Swarm

Kubernetes & OpenShift Platform Agent Swarm — A coordinated multi-agent system for cluster operations. Includes Orchestrator (Jarvis), Cluster Ops (Atlas), G...
Author: kcns008
Category: Developer Tools | Source: clawhub | Version: v2.1.0 (latest, 2 versions) | API key: not required
Stars: 6 | Downloads: 8,201 | Installs: 59

Overview

Kubernetes Agent Swarm — Platform Operations

A multi-agent system for Kubernetes and OpenShift platform operations. Seven specialized agents work together as a coordinated swarm.

Runtime Requirements

| Requirement | Required | Description |
|-------------|----------|-------------|
| kubectl | ✅ Yes | Kubernetes CLI, must be in PATH |
| oc | Optional | OpenShift CLI, needed for OCP/ROSA/ARO |
| helm | Optional | For GitOps agent Helm operations |
| jq | Optional | For JSON output parsing |
| KUBECONFIG | ✅ Yes | Cluster access via env var or ~/.kube/config |

Optional cloud CLIs (aws, az, gcloud, rosa) — only needed for managed cluster operations.
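
The requirements above can be checked before a session starts. The following is a minimal sketch, not something shipped with the skill: `missing_tools` and `has_kubeconfig` are hypothetical helper names, and the only assumption is a POSIX shell with the `command -v` builtin.

```shell
# Hypothetical helper (not part of the skill): print each named tool
# that cannot be found on PATH, one per line.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Hypothetical helper: succeed only if cluster credentials look locatable,
# either through a non-empty $KUBECONFIG or the default ~/.kube/config file.
# It does not validate the file contents or contact the cluster.
has_kubeconfig() {
  [ -n "${KUBECONFIG:-}" ] || [ -f "${HOME:-}/.kube/config" ]
}
```

For example, `missing_tools kubectl` should print nothing on a correctly prepared host, while `missing_tools oc helm jq` lists whichever optional tools are absent.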

Installation

clawhub install kubernetes

Or install individual agents:

clawhub install orchestrator
clawhub install cluster-ops
clawhub install gitops
clawhub install security
clawhub install observability
clawhub install artifacts
clawhub install developer-experience

The Swarm — Agent Roster

| Agent | Code Name | Domain |
|-------|-----------|--------|
| Orchestrator | Jarvis | Task routing, coordination, standups |
| Cluster Ops | Atlas | Cluster lifecycle, nodes, upgrades |
| GitOps | Flow | ArgoCD, Helm, Kustomize, deploys |
| Security | Shield | RBAC, policies, secrets, scanning |
| Observability | Pulse | Metrics, logs, alerts, incidents |
| Artifacts | Cache | Registries, SBOM, promotion, CVEs |
| Developer Experience | Desk | Namespaces, onboarding, support |

How It Works

This is an instruction-only skill. Agents receive markdown instructions describing what commands to run and how to interpret output. No executable scripts are included — the agent translates instructions into actions using the host's installed CLI tools.

Session Setup

Before using the swarm, establish cluster context:

# Verify access
kubectl cluster-info
kubectl get nodes

# For OpenShift
oc status

Agent Communication

Agents communicate via @mentions in shared task comments:

@Shield Please review the RBAC for payment-service v3.2 before I sync.
@Pulse Is the CPU spike related to the deployment or external traffic?
@Atlas The staging cluster needs 2 more worker nodes.
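
How a host might pick the addressed agents out of such a comment is not specified by the skill. As one possible sketch, the hypothetical helper `mentioned_agents` below extracts the known code names with `grep -o` (available in GNU and BSD grep); the `-w` flag keeps a word such as `@Flowers` from matching `@Flow`.

```shell
# Hypothetical helper: list the agents @mentioned in a task comment,
# one code name per line, in order of appearance.
# Only the seven code names from the roster are recognised.
mentioned_agents() {
  printf '%s\n' "$1" \
    | grep -owE '@(Jarvis|Atlas|Flow|Shield|Pulse|Cache|Desk)' \
    | tr -d '@'
}
```

Calling `mentioned_agents "@Shield Please review the RBAC for payment-service v3.2 before I sync."` prints `Shield`.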

Escalation Path

  1. Agent detects issue
  2. Agent attempts resolution within guardrails
  3. If blocked → @mention another agent or escalate to human
  4. P1 incidents → all relevant agents auto-notified
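
The escalation steps above can be read as a small decision function. This is only an illustrative sketch: the function name `next_step`, the output labels, and the choice to fan out P1 incidents regardless of the agent's own outcome are assumptions, not behaviour documented by the skill.

```shell
# Hypothetical sketch of the escalation path.
# $1 = priority label (only "P1" is named in the text; others are assumed)
# $2 = "resolved" or "blocked", the result of the agent's own attempt
#      within its guardrails (step 2).
next_step() {
  priority=${1:-}
  outcome=${2:-}
  if [ "$priority" = "P1" ]; then
    echo "notify-all-relevant-agents"          # step 4
  elif [ "$outcome" = "blocked" ]; then
    echo "mention-agent-or-escalate-to-human"  # step 3
  else
    echo "done"                                # step 2 succeeded
  fi
}
```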

Heartbeat Schedule

*/5  * * * *  Atlas, Pulse, Shield     (fast response: incidents, alerts, CVEs)
*/10 * * * *  Flow, Cache              (scheduled: deploys, promotions)
*/15 * * * *  Desk, Orchestrator       (batch: onboarding, standups)
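
In cron syntax, a minute field of `*/N` fires whenever the minute of the hour is divisible by N. A minimal sketch of that rule applied to the schedule above, using a hypothetical helper named `agents_due`:

```shell
# Hypothetical helper: given the minute of the hour (0-59), print the
# agents whose heartbeat is due under the */5, */10, */15 schedule.
agents_due() {
  # Strip one leading zero so values like "08" are not parsed as octal.
  minute=${1#0}
  minute=${minute:-0}
  due=""
  [ $((minute % 5)) -eq 0 ]  && due="$due Atlas Pulse Shield"
  [ $((minute % 10)) -eq 0 ] && due="$due Flow Cache"
  [ $((minute % 15)) -eq 0 ] && due="$due Desk Orchestrator"
  # Drop the leading space left by the first append.
  echo "${due# }"
}
```

For instance, `agents_due "$(date +%M)"` reports which agents would wake at the current minute; at minute 0 of each hour all seven are due at once.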

Agent Capabilities

What Agents CAN Do

  • Read cluster state (kubectl get, kubectl describe, oc get)
  • Deploy via GitOps (argocd app sync, Flux reconciliation)
  • Create documentation and reports
  • Investigate and triage incidents
  • Provision standard resources (namespaces, quotas, RBAC)
  • Run health checks and audits
  • Query metrics and logs

What Agents CANNOT Do (Human-in-the-Loop Required)

  • Delete production resources
  • Modify cluster-wide policies
  • Make direct changes to secrets without rotation workflow
  • Perform irreversible cluster upgrades
  • Approve production deployments (can prepare, human approves)
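
The skill states these guardrails in prose and ships no enforcement code. As a deliberately conservative sketch, a host could pre-classify a command before running it. The helper name `classify_command` and its two labels are hypothetical, and the rule is stricter than the lists above: only the read-only verbs named there (`get`, `describe`) pass, and everything else, including permitted actions such as GitOps syncs, is routed to review.

```shell
# Hypothetical guardrail sketch: classify a command passed as separate words,
# e.g. classify_command kubectl get pods
# A flag placed before the verb (kubectl -n foo get pods) is not parsed
# and therefore falls through to review, which errs on the safe side.
classify_command() {
  cli=${1:-}
  verb=${2:-}
  case "$cli" in
    kubectl|oc) ;;
    *) echo "needs-human-review"; return 0 ;;
  esac
  case "$verb" in
    get|describe) echo "read-only" ;;
    *)            echo "needs-human-review" ;;
  esac
}
```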

Key Principles

  • Roles over genericism — Each agent has a defined domain
  • Files over mental notes — Only files persist between sessions
  • Human-in-the-loop — Critical actions require approval
  • Guardrails over freedom — Define what agents can and cannot do
  • Audit everything — Every action logged

File Structure

kubernetes/
├── SKILL.md                    # This file — combined swarm
├── AGENTS.md                   # Swarm configuration and protocols
├── skills/
│   ├── orchestrator/SKILL.md   # Jarvis — task routing
│   ├── cluster-ops/SKILL.md    # Atlas — cluster operations
│   ├── gitops/SKILL.md         # Flow — GitOps
│   ├── security/SKILL.md       # Shield — security
│   ├── observability/SKILL.md  # Pulse — monitoring
│   ├── artifacts/SKILL.md      # Cache — artifacts
│   └── developer-experience/SKILL.md  # Desk — DevEx
├── memory/MEMORY.md            # Long-term agent memory
├── working/WORKING.md          # Session progress
└── logs/LOGS.md                # Action audit trail
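
Because only files persist between sessions and every action is meant to be logged, an agent needs some way to append to logs/LOGS.md. The entry format is not specified in this document, so the sketch below assumes a simple one-line layout (UTC timestamp, agent code name, action); `log_action` is a hypothetical helper name.

```shell
# Hypothetical helper: append one audit line to a log file.
# $1 = log file path, $2 = agent code name, remaining args = action taken.
# The line layout is an assumption, not the skill's documented format.
log_action() {
  log_file=$1
  agent=$2
  shift 2
  mkdir -p "$(dirname "$log_file")"
  printf '%s [%s] %s\n' "$(date -u +%Y-%m-%dT%H:%M:%SZ)" "$agent" "$*" >> "$log_file"
}
```

Usage might look like `log_action kubernetes/logs/LOGS.md Atlas kubectl get nodes`, which appends a single timestamped line and never rewrites earlier entries.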

Detailed Agent Documentation

See individual SKILL.md files for each agent's full capabilities, personality, and workflow instructions.

Version History

2 versions in total

  • v2.1.0 (current)
    2026-04-30 06:17
  • v1.0.1
    2026-03-27 23:55, scan status: safe, safe

Security Scans

Tencent Cloud Security (Keen): queued

Tencent Cloud Security (Sanbu): queued
