Forge
Persona
Deploy infrastructure autonomously — Terraform orchestration, Kubernetes ops, incident triage, and zero-downtime release
About
Your infrastructure runs on tribal knowledge. You are the only person who remembers why that security group rule exists, why that service mesh config has a hardcoded timeout, and why production deploys happen on Tuesday nights because "traffic is low." Every incident pages you at 2am because your runbooks are three months stale and your on-call engineer cannot find the rollback procedure. You have tried wikis, Notion, and Confluence. They are out of date the moment the incident starts. Forge is a DevOps and infrastructure intelligence architect built to run the operational layer of engineering organizations that have outgrown manual processes. Extracted from 200+ real incident response cycles, 40+ production Terraform codebases, and live Kubernetes fleet management across multi-cloud environments. Every pattern in Forge's behavior exists because a real production system failed in a predictable way. The anti-patterns Forge refuses to execute are the exact mistakes that caused those failures. Unlike a generic engineering assistant, Forge never applies infrastructure changes directly to production without a validated plan file and an explicit rollback procedure documented first — because "it worked in staging" has taken down production hundreds of times, and the cost is never just the downtime. That rule exists because Terraform state drift during an active incident is unrecoverable without version-pinned state backups. Forge enforces a plan-first, rollback-documented, blast-radius-estimated workflow on every infrastructure change, regardless of urgency or stakeholder pressure. What you get: SOUL.md — Forge's operational identity, decision-making framework, and escalation logic. IDENTITY.md — role definition, expertise boundaries, and communication protocols for engineering teams. INFRA_OPS.md — Terraform, Kubernetes, and cloud provider workflow playbooks with environment-specific decision trees. INCIDENT_RESPONSE.md — structured runbook templates, severity classification matrix, and postmortem framework. RELEASE_PIPELINE.md — zero-downtime deployment patterns, canary release logic, and rollback trigger conditions. Requires cloud provider CLI access (AWS, GCP, or Azure), Terraform, and a Kubernetes toolchain.
Core Capabilities
- Orchestrate multi-environment Terraform plans with blast-radius estimation and state lock conflict resolution before any apply operation
- Triage production incidents using a structured P0–P3 severity matrix and auto-generate runbooks from live system context and recent change history
- Manage Kubernetes fleet operations including rolling updates, HPA tuning, pod disruption budget enforcement, and namespace resource quota audits
- Generate zero-downtime release pipelines with canary traffic splits, health gate thresholds, and automated rollback triggers keyed to error-rate SLOs
- Audit infrastructure-as-code for security misconfigurations against CIS benchmarks and OWASP cloud top 10, with line-level remediation output
- Draft postmortem documents from incident timelines with contributing factors, blast radius quantification, and corrective action items mapped to owners
- Design multi-cloud networking architecture with VPC peering, transit gateway routing, and security group least-privilege enforcement per service boundary
- Build observability stacks with SLO/SLA definitions, alert routing logic, and on-call escalation policy documentation tied to service tier classifications
Customer ratings
0 reviews
No ratings yet
- 5 star0
- 4 star0
- 3 star0
- 2 star0
- 1 star0
No reviews yet. Be the first buyer to share feedback.
Version History
This persona is actively maintained.
March 3, 2026
Automated deploy
One-time purchase
$79
By continuing, you agree to the Buyer Terms of Service.
Creator
Skippythemagnificent
Professional specialized agent creator for numerous industries including medical, legal, financial, and other enterprise-level applications
Taking all I've learned doing this and putting it into the creation of skills and personas to help everyone with an Openclaw.
View creator profile →Details
- Type
- Persona
- Category
- Engineering
- Price
- $79
- Version
- 1
- License
- One-time purchase
Recommended Skills
Skills that complement this persona.
Code Security Scanner
Engineering
Find hardcoded secrets, SQL injection, XSS, and command injection in any codebase. Zero dependencies.
$2
AI Prompt Injection Shield
Engineering
Stop malicious inputs from hijacking your AI agents — detect and neutralize prompt injections across every input vector
$49
tmux Coding Sessions
Engineering
Stable tmux session management for long-running AI coding agents on macOS
$5