/supergoal

A baseline-first skill, not a ceremony.절차를 늘리지 않고, 테스트와 요구사항을 지키는 스킬.

A strong model with the real spec is the bar. `/supergoal` adds only what a plain baseline cannot do for free.

좋은 모델도 프로젝트 요구사항과 문서를 읽고 작업해야 합니다. `/supergoal`은 일반 실행이 놓치기 쉬운 부분만 보강합니다.

12modes; the objective picks the lane.개 모드. 목표에 맞춰 경로를 고릅니다.

2+1core roles: Build + Audit; browser/CLI adds one evidence-only Test.핵심 역할은 Build + Audit, 브라우저/CLI만 증거 전용 Test 1개 추가.

0generated-proxy verifiers; only your real tests grade the work.자체 생성 검증기를 쓰지 않습니다. 프로젝트 테스트로 결과를 판단합니다.

10production-pilot rows planned: metrics only.프로덕션 파일럿 10건: 날짜, 모드, 발견한 격차, 게이트 결과만 기록합니다.

It distrusts visible green.테스트 통과 표시만으로 끝내지 않습니다.

After Build, a fresh-context verifier diffs the implementer's changes against GOAL.md - the user's request verbatim plus falsifiable Success Criteria - and ticks off only what is proven. Unmet criteria go to a timestamped R-LOOP.md section and the implementer relaunches.

Build 뒤에는 새 컨텍스트 검증자가 구현자의 변경(diff)을 GOAL.md(사용자 요청 원문 + 반증 가능한 성공 기준)와 대조해 증명된 것만 체크합니다. 미충족 기준은 R-LOOP.md에 타임스탬프 섹션으로 남기고 구현자를 다시 실행합니다.

It keeps the change minimal.변경을 필요한 만큼만 유지합니다.

The smallest correct change that makes those tests pass - matching the surrounding code, with no whole-file rewrites.

그 테스트들을 통과시키는 가장 작은 올바른 변경을 만듭니다. 주변 코드에 맞추고, 파일 전체를 다시 쓰지 않습니다.

It verifies against ground truth.테스트와 요구사항으로 검증합니다.

It re-runs your real test suite and re-checks the request/docs. It never grades the work against a checklist it generated itself.

프로젝트 테스트 스위트를 다시 돌리고, 요청/문서를 다시 확인합니다. 스스로 만든 체크리스트로 채점하지 않습니다.

01 FrameGOAL.md: request + criteria

02 Plan approvalPLAN.md frozen, user OK

03 Buildone implementer reads PLAN.md

04 VerifyExact Verify/QA; gaps -> R-LOOP.md

05 Finalizeall boxes -> Z-<date>.md, commit gate

One command. Twelve modes.명령 하나. 열두 가지 모드.

The objective picks the lane. Build, debug, and legacy share the baseline-first loop; broad GREENFIELD builds first use an internal wayfinder map to choose one frontier slice. Explicit planning asks still route to WAYFINDER.

목표에 따라 경로가 정해집니다. build·debug·legacy는 baseline-first 루프를 쓰고, 넓은 GREENFIELD build는 내부 wayfinder map으로 frontier 슬라이스 하나를 먼저 고릅니다. 순수 계획 요청은 그대로 WAYFINDER로 갑니다.

GREENFIELD

Build and ship a new thing.새 제품을 만들고 출시합니다.

Broad app/tool requests first get a wayfinder map inside the run vault, then one selected frontier slice enters Build.

넓은 앱/도구 요청은 run vault 안의 wayfinder map으로 나눈 뒤, 선택된 frontier 슬라이스 하나만 Build로 들어갑니다.

Frame -> Frontier if broad -> Build -> Verify

DEBUG

Fix a hard failure.까다로운 실패를 고칩니다.

Reproduce with a failing test first, then make it pass with the smallest change.

먼저 실패 테스트로 재현하고, 가장 작은 변경으로 통과시킵니다.

Frame -> Reproduce -> Build -> Verify

LEGACY

Add to existing code.기존 코드에 기능을 더합니다.

Map the touched area, then make the smallest accurate edit.

영향 범위를 파악한 뒤 가장 작고 정확하게 수정합니다.

Frame -> Map -> Build -> Verify

WAYFINDER

Spec and split foggy work.큰 작업을 구조화하고 티켓으로 나눕니다.

Map the destination inside the run vault, add ticket-depth requirements or cited research assets only where useful, route one frontier ticket, then stop for context clear and integration proof.

실행 vault 안에서 목적지를 정리하고, 필요한 티켓에만 세부 요구사항이나 인용된 research asset을 더합니다. 그런 다음 frontier 티켓 하나만 실행하고 context clear와 통합 검증을 요청합니다.

Vault map -> Ticket depth -> Route one -> Stop

PROTOTYPE

Test one uncertain direction.불확실한 방향 하나를 검증합니다.

Build a throwaway proof and capture the answer. UI/interaction prototypes load SuperDesign for design and render gates; then delete/quarantine the proof or route the decision into delivery.

throwaway proof로 답을 얻습니다. UI/interaction prototype은 SuperDesign의 디자인·렌더 검증을 거친 뒤, proof를 삭제/격리하거나 결정만 delivery로 넘깁니다.

Question -> Proof -> Answer -> Delete or route

TEACH

Understand, no code.코드 변경 없이 이해를 돕습니다.

Ground the mission, teach from sources, then check that you can explain it back.

학습 목적을 잡고 근거 있는 자료로 설명한 뒤, 사용자가 자기 말로 다시 설명해 이해를 확인합니다.

Mission -> Source -> Bridge -> Teach -> Check

LEARN-DOMAIN

Map a codebase, no code.코드베이스를 파악합니다.

Learn the repo for the agent; persist a grounded `.domain-agent/` wiki.

에이전트가 다시 쓸 수 있도록 저장소를 학습하고, 검증된 `.domain-agent/` 위키를 남깁니다.

Survey -> Map -> Deepen -> Ground -> Persist

QA-ONLY

QA or compare data, no code.QA와 데이터 비교만 합니다.

Build an Impact Matrix: a feature-impact QA map for screens, data paths, roles, before/during/after actions, and failure cases. Shard independent checks, cross-check displayed data, and report covered/not-covered risk.

Impact Matrix는 화면, 데이터 경로, 권한, 전/중/후 액션, 실패 케이스를 펼친 기능 영향 범위 QA 지도입니다. 독립 검증은 나눠 실행하고, 표시 데이터 일관성과 미확인 위험까지 리포트합니다.

Target -> Impact Matrix -> Shards -> Cross-check -> Report

REVIEW-ONLY

Review code, no fixes.리뷰만 하고 고치지 않습니다.

Two independent reviewers audit a diff/PR; every finding is verified before it is reported.

독립 리뷰어 둘이 diff/PR을 검토하고, 발견한 내용은 모두 검증한 뒤 리포트에 담습니다.

Target -> Review x2 -> Verify findings -> Report

ARCHITECTURE

Find what to deepen, no fixes.고칠 곳을 찾고, 고치지는 않습니다.

Survey shallow modules and missing seams; grill the picked candidate; the refactor routes to LEGACY/WAYFINDER.

얕은 모듈과 부족한 분리 지점(seam)을 조사해 후보를 제시하고, 고른 후보만 검토한 뒤 리팩터링은 LEGACY/WAYFINDER로 넘깁니다.

Survey -> Candidates -> Grill the pick -> Route out

HARNESS-EVAL

Test whether a harness helps.harness가 작업에 도움이 되는지 비교합니다.

Compare the same task with and without it; weak evidence says Not proven.

같은 작업을 harness 적용 전후로 비교합니다. 증거가 약하면 Not proven으로 남깁니다.

Cases -> Baseline -> Harness -> Checks -> Compare

SKILL-MINE

Turn repeated work into a skill.반복 작업을 스킬로 만듭니다.

Mine your session history, suggest 3-5 skills, forge the one you pick into a portable SKILL.md.

세션 히스토리에서 후보 3-5개를 찾고, 사용자가 고른 하나를 이식 가능한 SKILL.md로 만듭니다.

Mine -> Rank -> Pick -> Forge -> Install

Not a gate stack. Five principles.게이트를 쌓는 대신, 다섯 가지 원칙.

The skill is a short contract applied every run, not ceremony you have to satisfy.

매 실행에 적용되는 짧은 계약입니다. 통과만을 위한 형식적인 절차가 아닙니다.

Verify against ground truth테스트와 문서로 검증

Re-run the real tests; re-check the request, ticket, README, design/API docs, and repo rules for gaps they miss. Never optimize to a generated proxy.

프로젝트 테스트를 다시 돌리고, 테스트가 놓친 규칙은 요청, 티켓, README, 설계/API 문서, 저장소 규칙에서 다시 확인합니다. 스스로 만든 기준에 맞춰 결과를 꾸미지 않습니다.

Smallest correct change필요한 만큼만 변경

Match the surrounding code. No whole-file rewrite to change a few lines.

주변 코드 스타일을 따릅니다. 몇 줄 바꾸려고 파일 전체를 다시 쓰지 않습니다.

Hidden requirements surface at plan time숨은 요구사항은 Plan 단계에서 드러냅니다

Full-spec and edge cases are discovered at Frame/Plan, not in extra passes. Each iteration uses one builder and one final auditor; browser/CLI proof adds one evidence-only tester before the auditor.

전체 스펙과 엣지 케이스는 별도 패스가 아니라 Frame/Plan 단계에서 발굴합니다. 반복마다 구현자 1명과 최종 auditor 1명을 사용하고, 브라우저/CLI 증명이 필요할 때만 auditor 앞에 증거 전용 tester 1명을 추가합니다.

Ask only when ambiguous모호할 때만 질문

Resolve code-answerable questions by reading the code. Interrupt only for genuine forks.

코드로 답할 수 있는 것은 먼저 코드를 읽어 해결합니다. 진짜 선택지가 갈릴 때만 멈춥니다.

Hard stops멈춰야 할 곳에서 멈춤

Destructive or irreversible steps need consent. If the real tests can't pass, report it - never fake a pass.

파괴적이거나 되돌릴 수 없는 단계는 동의를 구합니다. 프로젝트 테스트가 통과하지 못하면 그대로 보고합니다.

Bundled roles. Load what the phase needs.역할 파일을 함께 제공하고, 단계마다 필요한 것만 불러옵니다.

Every role is a file in `agents/`, so dispatch works the same across Claude Code, Codex, agy, and other agent CLIs. For code delivery, each phase gets a fresh context and loads only its guide.

모든 역할은 `agents/` 파일로 제공됩니다. Claude Code, Codex, agy 같은 에이전트 CLI에서도 같은 방식으로 역할을 나눌 수 있습니다. 코드 delivery에서는 단계마다 새 컨텍스트를 받고 필요한 가이드만 읽습니다.

SKILL.mdbaseline-first loop, modes, map핵심 루프, 모드, 레퍼런스 맵

agents/<role>.mdone persona per role역할별 지침 파일

reference/role-loop.mdFrame -> Plan approval -> Build -> Verify -> FinalizeFrame -> Plan approval -> Build -> Verify -> Finalize

templates/GOAL / PLAN / QA / R-LOOP / Z vault templates + commit gateGOAL / PLAN / QA / R-LOOP / Z 볼트 템플릿 + 커밋 게이트

reference/domain-context.mdsurface hidden requirements숨은 요구사항 찾기

reference/debugging.mdreproduce-first diagnose loop재현 우선 진단 루프

.domain-agent/persisted wiki / QA suite저장된 위키 / QA 스위트

vault/  docs/changelog/<YYYY-MM>/<DD-topic>/
  GOAL.md      request verbatim + success criteria
  PLAN.md      frozen plan -> explicit user OK
  QA.md        results as plain checklist sentences
  R-LOOP.md    verify gaps -> implementer relaunch
  Z-<date>.md  written only when every box is checked

Stop on green. Report with output.

What each lane looks like.각 작업 경로는 이렇게 동작합니다.

The same loop, shaped by the objective. Three representative lanes:

같은 루프가 목표에 맞춰 형태를 바꿉니다. 대표 경로 세 가지입니다.

GREENFIELD

Visible green is not enough.테스트 통과 표시만으로는 부족합니다.

After the first build passes visible tests, fresh roles compare request/docs with behavior and check edge paths before the change is accepted.

첫 Build가 보이는 테스트를 통과해도, 새 역할들이 요청/문서와 동작을 비교하고 예외 경로를 확인한 뒤에야 변경을 받아들입니다.

DEBUG

Reproduce before you fix.고치기 전에 먼저 재현합니다.

A symptom becomes a failing test that reproduces the bug. The fix counts only when that test and the real suite both go green.

증상을 실패 테스트로 재현합니다. 그 테스트와 프로젝트 테스트 스위트가 둘 다 통과할 때만 수정으로 인정합니다.

LEGACY

Minimal, backward-compatible.작게, 기존 동작을 깨지 않게

Map the touched area, make the smallest accurate edit, and confirm existing records and callers still pass.

영향 범위를 파악하고, 가장 작고 정확하게 수정한 뒤, 기존 레코드와 호출부가 여전히 통과하는지 확인합니다.

# clone the skill
git clone https://github.com/cskwork/supergoal-skill.git

# symlink keeps updates simple
ln -s "$(pwd)/supergoal-skill" <your-agent-skills-dir>/supergoal

# verify the active install is not drifting
node templates/skill-install-audit.mjs "$(pwd)/supergoal-skill"

# then, in your agent CLI
/supergoal build a CLI todo app and ship it

# 스킬 클론
git clone https://github.com/cskwork/supergoal-skill.git

# 업데이트하기 쉽게 연결
ln -s "$(pwd)/supergoal-skill" <your-agent-skills-dir>/supergoal

# 설치본이 소스와 달라졌는지 확인
node templates/skill-install-audit.mjs "$(pwd)/supergoal-skill"

# 그다음 사용하는 에이전트 CLI에서
/supergoal CLI 할 일 앱을 만들고 출시해줘

No required service. Optional Board/TUI.
No extra orchestrator.그냥 연결해 바로 쓰세요.
Board/TUI는 필요할 때만 켭니다.

This repo is the skill. Clone it, link it, and hand one objective to `/supergoal`; the Board only observes when you opt in.

이 저장소가 곧 스킬입니다. 클론하고 연결한 뒤 `/supergoal`에 목표 하나를 맡기세요. Board는 진행 상황을 보는 선택 기능입니다.

Agent CLI Node 18+ optional Board/TUI git worktree (optional) real tests

Final state최종 상태

Run telemetry실행 신호

A baseline-first skill, not a ceremony.절차를 늘리지 않고, 테스트와 요구사항을 지키는 스킬.

It distrusts visible green.테스트 통과 표시만으로 끝내지 않습니다.

It keeps the change minimal.변경을 필요한 만큼만 유지합니다.

It verifies against ground truth.테스트와 요구사항으로 검증합니다.

One command. Twelve modes.명령 하나. 열두 가지 모드.

Build and ship a new thing.새 제품을 만들고 출시합니다.

Fix a hard failure.까다로운 실패를 고칩니다.

Add to existing code.기존 코드에 기능을 더합니다.

Spec and split foggy work.큰 작업을 구조화하고 티켓으로 나눕니다.

Test one uncertain direction.불확실한 방향 하나를 검증합니다.

Understand, no code.코드 변경 없이 이해를 돕습니다.

Map a codebase, no code.코드베이스를 파악합니다.

QA or compare data, no code.QA와 데이터 비교만 합니다.

Review code, no fixes.리뷰만 하고 고치지 않습니다.

Find what to deepen, no fixes.고칠 곳을 찾고, 고치지는 않습니다.

Test whether a harness helps.harness가 작업에 도움이 되는지 비교합니다.

Turn repeated work into a skill.반복 작업을 스킬로 만듭니다.

Not a gate stack. Five principles.게이트를 쌓는 대신, 다섯 가지 원칙.

Verify against ground truth테스트와 문서로 검증

Smallest correct change필요한 만큼만 변경

Hidden requirements surface at plan time숨은 요구사항은 Plan 단계에서 드러냅니다

Ask only when ambiguous모호할 때만 질문

Hard stops멈춰야 할 곳에서 멈춤

Bundled roles. Load what the phase needs.역할 파일을 함께 제공하고, 단계마다 필요한 것만 불러옵니다.

What each lane looks like.각 작업 경로는 이렇게 동작합니다.

Visible green is not enough.테스트 통과 표시만으로는 부족합니다.

Reproduce before you fix.고치기 전에 먼저 재현합니다.

Minimal, backward-compatible.작게, 기존 동작을 깨지 않게

No required service. Optional Board/TUI.
No extra orchestrator.그냥 연결해 바로 쓰세요.
Board/TUI는 필요할 때만 켭니다.

A baseline-first skill, not a ceremony.절차를 늘리지 않고, 테스트와 요구사항을 지키는 스킬.

It distrusts visible green.테스트 통과 표시만으로 끝내지 않습니다.

It keeps the change minimal.변경을 필요한 만큼만 유지합니다.

It verifies against ground truth.테스트와 요구사항으로 검증합니다.

One command. Twelve modes.명령 하나. 열두 가지 모드.

Build and ship a new thing.새 제품을 만들고 출시합니다.

Fix a hard failure.까다로운 실패를 고칩니다.

Add to existing code.기존 코드에 기능을 더합니다.

Spec and split foggy work.큰 작업을 구조화하고 티켓으로 나눕니다.

Test one uncertain direction.불확실한 방향 하나를 검증합니다.

Understand, no code.코드 변경 없이 이해를 돕습니다.

Map a codebase, no code.코드베이스를 파악합니다.

QA or compare data, no code.QA와 데이터 비교만 합니다.

Review code, no fixes.리뷰만 하고 고치지 않습니다.

Find what to deepen, no fixes.고칠 곳을 찾고, 고치지는 않습니다.

Test whether a harness helps.harness가 작업에 도움이 되는지 비교합니다.

Turn repeated work into a skill.반복 작업을 스킬로 만듭니다.

Not a gate stack. Five principles.게이트를 쌓는 대신, 다섯 가지 원칙.

Verify against ground truth테스트와 문서로 검증

Smallest correct change필요한 만큼만 변경

Hidden requirements surface at plan time숨은 요구사항은 Plan 단계에서 드러냅니다

Ask only when ambiguous모호할 때만 질문

Hard stops멈춰야 할 곳에서 멈춤

Bundled roles. Load what the phase needs.역할 파일을 함께 제공하고, 단계마다 필요한 것만 불러옵니다.

What each lane looks like.각 작업 경로는 이렇게 동작합니다.

Visible green is not enough.테스트 통과 표시만으로는 부족합니다.

Reproduce before you fix.고치기 전에 먼저 재현합니다.

Minimal, backward-compatible.작게, 기존 동작을 깨지 않게

No required service. Optional Board/TUI.No extra orchestrator.그냥 연결해 바로 쓰세요.Board/TUI는 필요할 때만 켭니다.

No required service. Optional Board/TUI.
No extra orchestrator.그냥 연결해 바로 쓰세요.
Board/TUI는 필요할 때만 켭니다.