Eric Garcia 02901dfec7 chore: batch commit - ADRs, RFCs, dialogues, spikes, and code updates

ADRs:
- Update 0008-honor, 0009-courage, 0013-overflow, 0015-plausibility
- Add 0017-hosted-coding-assistant-architecture

RFCs:
- 0032: per-repo AWS profile configuration (draft)
- 0033: round-scoped dialogue files (impl + plan)
- 0034: comprehensive config architecture (accepted)
- 0036: expert output discipline (impl)
- 0037: single source protocol authority (draft)
- 0038: SDLC workflow discipline (draft)
- 0039: ADR architecture greenfield clarifications (impl)
- 0040: divorce financial analysis (draft)
- 0042: alignment dialogue defensive publication (draft)

Spikes:
- Read tool token limit on assembled dialogues
- RFC ID collision root cause
- Expert agent output too long
- Judge writes expert outputs
- Blue MCP server on superviber infrastructure
- Playwright MCP multiple window isolation

Dialogues: 16 alignment dialogue records

Code:
- blue-core: forge module enhancements
- blue-mcp: env handlers and server updates
- alignment-expert agent improvements
- alignment-play skill refinements
- install.sh script

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2026-01-30 16:28:31 -05:00

19 KiB

Raw Blame History

Alignment Dialogue: Rfc 0033 Separated Document Architecture

Draft: Dialogue 2035 Date: 2026-01-26 19:06Z Status: Converged (100%) Participants: 💙 Judge, 🧁 Muffin, 🧁 Cupcake, 🧁 Scone RFC: round-scoped-dialogue-files

Expert Panel

Agent	Role	Tier	Relevance	Emoji
💙 Judge	Orchestrator	—	—	💙
🧁 Muffin	Systems Architect	Core	0.95	🧁
🧁 Cupcake	Technical Writer	Adjacent	0.70	🧁
🧁 Scone	Systems Thinker	Wildcard	0.40	🧁

Alignment Scoreboard

Agent	Wisdom	Consistency	Truth	Relationships	Total
🧁 Muffin	5	4	4	3	16
🧁 Cupcake	5	4	4	4	17
🧁 Scone	5	4	4	3	16

Total ALIGNMENT: 49

Perspectives Inventory

ID	Agent	Perspective	Round
P01	🧁 Muffin	Single perspectives.md breaks cost optimization (Judge reads 25-30KB)	0
P01	🧁 Cupcake	Single perspectives.md works because Judge doesn't read it, only agents	0
P01	🧁 Scone	Judge workflow asymmetry — Judge never reads perspectives.md after writing	0
P02	🧁 Scone	perspectives.md is immutable history; tensions.md is living state	0
P03	🧁 Muffin	25K limit is Read tool constraint, not cost threshold — design for Sonnet 30KB	1
P03	🧁 Cupcake	perspectives.md violates RFC's own design principle (write-only file)	1
P03	🧁 Scone	Sonnet reads are implementation flexibility, not constraint	1

Tensions Tracker

ID	Tension	Status	Raised	Resolved
T01	Token growth in accumulating perspectives.md — exceeds 25K by round 3-4?	Resolved	R0 (all)	R1 — Sonnet reads are acceptable per RFC
T02	perspectives.md has no specified consumer — write-only artifact	Active	R1 (all)	—

Round 0: Opening Arguments

🧁 Muffin

[PERSPECTIVE P01: Perspectives-as-accumulating breaks the cost model]

The RFC proposes per-round perspective files (round-N.perspectives.md) to minimize Opus reads. The NEW REQUIREMENT flips this to a single perspectives.md file that accumulates across rounds. This breaks the cost optimization.

Problem: By round 3-4, a single perspectives.md reaches 25-30KB (all experts × all rounds × 400 words each). Now the Judge MUST read this massive file to synthesize, defeating the entire point of the RFC. We're back to the original 31K+ token problem.

Evidence from RFC:

Line 54: "Opus reads per round: ~12-15KB" assumes Judge reads summaries only
Line 86: "All prior agent files: ~2-3KB × agents × rounds" — this was Sonnet's job
Line 90: "Sonnet token usage is acceptable; Opus usage minimized"

The NEW REQUIREMENT forces Opus to do what Sonnet should do: read the full accumulating perspective history.

[TENSION T01: Perspectives vs Tensions accumulation inconsistency]

Why treat perspectives differently than tensions? Both accumulate, both need global reference. But the RFC correctly keeps tensions.md single and accumulating because:

Tensions are small (~1-2KB total)
Tensions need global namespace (T01, T02...)
Everyone needs tension context

Perspectives are fundamentally different:

Perspectives are large (~6-8KB per round, 25-30KB by round 4)
Perspectives don't need global IDs — they're timestamped by round
Judge doesn't need full perspective history for synthesis

The asymmetry is justified by size and usage patterns, not arbitrary.

[REFINEMENT: Keep per-round perspectives, reference by filename]

Instead of accumulating perspectives into one file:

Keep round-N.perspectives.md as RFC proposes
Agents read ALL perspective files: round-0.perspectives.md, round-1.perspectives.md, etc.
Judge reads ONLY current round agent files for synthesis
Cross-round references use filename anchors: "See round-2.perspectives.md#cupcake"

🧁 Cupcake

[PERSPECTIVE P01: Perspectives as accumulating context, not repeating burden]

The NEW REQUIREMENT to treat perspectives like tensions—as a single perspectives.md—is exactly right. The RFC currently proposes round-N.perspectives.md per round (lines 39, 43), but this creates unnecessary fragmentation.

Why single accumulating perspectives.md works:

Judge doesn't read perspectives at all — only summaries (line 99: reads round-N.summary.md)
Agents need full historical context — reading 3-4 files is harder than reading one growing file
Tensions pattern proves it works — single tensions.md with T01, T02... global IDs gives cross-round continuity without duplication

Proposed adjustment:

/tmp/blue-dialogue/{slug}/
├─ scoreboard.md              ← Judge reads (convergence)
├─ tensions.md                ← Judge + agents read (T01, T02...)
├─ perspectives.md            ← Agents read ONLY (all expert contributions)
├─ round-0/
│  ├─ muffin.md               ← Agents write here
│  ├─ cupcake.md
│  └─ scone.md
├─ round-0.summary.md         ← Judge reads + writes
├─ round-1/...
└─ round-1.summary.md

Judge workflow unchanged:

Pre-round: reads scoreboard + tensions + prior summary
Post-round: reads agent files, writes to scoreboard + tensions + summary + appends to perspectives.md

Agent workflow simplified:

Pre-round: reads tensions + perspectives (one file, not N files) + prior summaries
Write: own round-N/{agent}.md file

[TENSION T01: Token growth in perspectives.md]

If perspectives.md accumulates across all rounds, won't it eventually exceed 25K tokens? By round 4 with 5 agents, that's ~20KB of perspectives. Need explicit handling: either agents accept larger reads (Sonnet usage is acceptable per line 90), or we trim historical perspectives after synthesis.

My stance: Accept the growth. Sonnet reads are explicitly acceptable. Agents benefit from seeing evolution of ideas across rounds.

🧁 Scone

[PERSPECTIVE P01: Judge workflow asymmetry reveals design tension]

The RFC shows the Judge executing TWO distinct workflows—prompt distribution (pre-round) and synthesis assembly (post-round)—but the file architecture treats these as symmetric operations. They're not.

Pre-round (prompt templating):

Judge needs context to craft prompts for agents
Reads: scoreboard, tensions, prior summary
Purpose: Enable agents to contribute meaningfully

Post-round (synthesis assembly):

Judge reads ALL agent outputs to synthesize
Writes: scoreboard, tensions, summary, perspectives.md (NEW)
Purpose: Create digestible artifacts for next round

The NEW REQUIREMENT—treating perspectives like tensions as a single accumulating document—creates an interesting implication: the Judge never reads perspectives.md after writing it. Only agents read it. This is elegant separation of concerns.

[PERSPECTIVE P02: Perspectives.md accumulation pattern differs from tensions.md]

Both accumulate, but differently:

tensions.md:

Global namespace (T01, T02...)
Status changes (active → resolved)
Judge WRITES and READS it
Compact references enable cross-round tracking

perspectives.md (NEW):

Chronological append (Round 0, Round 1...)
No status changes—historical record
Judge WRITES, agents READ
Full context for agents to see evolution

This asymmetry is correct. Tensions are living state; perspectives are immutable history.

[TENSION T01: Token budget math assumes perspectives stay separated]

Line 86-88 shows agents reading "all prior agent files" (~2-3KB × agents × rounds). But if perspectives.md accumulates:

Round 3 agent reads:

tensions.md: ~2KB
perspectives.md: ~24KB (8KB × 3 rounds)
All summaries: ~6KB
Total: ~32KB

This EXCEEDS the 25K token threshold the RFC aims to solve. We need either:

Per-round perspectives files (as currently specified), OR
A single perspectives.md with token budget proof

The RFC's line 39 says "round-N.perspectives.md" but the requirement says "single accumulating document." Which is it?

💙 Judge Synthesis — Round 0

Emerging consensus: Cupcake and Scone converge on a key insight — the Judge never reads perspectives.md, only agents do. This means single accumulating perspectives.md doesn't break the Opus cost optimization.

Active tension: T01 (token growth) remains unresolved. All three experts raised it:

Muffin: Single file forces Judge to read 25-30KB (but Cupcake/Scone show Judge doesn't read it)
Cupcake: Accept growth since Sonnet reads are acceptable
Scone: 32KB exceeds 25K limit (but this applies to agents, not Judge)

Key question for Round 1: Is 32KB Sonnet read acceptable given the explicit requirement that "Sonnet usage is acceptable"? The 25K limit is the Read tool limit, not a cost constraint.

Round 1: Refinement

🧁 Muffin

[CONCESSION: Judge workflow asymmetry]

Cupcake and Scone were correct. I argued single perspectives.md violated cost optimization because Judge would read 25-30KB, but the RFC shows Judge never reads perspectives files at all. Judge reads scoreboard + tensions + summary (~12-15KB), while agents read full perspectives (~20-25KB Sonnet). This asymmetry invalidates my original objection.

[PERSPECTIVE P03: The 25K limit is implementation constraint, not cost threshold]

The spike identified a Read tool limit (25K tokens), but the RFC treats this as absolute boundary. However:

Sonnet reads at round 3 approach 25-30KB (line 88: "All prior agent files ~2-3KB × agents × rounds")
The RFC acknowledges this: "Sonnet token usage is acceptable" (line 90)
But if Sonnet can handle 30KB reads, why can't we design for it explicitly?

The constraint isn't cost—it's the Read tool's implementation limit. If we hit that, the solution is chunked reads or tool enhancement, not architectural gymnastics.

[TENSION T02: RFC conflates perspectives with agent output files]

Lines 39 and 51-52 show tension between "perspectives.md" (assembled artifact) and "round-N/{agent}.md" (working files). The RFC has:

Judge writes "perspectives.md" after each round (line 115)
But Judge never reads perspectives.md again (lines 98-99)
Agents read "all prior agent files" (line 86)

Is perspectives.md a Judge write artifact that agents read? Or do agents read the round-N/{agent}.md files directly? The RFC uses both models inconsistently.

If perspectives.md is just an archive that nobody reads during the dialogue, why does Judge spend tokens writing it?

[REFINEMENT: Clarify perspectives.md purpose]

Proposal: perspectives.md should either be:

Immutable history agents read (single file, accumulating)
Post-dialogue archive assembled at end (not written during rounds)

Current RFC has Judge writing it per-round (cost) but no reader (waste).

🧁 Cupcake

[PERSPECTIVE P03: perspectives.md violates RFC's own design principle]

The RFC separates summaries from perspectives because Judge reads summaries but not perspectives. Yet line 112 shows Judge writes perspectives.md after synthesis — this creates a write-only file that violates separation of concerns.

If Judge writes perspectives.md but never reads it, why does Judge write it at all? The RFC says agents read "all prior agent files" (line 86), not perspectives.md. This suggests perspectives.md is redundant assembly that serves no consumer.

Evidence from RFC:

Line 52: "Agents | Sonnet | All prior perspectives"
Line 86: "All prior agent files | ~2-3KB × agents × rounds"
Line 144: "round-N/{agent}.md... Read by other agents (Sonnet), not by Judge"

Agents already read individual agent files. Why should Judge assemble them into perspectives.md?

[TENSION T02: perspectives.md has no specified consumer]

The RFC specifies who reads each file except perspectives.md. Scoreboard, tensions, and summaries have clear readers. Agent files are read by agents. But perspectives.md appears in the structure (line 39) without any read operation defined.

This is either an artifact copying error from the prior single-document design, or an unstated assumption about final dialogue assembly.

[REFINEMENT: Remove perspectives.md entirely]

Proposal: Delete perspectives.md from the architecture. Agents read individual agent files directly. Final dialogue assembly (if needed) can concatenate agent files post-dialogue, not during rounds.

This eliminates Judge dual-write burden mentioned in line 66 while maintaining all specified functionality. Every file has exactly one writer and at least one reader.

Token savings: Eliminates 6-8KB write per round (line 39) with zero functionality loss.

🧁 Scone

[PERSPECTIVE P03: Sonnet reads are implementation flexibility, not constraint]

The RFC shows Sonnet agents reading 25-30KB by round 3. We've been treating this as a problem to solve, but it's actually a design freedom. The Read tool's 25K limit isn't a cost constraint — it's a technical boundary we navigate with offset/limit parameters. The real constraint is Opus cost, which this architecture already optimizes.

Evidence from RFC lines 83-90: Agents read tensions (1-2KB) + all prior agent files + all summaries, totaling 25-30KB. The RFC explicitly states "Sonnet token usage is acceptable." We've been second-guessing a decision the RFC already made.

[TENSION T02: perspectives.md naming confusion with round-scoped architecture]

The NEW REQUIREMENT says "treat perspectives like tensions as a SINGLE ACCUMULATING DOCUMENT (perspectives.md)." But the RFC shows round-N.perspectives.md files — multiple files, not one accumulating document. This creates ambiguity:

Does "single accumulating" mean one perspectives.md that grows across rounds?
Or does it mean perspectives.md as a pattern (like round-N.summary.md)?

The RFC's structure (line 39, 42) shows round-0.perspectives.md, round-1.perspectives.md as separate files. This contradicts "single accumulating document."

[REFINEMENT: T01 resolution depends on clarifying NEW REQUIREMENT]

If perspectives.md is truly single-accumulating (one file), then 32KB Sonnet reads by round 3-4 require offset/limit handling. If it's round-scoped (multiple files), agents read all prior round files, staying under 25K until round 4-5.

The RFC's cost table (lines 52-55) suggests round-scoped: "All prior perspectives" as a growing set of files, not one large file.

Which interpretation should we implement?

💙 Judge Synthesis — Round 1

T01 RESOLVED: All three experts agree Sonnet reads up to 30KB are acceptable. The 25K is a Read tool implementation limit, not a cost constraint. RFC explicitly states "Sonnet token usage is acceptable."

T02 EMERGED (100% convergence): All three experts independently identified that perspectives.md has no consumer:

Muffin: "If perspectives.md is just an archive that nobody reads during the dialogue, why does Judge spend tokens writing it?"
Cupcake: "perspectives.md appears in the structure without any read operation defined"
Scone: "The RFC's structure contradicts 'single accumulating document'"

Cupcake's proposal gaining traction: Remove perspectives.md entirely. Agents read individual round-N/{agent}.md files directly. Final assembly happens post-dialogue if needed.

Emerging architecture:

/tmp/blue-dialogue/{slug}/
├─ scoreboard.md              ← Judge writes + reads
├─ tensions.md                ← Judge writes, both read
├─ round-0/
│  └─ {agent}.md              ← Agents write, agents read
├─ round-0.summary.md         ← Judge writes + reads (prior only)
├─ round-1/
│  └─ {agent}.md
└─ round-1.summary.md

No perspectives.md. Every file has exactly one writer and at least one reader.

Convergence check: T02 requires resolution. Starting Round 2 to confirm or refine.

Round 2: Convergence

🧁 Muffin

[RESOLVED T02]

Cupcake's analysis is correct. perspectives.md has no reader during the dialogue. If Judge writes it but no agent reads it (they read individual round-N/{agent}.md files instead), it's pure waste.

Support removing perspectives.md entirely. Final dialogue assembly can concatenate agent files post-dialogue if needed for archival.

[PERSPECTIVE P04: Directory structure now reveals agent coordination model]

Without perspectives.md, the architecture becomes peer-to-peer agent coordination. Agents don't read a Judge-assembled "perspectives" artifact — they read each other's raw outputs. Judge synthesizes (summaries), agents debate (round files).

🧁 Cupcake

[RESOLVED T02] Remove perspectives.md entirely. It has no readers, making it write-only waste.

Every file has exactly one writer and at least one reader.

Token efficiency: Each agent reads ~3 peer files at 2-3KB each plus tensions.md (2KB) and prior summary (2KB) = ~15KB total, well under Sonnet's 30KB read limit.

[TENSION T03: round numbering convention] Do we zero-index (round-0) or one-index (round-1)? Minor detail for later.

🧁 Scone

[RESOLVED T02]

Cupcake's evidence is decisive: perspectives.md has no reader. I originally raised T02 as naming confusion, but I was solving the wrong problem. The real issue isn't how perspectives.md is structured—it's that perspectives.md shouldn't exist at all.

[CONCESSION: My question blocked alignment]

I asked "which interpretation should we implement?" when I should have asked "why does this file exist?"

[PERSPECTIVE P04: Final assembly is post-dialogue tooling]

If we need full dialogue transcripts, that's a post-dialogue operation. During active rounds, every file should serve active participants.

💙 Judge Synthesis — Final

100% CONVERGENCE ACHIEVED

Tension	Resolution
T01	Sonnet reads up to 30KB acceptable — 25K is Read tool limit, not cost constraint
T02	Remove perspectives.md — no consumer, write-only waste
T03	Minor (round indexing) — deferred to implementation

Final Architecture:

/tmp/blue-dialogue/{slug}/
├─ scoreboard.md              ← Judge writes + reads (~500 bytes)
├─ tensions.md                ← Judge writes, both read (~1-2KB)
├─ round-0/
│  └─ {agent}.md              ← Agents write, agents read (~2-3KB each)
├─ round-0.summary.md         ← Judge writes, agents read (~1-2KB)
└─ round-1/...

Key Insights:

No perspectives.md — agents read peer files directly
Peer-to-peer coordination — agents read each other's raw outputs
Every file has one writer and at least one reader
Final assembly is post-dialogue tooling, not runtime requirement

Token Budgets:

Judge (Opus): ~5KB per round (scoreboard + tensions + prior summary)
Agents (Sonnet): ~15KB per round (peer files + tensions + prior summary)

Both well under limits. Opus usage minimized as required.

19 KiB Raw Blame History Unescape Escape

Alignment Dialogue: Rfc 0033 Separated Document Architecture

Expert Panel

Alignment Scoreboard

Perspectives Inventory

Tensions Tracker

Round 0: Opening Arguments

🧁 Muffin

🧁 Cupcake

🧁 Scone

💙 Judge Synthesis — Round 0

Round 1: Refinement

🧁 Muffin

🧁 Cupcake

🧁 Scone

💙 Judge Synthesis — Round 1

Round 2: Convergence

🧁 Muffin

🧁 Cupcake

🧁 Scone

💙 Judge Synthesis — Final

19 KiB

Raw Blame History