Merged
Conversation
Agent-Logs-Url: https://github.com/githubnext/tsessebe/sessions/ed345910-c9ba-4e1d-9822-6c656c5c9a62 Co-authored-by: mrjf <180956+mrjf@users.noreply.github.com>
Agent-Logs-Url: https://github.com/githubnext/tsessebe/sessions/ed345910-c9ba-4e1d-9822-6c656c5c9a62 Co-authored-by: mrjf <180956+mrjf@users.noreply.github.com>
Copilot
AI
changed the title
[WIP] Add Test-Driven as the second autoloop strategy
Add Test-Driven autoloop strategy
Apr 23, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Ships a second autoloop strategy alongside AlphaEvolve. Where AlphaEvolve drives optimization (evolve an artifact against a scalar fitness), Test-Driven drives specification (pin behaviour in a failing test, implement until green) — covering the API-porting and bug-fix workloads that don't fit AlphaEvolve.
Layout
Mirrors
.autoloop/strategies/alphaevolve/so the two feel like siblings. No workflow changes needed — strategy discovery already routes offprogram.md's## Evolution Strategypointer.Files
strategy.md— runtime playbook. 8-step loop: load state → pick target → red → green → refactor → evaluate → update Test Harness → fold through to default loop. Includes deterministic overrides (failing tests are obligations; 3× error → forcedrethink-test), a 5-attempt cap that pauses the program withtd-stuck:rather than thrashing, and the Test Harness schema (t{NNN}ids, status transitions, range-collapse compression atharness_size_cap).CUSTOMIZE.md— creator-time guide (parallels alphaevolve's). When-to-pick rules, copy commands, the## Evolution Strategypointer block to drop intoprogram.md, marker-by-marker guidance, and a workedtsb-bugfixexample.prompts/write-test.md— red-phase framing: domain-knowledge slot, framework-setup slot, validity checklist (must fail with a useful message, must not couple to implementation details), reasoning template.prompts/make-green.md— green-phase framing: minimum-change discipline, explicit don't-modify list, anti-patterns (overfitting, speculative generality, weakening tests to shrink the diff).prompts/refactor.md— refactor-phase framing, gated on a green suite. Refactor vocabulary, an explicit not-a-refactor list (renaming for taste, formatting, comment polish), skip conditions, and the hard rule that the test set must be identical before/after — if a "refactor" needs a test edit, it's a behaviour change and belongs in a red iteration.All
<CUSTOMIZE: …>markers are preserved as fill-in points;grep -R "<CUSTOMIZE" strategy/should return nothing once an adopter resolves them for a specific program.