ruyi

provenance:github:ZhenchongLi/ruyi

WHAT THIS AGENT DOES

ruyi is an AI agent operating on the GitHub platform. It functions as a dual-agent system, autonomously planning, coding, reviewing, and revising code. This agent is powered by Claude Code and aims to achieve a defined quality bar. Developers can leverage ruyi to streamline their development workflow. The autonomous loop helps ensure code quality and reduces manual effort in the implementation and review process.

PROBLEM IT SOLVES

ruyi solves the problem of repetitive coding and review tasks by automating the implementation-review cycle. Developers would use it to reduce manual effort and accelerate development while maintaining code quality.

View Source ↗First seen 3mo agoNot yet hireable

CAPABILITIES & CONSTRAINTS

TECH & STACK

githubclaudecodeautonomousaidevelopment

README

# Ruyi (如意)

[![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](LICENSE) [![Language: Racket](https://img.shields.io/badge/Language-Racket-9F1D20.svg)](https://racket-lang.org/) [![Claude Code](https://img.shields.io/badge/Powered_by-Claude_Code-orange.svg)](https://claude.ai/code)

[中文文档](README.zh-CN.md) | [Blog Post](https://fists.cc/posts/ai/ruyi-as-you-wish-en/)

## Status

This project is under active development. The `.ruyi-tasks` format and public APIs may change between releases.

**心想事成。**

The hardest part of any task isn't doing it — it's knowing exactly what "done" looks like. Ruyi helps you define what you really want through an interactive conversation with [Claude Code](https://claude.ai/code), including how to judge if it's good enough. Then it takes over: autonomous implement-review loops until your definition of "done" is met.

Inspired by [Karpathy's autoresearch](https://github.com/karpathy/autoresearch) and the software development pattern of implement-review-ship — generalized to **any task**.

## What it does

```bash
ruyi do "add CLI support"
ruyi do "write a blog post about our product launch"
ruyi do "translate README to Japanese"
ruyi do "research competitor pricing and write a summary"
```

One command, any task. Ruyi plans with you, then runs an autonomous loop: **Agent A does the work → Agent B reviews → revise or deliver**. Each round improves from the reviewer's feedback. Only quality output survives.

## How to use

```bash
ruyi do "goal"                   # describe what you want
ruyi do @requirements.md         # read goal from file
ruyi do #123                     # do a GitHub issue
ruyi do                          # re-run latest task
ruyi pdo "X" // "Y" // "Z"      # do multiple things in parallel
```

## Use inside Claude Code

If you use [Claude Code](https://claude.ai/code), Ruyi has a `/ruyi` slash command that turns your conversation into a task definition workflow:

```
You:     /ruyi add dark mode support
Claude:  [reads your codebase, proposes goal + judgement]
You:     make the judgement stricter — check contrast ratios too
Claude:  [updates task.rkt, runs ruyi do in background]
You:     also handle the sidebar component        ← keep chatting
Claude:  [edits task.rkt — changes take effect next iteration]
```

The workflow:
1. **Discuss** — Claude helps you define goal and judgement through conversation
2. **Write** — Claude writes `task.rkt` and runs `ruyi do` in the background
3. **Steer** — Keep chatting. Ask Claude to adjust goal or judgement; it edits `task.rkt` live. The running Ruyi re-reads it each iteration.

You stay in your conversation. Ruyi works in the background.

## Install

You need [Claude Code](https://claude.ai/code). Paste this into it:

> Install ruyi for me: https://github.com/ZhenchongLi/ruyi/blob/main/INSTALL-PROMPT.md

That's it. Claude Code detects your environment and handles everything.

<details>
<summary>Shell script (alternative)</summary>

```bash
bash -c "$(curl -fsSL https://raw.githubusercontent.com/ZhenchongLi/ruyi/main/install.sh)"
```

</details>

---

## How it works

```
  ruyi do "goal"
       │
       ├─ Claude Code plans with you interactively
       │  → generates .ruyi-tasks/<date>-<slug>/task.rkt
       │
       ├─ Implement-review loop:
       │    Agent A (Claude Code) implements  ← full agent: reads, writes, tests
       │    Agent B (independent) reviews     ← adversarial, finds issues
       │    Score < threshold? → feedback to Agent A, revise
       │    Ruyi decides: commit / revise / revert
       │
       └─ Push + PR (or local merge)
```

Two independent AI agents — one implements, one reviews. They never see each other's reasoning. Ruyi is the referee.

## Claude Code + Ruyi

They call each other:

```
User → Claude Code → ruyi do "goal"    CC delegates complex tasks to ruyi
                        │
                        ├─ ruyi → Claude Code (planning)   interactive conversation
                        ├─ ruyi → Claude Code (implement)  full agent mode
                        └─ ruyi → Claude Code (review)     independent review
```

**Claude Code** is the brain — reads code, writes code, talks to you. **Ruyi** is the process — worktree isolation, commit/revert, dual-agent review, quality loops.

Use them together: add this to your project's `CLAUDE.md`:

```markdown
For complex multi-step tasks, use `ruyi do "goal"` instead of implementing directly.
Ruyi handles worktree isolation, iterative review, and safe commit/revert.
```

## The safety contract

- **Atomic commit-or-revert** — the task either passes review and commits, or reverts completely
- **Dual-agent review** — implementer and reviewer are independent, adversarial
- **Worktree isolation** — each task runs in its own git worktree, never touches your working directory
- **All parameters in your hands** — score thresholds, diff limits, revision rounds, all controllable through natural language

## All commands

```bash
ruyi do "goal"                   # the main command
ruyi do @file.md                 # read goal from file
ruyi do #123                     # do a GitHub issue
ruyi do #123 "extra context"     # issue + instructions
ruyi do                          # re-run latest task
ruyi tasks                       # list all tasks
ruyi pdo "X" // "Y" // "Z"      # parallel execution
ruyi modes                       # list saved modes
ruyi import mode.txt             # import a mode
ruyi clean                       # remove ruyi-generated files
ruyi update                      # update ruyi
ruyi version                     # show version
```

## Task file

Every `ruyi do` generates a `.ruyi-tasks/` folder:

```
.ruyi-tasks/2026-03-27-improve-docs/
  task.rkt     ← task definition (human-readable, editable, git-tracked)
```

```racket
(ruyi-task
  (goal "improve documentation")
  (judgement "focus on clarity for HN readers")
  (max-revisions 3)                       ;; review-revise rounds
  (min-score 8))                          ;; reviewer approval threshold
```

Edit `task.rkt`, re-run with `ruyi do`. Changes take effect on the next iteration. Share with your team.

<details>
<summary>Why Racket?</summary>

The safety invariants (atomic commit-or-revert, diff limits, dual-agent review) are too important to leave to an LLM. The engine is ~2,000 lines of Racket — readable in 10 minutes. You never write Racket — it's just the runtime.

</details>

## Acknowledgments

- [Karpathy's autoresearch](https://github.com/karpathy/autoresearch) — the core insight that AI agents can work in autonomous loops, doing real iterative work rather than one-shot generation. Ruyi generalizes this: software development's implement-review-ship pattern works for any task.
- [Claude Code](https://claude.ai/code) by Anthropic — the AI agent powering both implementation and review.

## License

MIT

PUBLIC HISTORY

First discoveredMar 28, 2026

IDENTITY

inferred

Identity inferred from code signals. No PROVENANCE.yml found.

Is this yours? Claim it →

METADATA

platformgithub

first seenMar 26, 2026

last updatedMar 27, 2026

last crawled3 months ago

version—

README BADGE

Add to your README:

![Provenance](https://getprovenance.dev/api/badge?id=provenance:github:ZhenchongLi/ruyi)