Skip to content

docs(agent): agent-builds-an-app design overview#4921

Draft
mmabrouk wants to merge 2 commits into
big-agentsfrom
docs/agent-builds-an-app-overview
Draft

docs(agent): agent-builds-an-app design overview#4921
mmabrouk wants to merge 2 commits into
big-agentsfrom
docs/agent-builds-an-app-overview

Conversation

@mmabrouk

@mmabrouk mmabrouk commented Jun 28, 2026

Copy link
Copy Markdown
Member

What this is

The map for the "agent builds an app" initiative. A new agent on Agenta should start useful, then turn itself into a real application through a conversation: find the tools it needs, connect integrations, edit its own instructions, set a trigger or a cron job, and commit. The agent becomes the app. The user never writes config by hand; they have a conversation.

This doc frames the four design docs that ship together, says what is locked, and lists what is still open. Read it first, then read the four.

Aligned to the approved #4917 build-kit overlay model. The build kit is a read-only agent-template overlay the backend serves on the inspect response at additional_context.playground_build_kit.agent_template_overlay; the frontend applies it for a playground run and excludes it on commit, and the agent service never sees it. The advanced-drawer design folded into #4917, so this is now four sub-projects, not five.

The four sub-projects

Cross-cutting locked decisions

  • The agent becomes the app (self-modification only).
  • A read-only overlay the frontend applies, never committed. The agent service stays dumb: no run flag, no service-side merge.
  • Ship all the platform tools in the overlay (every PLATFORM_OPS entry, as { "type": "platform", "op": ... }).
  • One generic client-tool round-trip carries both config approval and connection requests.
  • Connections are frontend-owned and reference-only; the agent never holds the secret.
  • Client tools and skills are reference-tool embeds (@ag.embed); request_connection is not a platform op.
  • No forced skills. The published default is bare; a skill reaches a run only because the overlay carries its embed.

How to read

Build order: the frontend round-trip is the foundation; default config is independent and can build in parallel; builder tools and skills build on the round-trip. The advanced-drawer drawer folded into #4917, and the collapsible-sections cleanup ships separately. The decided points and the open items are gathered in the doc and do not block this design review.

Scope / risk

Doc only, no code. This PR is the overview; the substance lives in the four linked PRs.

https://claude.ai/code/session_01GYo3UEfvsZpncagqb28Mbc

@vercel

vercel Bot commented Jun 28, 2026

Copy link
Copy Markdown

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
agenta-documentation Ready Ready Preview, Comment Jun 28, 2026 9:48pm

Request Review

@coderabbitai

coderabbitai Bot commented Jun 28, 2026

Copy link
Copy Markdown

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: bf03672c-0bc7-4ccb-94f1-f41e3c414088

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch docs/agent-builds-an-app-overview

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands.

@mmabrouk

Copy link
Copy Markdown
Member Author

🤖 The AI agent says:

What this covers. The map for the whole "agent builds an app" initiative. A new agent should start useful, then turn itself into a real application through a conversation: find tools, connect integrations, edit its own instructions, set a trigger or cron, and commit. This page frames the four design PRs (#4917, #4918, #4919, #4920), says what is locked, and lists what is open.

Where to look. The four sub-project summaries, the cross-cutting locked decisions, and the build order.

Feedback needed. Please confirm the framing and the cross-cutting locked decisions:

  • The agent becomes the app (self-modification only).
  • Opt-out is delete-only.
  • One generic client-tool round-trip carries both config approval and connection requests.
  • Connections are frontend-owned and reference-only; the agent never holds the secret.
  • Skills arrive by embed, not by force.
  • Defaults reach the new-agent draft through the catalog template.

This PR is the map; the four linked PRs hold the detail.

@mmabrouk

Copy link
Copy Markdown
Member Author

🤖 The AI agent says: Aligned the overview to the approved #4917 build-kit overlay model.

What changed:

  • The initiative framing, sub-project list, locked decisions, build order, and open items now describe the overlay model: a read-only agent-template overlay the backend serves at additional_context.playground_build_kit.agent_template_overlay, applied by the frontend on a playground run and excluded on commit, with the agent service kept dumb (no run flag, no service-side merge).
  • The advanced-build-kit drawer folded into docs(agent): default config as a build-kit overlay (inspect_context) #4917, so this is now four sub-projects, not five. Added a short section noting the fold and that the collapsible-sections change ships separately as a small drawer cleanup.
  • Locked decisions updated: overlay; frontend applies and excludes; client tools and skills as @ag.embed reference-tool embeds (with request_connection not a platform op); no forced skills; published default bare. Dropped the old delete-only, catalog-template, and client-executor-platform-op decisions.
  • Builder bullet now mentions the new remove_schedule / remove_subscription undo tools. Decided points (toggle ephemeral, drawer fold, kit permissions read-only, builder testing) moved to a "Decided" section; open items trimmed to the four docs.

Please review the locked-decisions list and the four-sub-project structure.


- **The agent becomes the app.** Self-modification only. The agent edits and commits itself. It
does not build other workflows in this round.
- **A read-only overlay the frontend applies, never committed.** The build kit is an agent-template

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The load-bearing locked decision: the build kit is a read-only overlay the frontend applies and excludes, served at additional_context.playground_build_kit.agent_template_overlay. The agent service stays dumb. This is the pivot that ripples through all four docs.

an `@ag.embed` in the committed config. The bodies are placeholders that capture the flow; the
final prose lands later.

## The drawer folded into the default config

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The advanced build-kit drawer is no longer a separate sub-project: its design folded into #4917, which now owns both the overlay and the drawer. The collapsible-sections change is the one piece that ships independently.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

needs-review Agent updated; awaiting Mahmoud's review

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant