fix(agent): land skill/error trace attrs under recognized ag.* namespaces (F-036) by mmabrouk · Pull Request #4857 · Agenta-AI/agenta

mmabrouk · 2026-06-25T21:17:08Z

Context

The Wave-1 agent tracing fix (PR #4855, F-029/F-030) added skill and error attributes to the agent span, but it stamped them as ag.agent.skills.loaded / ag.agent.skills.count and ag.error.message / ag.error.provider. Live QA (F-036) found these land under ag.unsupported.* in the trace viewer (e.g. ag.unsupported.agent.skills.loaded, ag.unsupported.error.message).

Root cause: Agenta's OTel ingest (api/oss/src/core/tracing/utils/attributes.py initialize_ag_attributes) strict-whitelists top-level ag.* keys against the AgAttributes pydantic model and relocates any unrecognized key into an ag.unsupported.* bucket. agent and error are not fields on that model, so both attribute groups were demoted. The data was correct and queryable, just mis-namespaced.
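The demotion described above can be modeled with a small sketch. This is not the ingest code (which is Python, in initialize_ag_attributes); it is a hypothetical TypeScript model that assumes flat dotted attribute keys. The RECOGNIZED set is an illustrative subset: this PR confirms only that `meta` and `exception` are accepted and that `agent` and `error` are not. The function name and the attribute values are invented for illustration.

```typescript
// Illustrative model only; the real whitelist lives in the Python ingest.
// Hypothetical subset: the PR text confirms `meta` and `exception` only.
const RECOGNIZED = new Set<string>(["meta", "exception"]);

type Attrs = Record<string, unknown>;

// Relocate any ag.<top>.* key whose <top> is not recognized into ag.unsupported.*.
function relocateUnrecognized(attrs: Attrs): Attrs {
  const out: Attrs = {};
  for (const [key, value] of Object.entries(attrs)) {
    if (!key.startsWith("ag.")) {
      out[key] = value; // non-ag keys are outside the scope of this sketch
      continue;
    }
    const rest = key.slice("ag.".length);
    const topLevel = rest.split(".")[0];
    if (RECOGNIZED.has(topLevel)) {
      out[key] = value; // recognized bucket: passes through untouched
    } else {
      out[`ag.unsupported.${rest}`] = value; // demoted, still queryable
    }
  }
  return out;
}

// Old names (Wave-1) versus the names introduced by this PR.
const before = relocateUnrecognized({
  "ag.agent.skills.loaded": ["my-skill"],
  "ag.error.message": "boom",
});
const after = relocateUnrecognized({
  "ag.meta.skills.loaded": ["my-skill"],
  "ag.exception.message": "boom",
});

console.log(Object.keys(before));
// [ 'ag.unsupported.agent.skills.loaded', 'ag.unsupported.error.message' ]
console.log(Object.keys(after));
// [ 'ag.meta.skills.loaded', 'ag.exception.message' ]
```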

This change moves the attributes to existing recognized free-form ag.* buckets, so they land first-class with no api/SDK schema change:

- skills: ag.agent.skills.{loaded,count} -> ag.meta.skills.{loaded,count} (ag.meta is the recognized home for run/request metadata, the same place model/system/response info lands)
- error: ag.error.{message,provider} -> ag.exception.{message,provider} (ag.exception is the recognized error bucket; message mirrors the OTel exception event)

Both ag.meta and ag.exception pass ingest validation untouched (they are free-form Meta / Data dicts), so no api-side schema change is needed. Registering net-new ag.agent.* / ag.error.* namespaces would mean coordinated edits to the SDK pydantic model + the api validator + tests + back-compat — non-trivial for a pure-naming polish where the info is already present, so it was deliberately not done.
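As a rough sketch of what the renamed attributes look like on the runner side, the helpers below build the two attribute groups under the new keys. The helper names, the value types (a string array for loaded, a number for count) and the choice to drop the provider when none is known are assumptions for illustration; the actual code in otel.ts may differ. The omit-when-empty and provider-fallback behaviors follow the edge cases listed under "How to QA".

```typescript
type AttrValue = string | number | string[];

// Hypothetical helper: skill attributes under ag.meta.skills.*.
// Returns no attributes at all when no skills were received.
function skillAttributes(skills: string[] | undefined): Record<string, AttrValue> {
  if (!skills || skills.length === 0) return {};
  return {
    "ag.meta.skills.loaded": skills,
    "ag.meta.skills.count": skills.length,
  };
}

// Hypothetical helper: error attributes under ag.exception.*.
// The provider falls back to the provider of the init model when not given.
function errorAttributes(
  message: string,
  provider: string | undefined,
  initProvider: string | undefined,
): Record<string, AttrValue> {
  const attrs: Record<string, AttrValue> = { "ag.exception.message": message };
  const resolved = provider ?? initProvider;
  // Assumption: omit the provider key entirely if neither source is known.
  if (resolved !== undefined) attrs["ag.exception.provider"] = resolved;
  return attrs;
}

console.log(skillAttributes(["author-skill"]));
console.log(skillAttributes([])); // {}
console.log(errorAttributes("rate limited", undefined, "example-provider"));
```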

Scope / risk

- Runner-only: services/agent/src/tracing/otel.ts (three attribute-name changes across the Pi extension tracer and the sandbox-agent ACP tracer) plus its unit test.
- No /run wire change (the golden wire contract is untouched; skill names ride an internal env var, errors are span-side only).
- No api, SDK, or FE change. No live runner restart.
- Behavior risk: a downstream trace query that hard-codes ag.unsupported.agent.skills.* or ag.unsupported.error.* would need to read the new keys. No such query exists in the repo (grep of code + interface inventory is clean; only QA scratch docs referenced the old names, now updated).
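If an external consumer did depend on the demoted names, one possible migration aid is a reader that prefers the new key and falls back to the old ag.unsupported.* location for traces recorded before this change. The sketch below is hypothetical: it assumes span attributes are available as a flat map of dotted keys, and the `count` and `provider` legacy names are inferred from the ag.unsupported.agent.skills.* / ag.unsupported.error.* patterns named above rather than quoted from a trace.

```typescript
type SpanAttributes = Record<string, unknown>;

// New key -> legacy (demoted) key, for traces ingested before this PR.
const LEGACY_KEYS: Record<string, string> = {
  "ag.meta.skills.loaded": "ag.unsupported.agent.skills.loaded",
  "ag.meta.skills.count": "ag.unsupported.agent.skills.count",
  "ag.exception.message": "ag.unsupported.error.message",
  "ag.exception.provider": "ag.unsupported.error.provider",
};

// Read an attribute by its new name, falling back to the legacy name.
function readAttr(attrs: SpanAttributes, key: string): unknown {
  if (key in attrs) return attrs[key];
  const legacy = LEGACY_KEYS[key];
  return legacy !== undefined ? attrs[legacy] : undefined;
}

const oldSpan: SpanAttributes = { "ag.unsupported.error.message": "boom" };
const newSpan: SpanAttributes = { "ag.exception.message": "boom" };

console.log(readAttr(oldSpan, "ag.exception.message")); // boom
console.log(readAttr(newSpan, "ag.exception.message")); // boom
```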

Not fixed here (reported, out of scope): builtins-in-loaded

F-036 also noted the forced _agenta.* platform skill did not appear in the loaded list on a pi_agenta run (only the author skill). This is NOT a runner trace bug: the runner faithfully stamps exactly the materialized request.skills it receives. The platform default skill (_agenta.agenta-getting-started) is @ag.embed-ed only into the DEFAULT agent config template (services/oss/src/agent/schemas.py), so a custom config drops it and it never reaches the wire. The runner has no independent forced-skill installation and treats pi_agenta like pi_core for skills, so it cannot stamp a skill it never received. Force-injecting the platform skill for every pi_agenta run regardless of config is a server-side concern (the platform-skill seeding "separate workstream" noted in harnesses.py / agenta_builtins.py) and lives outside this lane's otel.ts boundary. Recorded as decision D-016.

How to QA

Prerequisites: Node 24 on PATH; from services/agent.

Run the focused unit test:
```
pnpm exec vitest run tests/unit/otel-skills-error.test.ts
```
Expected: 7 passed. The new test, "uses recognized ag.* namespaces, never ag.agent.* / ag.error.* (F-036)", asserts that the agent span carries no ag.agent.*, ag.error.*, or ag.unsupported.* keys, and that ag.meta.skills.loaded and ag.exception.message are present.
Full suite + typecheck:
```
pnpm test && pnpm run typecheck
```
Expected: 268 passed, tsc clean.
Live (orchestrator, optional): run a pi_agenta skill cell and an error cell, fetch the trace via GET /api/preview/tracing/traces/{trace_id}, and grep span attributes — the loaded skills now appear under ag.meta.skills.* and the error under ag.exception.*, not under ag.unsupported.*.
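The grep step can also be scripted. The sketch below assumes the trace JSON has already been fetched and parsed; because the exact response shape of the tracing endpoint is not stated here, it walks the payload generically and reports every path that contains a given namespace, whether attributes are stored as nested objects or as flat dotted keys. The function names and the sample payload are hypothetical.

```typescript
// Collect the dotted path of every leaf value in an arbitrary JSON payload.
// Array elements share their parent's path (no index segment).
function flattenKeys(value: unknown, prefix = "", out = new Set<string>()): Set<string> {
  if (Array.isArray(value)) {
    for (const item of value) flattenKeys(item, prefix, out);
  } else if (value !== null && typeof value === "object") {
    for (const [k, v] of Object.entries(value as Record<string, unknown>)) {
      flattenKeys(v, prefix ? `${prefix}.${k}` : k, out);
    }
  } else if (prefix) {
    out.add(prefix);
  }
  return out;
}

// Return the paths that contain `namespace` as a run of whole segments.
function findNamespaceHits(payload: unknown, namespace: string): string[] {
  return [...flattenKeys(payload)].filter((path) =>
    `.${path}.`.includes(`.${namespace}.`),
  );
}

// Hypothetical payload shape, for illustration only.
const sample = {
  spans: [
    {
      attributes: {
        ag: {
          meta: { skills: { loaded: ["author-skill"], count: 1 } },
          exception: { message: "boom" },
        },
      },
    },
  ],
};

console.log(findNamespaceHits(sample, "ag.meta.skills"));
console.log(findNamespaceHits(sample, "ag.exception"));
console.log(findNamespaceHits(sample, "ag.unsupported")); // []
```

After this change, the first two calls should report hits on a skill cell and an error cell respectively, and the last should report none.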

Edge cases covered by tests: no skills -> attributes omitted; recordError provider falls back to the init model provider; local-Pi self-instrument path stamps Pi's own span (skills) and emits a standalone agent_error span (error).

https://claude.ai/code/session_01GYo3UEfvsZpncagqb28Mbc

…aces (F-036)

The Wave-1 tracing fix stamped skill and error attributes under ag.agent.skills.* and ag.error.*, which Agenta's OTel ingest strict-whitelists against AgAttributes and demotes to ag.unsupported.* (queryable but mis-namespaced). Move them to the recognized free-form buckets ag.meta.skills.{loaded,count} and ag.exception.{message,provider} so they land first-class with no api schema change.

Runner-only (services/agent/src/tracing/otel.ts); updates the otel-skills-error unit tests and adds a namespace guard asserting no ag.agent.*/ag.error.*/ag.unsupported.* keys.

Claude-Session: https://claude.ai/code/session_01GYo3UEfvsZpncagqb28Mbc

vercel · 2026-06-25T21:17:16Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
agenta-documentation	Ready	Preview, Comment	Jun 25, 2026 9:18pm

mmabrouk · 2026-06-25T21:17:26Z

Needs review — code review of a runner-only tracing namespace fix (F-036).

What changed and what to look at:

services/agent/src/tracing/otel.ts: three attribute-name moves so skill/error trace attrs land under recognized ag.* namespaces instead of being demoted to ag.unsupported.* by Agenta's OTel ingest:
- ag.agent.skills.{loaded,count} -> ag.meta.skills.{loaded,count}
- ag.error.{message,provider} -> ag.exception.{message,provider}
Please sanity-check the namespace choice: ag.meta (free-form Meta) and ag.exception (free-form Data) are passed through untouched by initialize_ag_attributes (api/oss/src/core/tracing/utils/attributes.py:247-248), so no api/SDK schema change is needed. Is ag.meta.skills.* the right home for loaded-skills metadata (it matches the ag.meta.* convention used for model/system/response), or would you prefer a registered ag.agent.* namespace (non-trivial: SDK pydantic + api validator + tests + back-compat)?

Decision needed from you: confirm the "use recognized buckets, don't register new ones" call (D-015), and confirm that punting the builtins-in-loaded gap to the server-side platform-skill seeding workstream (D-016, out of otel.ts scope) is the right split.

Not a UX/functional check — this is observability-only and there is no /run wire change. Live runner was not restarted.

mmabrouk · 2026-06-25T21:17:27Z

@coderabbitai review

coderabbitai · 2026-06-25T21:17:44Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-06-25T21:18:19Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: ac0cdea6-3081-4266-8a68-7ecb8786ba1c

📥 Commits

Reviewing files that changed from the base of the PR and between 6324757 and 639465e.

📒 Files selected for processing (2)

services/agent/src/tracing/otel.ts
services/agent/tests/unit/otel-skills-error.test.ts

📝 Walkthrough

Summary by CodeRabbit

Bug Fixes
- Standardized tracing metadata so loaded skills now appear in the expected ag.meta.* fields.
- Updated error reporting to use ag.exception.* fields for clearer, more consistent span details.
- Kept exception events and span status behavior unchanged while improving attribute naming.
Tests
- Updated tracing tests to verify the new metadata and error attribute namespaces across both agent paths.

Walkthrough

Tracing attributes for loaded skills and run errors were renamed to new namespaces in agent OTel code, and the matching unit tests were updated to assert the revised keys.

Changes

Tracing namespace updates

Layer / File(s)	Summary
Span attribute namespace changes `services/agent/src/tracing/otel.ts`	`createAgentaOtel` and `createSandboxAgentOtel` now write loaded-skills attributes under `ag.meta.`, and sandbox-agent failure metadata is written under `ag.exception.`.
Tracing test updates `services/agent/tests/unit/otel-skills-error.test.ts`	The tracing test descriptions and assertions were updated to expect `ag.meta.` and `ag.exception.` across sandbox-agent, Pi, and `recordError` cases.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Possibly related PRs

Agenta-AI/agenta#4855: Introduced the agent OTel skill and error attributes under ag.agent.*/ag.error.*, the namespaces this PR moves to ag.meta.*/ag.exception.*, along with the matching unit assertions.

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title clearly summarizes the main change: moving agent skill and error trace attributes into recognized namespaces.
Description check	✅ Passed	The description accurately matches the namespace fix and scope of the runner-only tracing changes.
Docstring Coverage	✅ Passed	Docstring coverage is 75.00% which is sufficient. The required threshold is 60.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.


dosubot Bot added the size:M (This PR changes 30-99 lines, ignoring generated files.) and Backend labels Jun 25, 2026

mmabrouk added the needs-review (Agent updated; awaiting Mahmoud's review) label Jun 25, 2026

vercel Bot deployed to Preview June 25, 2026 21:18 View deployment

mmabrouk merged commit ea8ab77 into big-agents Jun 25, 2026
30 of 31 checks passed

mmabrouk merged 1 commit into big-agents from fix/agent-tracing-namespace
