feat(otel): instrument runtime with GenAI semantic conventions by tdabasinskas · Pull Request #2620 · docker/docker-agent

tdabasinskas · 2026-05-04T07:49:26Z

Adds end-to-end OpenTelemetry instrumentation following the GenAI semantic conventions:

Provider-layer chat/embeddings/rerank CLIENT spans with gen_ai.* attributes and the gen_ai.client.token.usage / operation.duration histograms.
Runtime spans (runtime.session, runtime.stream, runtime.fallback, runtime.tool.call, runtime.run_skill, runtime.task_transfer, runtime.handoff, background_agent.run).
MCP client + server spans with params._meta propagation, plus OAuth flow spans.
A2A endpoints wrapped with otelhttp and marked as invoke_agent.
Hook executor span with verdict/decision/reason annotation; subprocess trace context propagation for hooks, LSP servers, and sandbox docker exec.
Memory, RAG, sessiontitle, evaluation, anthropic-specific spans.
Built-in tool internals (shell, filesystem, fetch, lsp, codemode, ...) surface their work as span attributes.
W3C trace context + baggage propagation across all HTTP servers and clients.
Standard OTel resource attributes (service.*, host.*, process.*, os.type)

This PR wires two opt-in env vars beyond the default OTel SDK ones:

OTEL_INSTRUMENTATION_GENAI_CAPTURE_MESSAGE_CONTENT — capture prompts, responses, tool arguments and tool results as span attributes. Off by default (PII surface).
OTEL_SEMCONV_STABILITY_OPT_IN=gen_ai_latest_experimental — emit only the spec-defined gen_ai.* keys. Default is dual-emit (both gen_ai.* and the legacy tool.name / agent / session.id keys), so existing dashboards keep working alongside spec-aware tooling.

The diff is large — ~50 files, ~5k lines. It's split into 10 topical commits (telemetry primitives → SDK init → providers → runtime → hooks → MCP → A2A → servers/cold-start → memory/RAG → tool internals) so each commit is independently reviewable. Most of the volume is in the new pkg/telemetry/genai/ and pkg/telemetry/mcp/ packages, which are pure helpers; the surface-area changes elsewhere are 1-3 lines per call site.

dgageot · 2026-05-04T17:33:21Z

@tdabasinskas not sure why, GitHub doesn't want to merge this one, because of hypothetical merge conflicts. Could you rebase?

tdabasinskas · 2026-05-04T18:40:45Z

@tdabasinskas not sure why, GitHub doesn't want to merge this one, because of hypothetical merge conflicts. Could you rebase?

Done!

aheritier · 2026-05-05T21:59:52Z

/review

tdabasinskas · 2026-05-06T13:58:52Z

/review

I don't think that worked 😅

aheritier · 2026-05-06T20:52:23Z

/review

aheritier

LGTM. Clean design, solid thread safety, good spec adherence. The inline comments are all non-blocking suggestions for follow-up.

docker-agent · 2026-05-06T21:01:03Z

❌ PR Review Failed — The review agent encountered an error and could not complete the review. View logs.

dgageot · 2026-05-07T14:54:00Z

@tdabasinskas can you rebase one more time and I'll review it?

tdabasinskas · 2026-05-07T15:18:18Z

@tdabasinskas can you rebase one more time and I'll review it?

Done!

aheritier

Re-approving — my prior approval was dismissed by the merge of upstream/main into the branch, but there are zero new author code changes since a4ce95e8. All three of my previous comments were addressed and the threads are resolved. CI is green on the merge commit.

Original assessment stands: clean design, solid thread safety, good GenAI semconv adherence. LGTM.

Every toolset goes through tools.WithName in the team-loader registry, which sandwiches a *tools.namedToolSet between the StartableToolSet and the actual implementation. %T on the embedded ToolSet therefore always reported *tools.namedToolSet regardless of whether the inner toolset was MCP, A2A, a builtin, or anything else - so the attribute could never answer the question it exists to answer ("which kind of toolset is starting right now?"). Unwrap once before formatting, mirroring what DescribeToolSet already does for the same reason. Now the attribute reads *mcp.Toolset, *builtin.ShellTool, etc., so a toolset.start without HTTP children is immediately distinguishable from a remote MCP whose POSTs are missing for some other reason.

Record tool counts at two key points in the execution flow: - Session span: total tools available after exclusion filters - MCP list span: tools successfully yielded by each server These attributes enable quick analysis of tool availability without inspecting nested spans or JSON-RPC payloads. The MCP count preserves partial results when iteration terminates early.

…errors Introduce a `classifyByStatusCode` helper that probes for an HTTP status code via a `StatusCode() int` method before falling back to substring matching. This prevents false positives when error messages incidentally contain strings like "401", "403", or "429" in request IDs, byte counts, or status-line fragments. Providers that expose HTTP status codes through a structured interface now get classified from the structural signal, while text-only errors continue to use the existing heuristic. Also add documentation clarifying that `getInstruments` binds to the global MeterProvider on first call via `sync.Once`, which affects test setup requirements.

tdabasinskas requested a review from a team as a code owner May 4, 2026 07:49

tdabasinskas mentioned this pull request May 4, 2026

OTEL, again #393

Open

tdabasinskas marked this pull request as draft May 4, 2026 07:58

tdabasinskas marked this pull request as ready for review May 4, 2026 08:52

tdabasinskas force-pushed the feat/otel-genai-semconv branch from fa4a01d to 2a69313 Compare May 4, 2026 11:16

tdabasinskas force-pushed the feat/otel-genai-semconv branch from 2a69313 to 9b08feb Compare May 4, 2026 18:40

tdabasinskas force-pushed the feat/otel-genai-semconv branch 2 times, most recently from e7194da to b6a181b Compare May 5, 2026 08:02

tdabasinskas marked this pull request as draft May 5, 2026 12:26

tdabasinskas marked this pull request as ready for review May 5, 2026 13:31

aheritier previously approved these changes May 6, 2026

View reviewed changes

Comment thread pkg/telemetry/genai/errors.go

Comment thread pkg/telemetry/genai/span.go

Comment thread pkg/telemetry/genai/metrics.go

aheritier added kind/feat PR adds a new feature (maps to feat: commit prefix) area/agent For work that has to do with the general agent loop/agentic features of the app priority:medium labels May 6, 2026

tdabasinskas dismissed aheritier’s stale review via 4cbee6b May 7, 2026 05:34

tdabasinskas requested a review from aheritier May 7, 2026 07:57

aheritier removed priority:medium labels May 7, 2026

aheritier previously approved these changes May 7, 2026

View reviewed changes

aheritier added effort:large go Pull requests that update go code labels May 7, 2026

tdabasinskas added 3 commits May 26, 2026 14:02

tdabasinskas force-pushed the feat/otel-genai-semconv branch from b43ca96 to 79bc9eb Compare May 26, 2026 11:11

tdabasinskas marked this pull request as ready for review May 26, 2026 11:11

aheritier added status/needs-rebase PR has merge conflicts or is out of date with main and removed status/needs-rebase PR has merge conflicts or is out of date with main labels May 26, 2026

aheritier marked this pull request as draft May 27, 2026 06:31

aheritier added area/api For features/issues/fixes related to the usage of the cagent API area/mcp MCP protocol, MCP tool servers, integration labels May 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(otel): instrument runtime with GenAI semantic conventions#2620

feat(otel): instrument runtime with GenAI semantic conventions#2620
tdabasinskas wants to merge 14 commits into
docker:mainfrom
cogvel:feat/otel-genai-semconv

tdabasinskas commented May 4, 2026 •

edited

Loading

Uh oh!

dgageot commented May 4, 2026

Uh oh!

tdabasinskas commented May 4, 2026

Uh oh!

aheritier commented May 5, 2026

Uh oh!

tdabasinskas commented May 6, 2026 •

edited

Loading

Uh oh!

aheritier commented May 6, 2026

Uh oh!

aheritier left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

docker-agent Bot commented May 6, 2026

Uh oh!

dgageot commented May 7, 2026

Uh oh!

tdabasinskas commented May 7, 2026

Uh oh!

aheritier left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

tdabasinskas commented May 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dgageot commented May 4, 2026

Uh oh!

tdabasinskas commented May 4, 2026

Uh oh!

aheritier commented May 5, 2026

Uh oh!

tdabasinskas commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aheritier commented May 6, 2026

Uh oh!

aheritier left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

docker-agent Bot commented May 6, 2026

Uh oh!

dgageot commented May 7, 2026

Uh oh!

tdabasinskas commented May 7, 2026

Uh oh!

aheritier left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

tdabasinskas commented May 4, 2026 •

edited

Loading

tdabasinskas commented May 6, 2026 •

edited

Loading