
yaozheng-fang · 2026-06-02T07:02:58Z

What

Records the model provider's response id on the LLM trace span as the standard OpenTelemetry GenAI attribute gen_ai.response.id.

For Volcengine Ark this id equals the x-request-id response header, which makes it the most useful identifier when correlating with the provider's logs or filing tickets.

Why it's done this way

ADK builds its LlmResponse from the litellm response's message/model/usage but drops response.id, so the tracer never sees it.
A pure litellm callback can read the id, but litellm runs success callbacks detached from the OTel context (verified: get_current_span() is non-recording there), so it can't attach the id to the LLM span.
So a self-contained tracing module wraps LiteLLMClient.acompletion in-context (where the raw response is available, inside ADK's generate_content span) and sets gen_ai.response.id on the current span.
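The in-context wrap described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code in litellm_response_id.py: StubSpan, StubLiteLLMClient, patch_acompletion and the _current_span context variable are hypothetical stand-ins. In the real module the span would presumably come from OpenTelemetry's get_current_span() and the wrapped class would be ADK's LiteLLMClient.

```python
import asyncio
import contextvars
import functools
from types import SimpleNamespace


class StubSpan:
    """Stand-in for an OTel span: stores attributes only while recording."""

    def __init__(self, recording=True):
        self._recording = recording
        self.attributes = {}

    def is_recording(self):
        return self._recording

    def set_attribute(self, key, value):
        if self._recording:
            self.attributes[key] = value


# Stand-in for the OTel context; the default models a non-recording span.
_current_span = contextvars.ContextVar(
    "current_span", default=StubSpan(recording=False)
)


class StubLiteLLMClient:
    """Hypothetical client whose acompletion returns a raw response with an id."""

    async def acompletion(self, **kwargs):
        return SimpleNamespace(id="resp-123", model=kwargs.get("model"))


def patch_acompletion(client_cls):
    """Wrap client_cls.acompletion so it copies response.id to the current span."""
    original = client_cls.acompletion
    if getattr(original, "_response_id_patched", False):
        return  # already wrapped; keep the patch idempotent

    @functools.wraps(original)
    async def wrapper(self, *args, **kwargs):
        # Awaited in the caller's context, so the caller's span is still current.
        response = await original(self, *args, **kwargs)
        span = _current_span.get()
        response_id = getattr(response, "id", None)
        if span.is_recording() and response_id:
            span.set_attribute("gen_ai.response.id", response_id)
        return response

    wrapper._response_id_patched = True
    client_cls.acompletion = wrapper


async def demo():
    patch_acompletion(StubLiteLLMClient)
    span = StubSpan()
    token = _current_span.set(span)  # simulates being inside generate_content
    try:
        await StubLiteLLMClient().acompletion(model="stub-model")
    finally:
        _current_span.reset(token)
    return span.attributes


attrs = asyncio.run(demo())
```

Because the wrapper awaits the original call in the caller's own context, the span that is current when the response arrives is the caller's span, which is the property a detached success callback lacks.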

Scope / safety

New file veadk/tracing/telemetry/litellm_response_id.py + one lazy registration line in OpentelemetryTracer (same pattern as the existing patch_google_adk_telemetry).
No agent.py change, no model subclass. The wrap is applied only when a VeADK OpenTelemetry tracer is created; with no tracer nothing is patched, and when patched-but-not-recording it's a no-op.
Non-streaming is covered (the default); the streaming wrapper has no stable id at this point, so it's skipped.
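The safety properties listed above (no-op when the span is not recording, streaming skipped) could be expressed as guards like the ones below. This is a sketch under assumptions: record_response_id, its stream flag, and StubSpan are hypothetical names, and the PR does not say how the real module detects a streaming call.

```python
from types import SimpleNamespace


class StubSpan:
    """Stand-in for an OTel span with the two methods the guards rely on."""

    def __init__(self, recording):
        self._recording = recording
        self.attributes = {}

    def is_recording(self):
        return self._recording

    def set_attribute(self, key, value):
        self.attributes[key] = value


def record_response_id(span, response, stream=False):
    """Set gen_ai.response.id on span; return True only if it was written."""
    if stream:
        return False  # streaming wrapper: no stable id at this point
    if not span.is_recording():
        return False  # tracing off or span not sampled: do nothing
    response_id = getattr(response, "id", None)
    if not isinstance(response_id, str) or not response_id:
        return False  # response carries no usable id
    span.set_attribute("gen_ai.response.id", response_id)
    return True


response = SimpleNamespace(id="resp-123")

recording_span = StubSpan(recording=True)
wrote = record_response_id(recording_span, response)

idle_span = StubSpan(recording=False)
skipped_idle = record_response_id(idle_span, response)

stream_span = StubSpan(recording=True)
skipped_stream = record_response_id(stream_span, response, stream=True)
```

Returning a boolean instead of raising keeps the helper harmless on every path, so a failure to record the id never affects the model call itself.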

Verified locally (real Ark call)

With a tracer enabled, the generate_content span's gen_ai.response.id exactly equals the live Ark response.id:

SPAN gen_ai.response.id: ['021780383672524f789d4d4546194a16e88624aed779d5ca0707c']
REAL ark response.id   : ['021780383672524f789d4d4546194a16e88624aed779d5ca0707c']
RESULT: PASS

🤖 Generated with Claude Code

ADK drops the litellm response's id when building LlmResponse, so the tracer never sees it. For Volcengine Ark that id == the x-request-id response header, the most useful id for correlating with the provider's logs. A pure litellm callback can read it but runs detached from the OTel context, so it can't attach to the LLM span. Instead, a self-contained tracing module wraps LiteLLMClient.acompletion in-context (where the raw response is available, inside ADK's generate_content span) and sets the standard OTel GenAI attribute gen_ai.response.id on the current span. The wrap is applied lazily only when a VeADK OpenTelemetry tracer is created (same pattern as patch_google_adk_telemetry); no agent.py change, no model subclass, and a no-op when tracing is off.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

zakahan approved these changes Jun 2, 2026

zakahan merged commit f4ee445 into main Jun 2, 2026
16 checks passed

feat(tracing): record provider response id as gen_ai.response.id #577

zakahan merged 1 commit into main from feat/trace-response-id
