2026年4月9日 06:15

6 ファイル変更 +240 -12

この更新の概要

Agent SDKにおいてOpenTelemetryを使用した可観測性（Observability）のガイドが新設され、トレース、メトリクス、ログの出力方法が詳細に解説されています。デスクトップアプリにおける環境変数の継承動作が修正され、macOSやWindowsでの仕様変更に伴い、新しいローカル環境エディタでの設定方法が導入されました。スキルの構成方法やステータスラインの表示に関するドキュメントが更新され、開発コンテナやプラットフォーム固有の記述も微調整されています。

agent-sdk/observability +194 -0

@@ -0,0 +1,194 @@

---

title: observability

source: https://code.claude.com/docs/en/agent-sdk/observability.md

---

# Observability with OpenTelemetry

> Export traces, metrics, and events from the Agent SDK to your observability backend using OpenTelemetry.

When you run agents in production, you need visibility into what they did:

- which tools they called

- how long each model request took

- how many tokens were spent

- where failures occurred

The Agent SDK can export this data as OpenTelemetry traces, metrics, and log events to any backend that accepts the OpenTelemetry Protocol (OTLP), such as Honeycomb, Datadog, Grafana, Langfuse, or a self-hosted collector.

This guide explains how the SDK emits telemetry, how to configure the export, and how to tag and filter the data once it reaches your backend. To read token usage and cost directly from the SDK response stream instead of exporting to a backend, see [Track cost and usage](/en/agent-sdk/cost-tracking).

## How telemetry flows from the SDK

The Agent SDK runs the Claude Code CLI as a child process and communicates with it over a local pipe. The CLI has OpenTelemetry instrumentation built in: it records spans around each model request and tool execution, emits metrics for token and cost counters, and emits structured log events for prompts and tool results. The SDK does not produce telemetry of its own. Instead, it passes configuration through to the CLI process, and the CLI exports directly to your collector.

Configuration is passed as environment variables. By default, the child process inherits your application's environment, so you can configure telemetry in either of two places:

- **Process environment:** set the variables in your shell, container, or orchestrator before your application starts. Every `query()` call picks them up automatically with no code change. This is the recommended approach for production deployments.

- **Per-call options:** set the variables in `ClaudeAgentOptions.env` (Python) or `options.env` (TypeScript). Use this when different agents in the same process need different telemetry settings. In Python, `env` is merged on top of the inherited environment. In TypeScript, `env` replaces the inherited environment entirely, so include `...process.env` in the object you pass.

The CLI exports three independent OpenTelemetry signals. Each has its own enable switch and its own exporter, so you can turn on only the ones you need.

| Signal | What it contains | Enable with |

| - | - | - |

| Metrics | Counters for tokens, cost, sessions, lines of code, and tool decisions | `OTEL_METRICS_EXPORTER` |

| Log events | Structured records for each prompt, API request, API error, and tool result | `OTEL_LOGS_EXPORTER` |

| Traces | Spans for each interaction, model request, tool call, and hook (beta) | `OTEL_TRACES_EXPORTER` plus `CLAUDE_CODE_ENHANCED_TELEMETRY_BETA=1` |

For the complete list of metric names, event names, and attributes, see the Claude Code [Monitoring](/en/monitoring-usage) reference. The Agent SDK emits the same data because it runs the same CLI. Span names are listed in [Read agent traces](#read-agent-traces) below.

## Enable telemetry export

Telemetry is off until you set `CLAUDE_CODE_ENABLE_TELEMETRY=1` and choose at least one exporter. The most common configuration sends all three signals over OTLP HTTP to a collector.

The following example sets the variables in a dictionary and passes them through `options.env`. The agent runs a single task, and the CLI exports spans, metrics, and events to the collector at `collector.example.com` while the loop consumes the response stream:

```python Python theme={null}

import asyncio

from claude_agent_sdk import query, ClaudeAgentOptions

OTEL_ENV = {

"CLAUDE_CODE_ENABLE_TELEMETRY": "1",

# Required for traces, which are in beta. Metrics and log events do not need this.

"CLAUDE_CODE_ENHANCED_TELEMETRY_BETA": "1",

# Choose an exporter per signal. Use otlp for the SDK; see the Note below.

"OTEL_TRACES_EXPORTER": "otlp",

"OTEL_METRICS_EXPORTER": "otlp",

"OTEL_LOGS_EXPORTER": "otlp",

# Standard OTLP transport configuration.

"OTEL_EXPORTER_OTLP_PROTOCOL": "http/protobuf",

"OTEL_EXPORTER_OTLP_ENDPOINT": "http://collector.example.com:4318",

"OTEL_EXPORTER_OTLP_HEADERS": "Authorization=Bearer your-token",

}

async def main():

options = ClaudeAgentOptions(env=OTEL_ENV)

async for message in query(

prompt="List the files in this directory", options=options

print(message)

asyncio.run(main())

```

```typescript TypeScript theme={null}

import { query } from "@anthropic-ai/claude-agent-sdk";

const otelEnv = {

CLAUDE_CODE_ENABLE_TELEMETRY: "1",

// Required for traces, which are in beta. Metrics and log events do not need this.

CLAUDE_CODE_ENHANCED_TELEMETRY_BETA: "1",

// Choose an exporter per signal. Use otlp for the SDK; see the Note below.

OTEL_TRACES_EXPORTER: "otlp",

OTEL_METRICS_EXPORTER: "otlp",

OTEL_LOGS_EXPORTER: "otlp",

// Standard OTLP transport configuration.

OTEL_EXPORTER_OTLP_PROTOCOL: "http/protobuf",

OTEL_EXPORTER_OTLP_ENDPOINT: "http://collector.example.com:4318",

OTEL_EXPORTER_OTLP_HEADERS: "Authorization=Bearer your-token",

};

for await (const message of query({

prompt: "List the files in this directory",

// env replaces the inherited environment in TypeScript, so spread

// process.env first to keep PATH, ANTHROPIC_API_KEY, and other variables.

options: { env: { ...process.env, ...otelEnv } },

})) {

console.log(message);

}

```

Because the child process inherits your application's environment by default, you can achieve the same result by exporting these variables in a Dockerfile, Kubernetes manifest, or shell profile and omitting `options.env` entirely.

The `console` exporter writes telemetry to standard output, which the SDK uses

as its message channel. Do not set `console` as an exporter value when running

through the SDK. To inspect telemetry locally, point

`OTEL_EXPORTER_OTLP_ENDPOINT` at a local collector or an all-in-one Jaeger

container instead.

### Flush telemetry from short-lived calls

The CLI batches telemetry and exports on an interval. It flushes any pending data when the process exits cleanly, so a `query()` call that completes normally does not lose spans. However, if your process is killed before the CLI shuts down, anything still in the batch buffer is lost. Lowering the export intervals reduces that window.

By default, metrics export every 60 seconds and traces and logs export every 5 seconds. The following example shortens all three intervals so that data reaches the collector while a short task is still running:

```python Python theme={null}

OTEL_ENV = {

# ... exporter configuration from the previous example ...

"OTEL_METRIC_EXPORT_INTERVAL": "1000",

"OTEL_LOGS_EXPORT_INTERVAL": "1000",

"OTEL_TRACES_EXPORT_INTERVAL": "1000",

}

```

```typescript TypeScript theme={null}

const otelEnv = {

// ... exporter configuration from the previous example ...

OTEL_METRIC_EXPORT_INTERVAL: "1000",

OTEL_LOGS_EXPORT_INTERVAL: "1000",

OTEL_TRACES_EXPORT_INTERVAL: "1000",

};

```

## Read agent traces

Traces give you the most detailed view of an agent run. With `CLAUDE_CODE_ENHANCED_TELEMETRY_BETA=1` set, each step of the agent loop becomes a span you can inspect in your tracing backend:

- **`claude_code.interaction`:** wraps a single turn of the agent loop, from receiving a prompt to producing a response.

- **`claude_code.llm_request`:** wraps each call to the Claude API, with model name, latency, and token counts as attributes.

- **`claude_code.tool`:** wraps each tool invocation, with child spans for the permission wait (`claude_code.tool.blocked_on_user`) and the execution itself (`claude_code.tool.execution`).

- **`claude_code.hook`:** wraps each [hook](/en/agent-sdk/hooks) execution.

Every span carries a `session.id` attribute. When you make several `query()` calls against the same [session](/en/agent-sdk/sessions), filter on `session.id` in your backend to see them as one timeline.

Tracing is in beta. Span names and attributes may change between releases. See

[Traces (beta)](/en/monitoring-usage#traces-beta) in the Monitoring reference

for the trace exporter configuration variables.

## Tag telemetry from your agent

By default, the CLI reports `service.name` as `claude-code`. If you run several agents, or run the SDK alongside other services that export to the same collector, override the service name and add resource attributes so you can filter by agent in your backend.

The following example renames the service and attaches deployment metadata. These values are applied as OpenTelemetry resource attributes on every span, metric, and event the agent emits:

```python Python theme={null}

options = ClaudeAgentOptions(

env={

# ... exporter configuration ...

"OTEL_SERVICE_NAME": "support-triage-agent",

"OTEL_RESOURCE_ATTRIBUTES": "service.version=1.4.0,deployment.environment=production",

)

```

```typescript TypeScript theme={null}

const options = {

env: {

...process.env,

// ... exporter configuration ...

OTEL_SERVICE_NAME: "support-triage-agent",

OTEL_RESOURCE_ATTRIBUTES:

"service.version=1.4.0,deployment.environment=production",

};

```

## Control sensitive data in exports

Telemetry is structural by default. Token counts, durations, model names, and tool names are always recorded, but the content your agent reads and writes is not. Three opt-in variables add content to the exported data:

| Variable | Adds |

| - | - |

| `OTEL_LOG_USER_PROMPTS=1` | Prompt text on `claude_code.user_prompt` events and on the `claude_code.interaction` span |

| `OTEL_LOG_TOOL_DETAILS=1` | Tool input arguments (file paths, shell commands, search patterns) on `claude_code.tool_result` events |

| `OTEL_LOG_TOOL_CONTENT=1` | Full tool input and output bodies as span events on `claude_code.tool`, truncated at 60 KB. Requires [tracing](#read-agent-traces) to be enabled |

Leave these unset unless your observability pipeline is approved to store the data your agent handles. See [Security and privacy](/en/monitoring-usage#security-and-privacy) in the Monitoring reference for the full list of attributes and redaction behavior.

## Related documentation

These guides cover adjacent topics for monitoring and deploying agents:

- [Track cost and usage](/en/agent-sdk/cost-tracking): read token and cost data from the message stream without an external backend.

- [Hosting the Agent SDK](/en/agent-sdk/hosting): deploy agents in containers where you can set OpenTelemetry variables at the environment level.

- [Monitoring](/en/monitoring-usage): the complete reference for every environment variable, metric, and event the CLI emits.

desktop +6 -4

@@ -296,7 +296,7 @@ Each entry in the `configurations` array accepts the following fields:

| `runtimeArgs` | string\[] | Arguments passed to `runtimeExecutable`, such as `["run", "dev"]` |

| `port` | number | The port your server listens on. Defaults to 3000 |

| `cwd` | string | Working directory relative to your project root. Defaults to the project root. Use `${workspaceFolder}` to reference the project root explicitly |

| `env` | object | Additional environment variables as key-value pairs, such as `{ "NODE_ENV": "development" }`. Don't put secrets here since this file is committed to your repo. Secrets set in your shell profile are inherited automatically. |

| `env` | object | Additional environment variables as key-value pairs, such as `{ "NODE_ENV": "development" }`. Don't put secrets here since this file is committed to your repo. To pass secrets to your dev server, set them in the [local environment editor](#local-sessions) instead. |

| `autoPort` | boolean | How to handle port conflicts. See below |

| `program` | string | A script to run with `node`. See [when to use `program` vs `runtimeExecutable`](#when-to-use-program-vs-runtimeexecutable) |

| `args` | string\[] | Arguments passed to `program`. Only used when `program` is set |

@@ -390,9 +390,11 @@ The environment you pick when [starting a session](#start-a-session) determines

### Local sessions

Local sessions inherit environment variables from your shell. If you need additional variables, set them in your shell profile, such as `~/.zshrc` or `~/.bashrc`, and restart the desktop app. See [environment variables](/en/env-vars) for the full list of supported variables.

The desktop app does not always inherit your full shell environment. On macOS, when you launch the app from the Dock or Finder, it reads your shell profile, such as `~/.zshrc` or `~/.bashrc`, to extract `PATH` and a fixed set of Claude Code variables, but other variables you export there are not picked up. On Windows, the app inherits user and system environment variables but does not read PowerShell profiles.

[Extended thinking](/en/common-workflows#use-extended-thinking-thinking-mode) is enabled by default, which improves performance on complex reasoning tasks but uses additional tokens. To disable thinking entirely, set `MAX_THINKING_TOKENS=0` in your shell profile. On Opus, `MAX_THINKING_TOKENS` is ignored except for `0` because adaptive reasoning controls thinking depth instead.

To set environment variables for local sessions and dev servers on any platform, open the environment dropdown in the prompt box, hover over **Local**, and click the gear icon to open the local environment editor. Variables you save here are stored encrypted on your machine and apply to every local session and preview server you start. You can also add variables to the `env` key in your `~/.claude/settings.json` file, though these reach Claude sessions only and not dev servers. See [environment variables](/en/env-vars) for the full list of supported variables.

[Extended thinking](/en/common-workflows#use-extended-thinking-thinking-mode) is enabled by default, which improves performance on complex reasoning tasks but uses additional tokens. To disable thinking entirely, set `MAX_THINKING_TOKENS` to `0` in the local environment editor. On Opus 4.6 and Sonnet 4.6, any other `MAX_THINKING_TOKENS` value is ignored because adaptive reasoning controls thinking depth instead. To use a fixed thinking budget on these models, also set `CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING` to `1`.

### Remote sessions

@@ -491,7 +493,7 @@ This table shows the desktop app equivalent for common CLI flags. Flags not list

| `--verbose` | Not available. Check system logs: Console.app on macOS, Event Viewer → Windows Logs → Application on Windows |

| `--print`, `--output-format` | Not available. Desktop is interactive only. |

| `ANTHROPIC_MODEL` env var | Model dropdown next to the send button |

| `MAX_THINKING_TOKENS` env var | Set in shell profile; applies to local sessions. See [environment configuration](#environment-configuration). |

| `MAX_THINKING_TOKENS` env var | Set in the local environment editor. See [environment configuration](#environment-configuration). |

### Shared configuration

devcontainer +2 -0

@@ -32,6 +32,8 @@ Always maintain good security practices and monitor Claude's activities.

3. Open the repository in VS Code

4. When prompted, click "Reopen in Container" (or use Command Palette: Cmd+Shift+P → "Dev Containers: Reopen in Container")

Once the container finishes building, open a terminal in VS Code with `` Ctrl+` `` and run `claude` to authenticate and start your first session. The container has Claude Code preinstalled, so you can begin working immediately. Your project files are mounted into the container, and any code Claude writes appears in your local repository.

## Configuration breakdown

The devcontainer setup consists of three primary components:

platforms +4 -2

@@ -5,7 +5,7 @@ source: https://code.claude.com/docs/en/platforms.md

# Platforms and integrations

> Choose where to run Claude Code and what to connect it to. Compare the CLI, Desktop, VS Code, JetBrains, web, and integrations like Chrome, Slack, and CI/CD.

> Choose where to run Claude Code and what to connect it to. Compare the CLI, Desktop, VS Code, JetBrains, web, mobile, and integrations like Chrome, Slack, and CI/CD.

Claude Code runs the same underlying engine everywhere, but each surface is tuned for a different way of working. This page helps you pick the right platform for your workflow and connect the tools you already use.

@@ -20,8 +20,9 @@ Choose a platform based on how you like to work and where your project lives.

| [VS Code](/en/vs-code) | Working inside VS Code without switching to a terminal | Inline diffs, integrated terminal, file context |

| [JetBrains](/en/jetbrains) | Working inside IntelliJ, PyCharm, WebStorm, or other JetBrains IDEs | Diff viewer, selection sharing, terminal session |

| [Web](/en/claude-code-on-the-web) | Long-running tasks that don't need much steering, or work that should continue when you're offline | Anthropic-managed cloud, continues after you disconnect |

| Mobile | Starting and monitoring tasks while away from your computer | Cloud sessions from the Claude app for iOS and Android, [Remote Control](/en/remote-control) for local sessions, [Dispatch](/en/desktop#sessions-from-dispatch) to Desktop on Pro and Max |

The CLI is the most complete surface for terminal-native work: scripting, third-party providers, and the Agent SDK are CLI-only. Desktop and the IDE extensions trade some CLI-only features for visual review and tighter editor integration. The web runs in Anthropic's cloud, so tasks keep going after you disconnect. Mobile is a thin client into those same cloud sessions or into a local session via Remote Control, and can send tasks to Desktop with Dispatch.

You can mix surfaces on the same project. Configuration, project memory, and MCP servers are shared across the local surfaces.

@@ -62,6 +63,7 @@ If you're not sure where to start, [install the CLI](/en/quickstart) and run it

- [VS Code](/en/vs-code): the Claude Code extension inside your editor

- [JetBrains](/en/jetbrains): the extension for IntelliJ, PyCharm, and other JetBrains IDEs

- [Claude Code on the web](/en/claude-code-on-the-web): cloud sessions that keep running when you disconnect

- Mobile: the Claude app for [iOS](https://apps.apple.com/us/app/claude-by-anthropic/id6473753684) and [Android](https://play.google.com/store/apps/details?id=com.anthropic.claude) for starting and monitoring tasks while away from your computer

### Integrations

skills +18 -5

@@ -277,18 +277,31 @@ Here's how the two fields affect invocation and context loading:

In a regular session, skill descriptions are loaded into context so Claude knows what's available, but full skill content only loads when invoked. [Subagents with preloaded skills](/en/sub-agents#preload-skills-into-subagents) work differently: the full skill content is injected at startup.

### Restrict tool access

### Skill content lifecycle

Use the `allowed-tools` field to limit which tools Claude can use when a skill is active. This skill creates a read-only mode where Claude can explore files but not modify them:

When you or Claude invoke a skill, the rendered `SKILL.md` content enters the conversation as a single message and stays there for the rest of the session. Claude Code does not re-read the skill file on later turns, so write guidance that should apply throughout a task as standing instructions rather than one-time steps.

[Auto-compaction](/en/how-claude-code-works#when-context-fills-up) preserves invoked skills. When the conversation is summarized to free context, Claude Code re-attaches the most recent invocation of each skill after the summary (truncated if the skill is very large). If you invoke the same skill more than once, only the latest copy is carried forward through compaction.

If a skill seems to stop influencing behavior after the first response, the skill content is still present. The model is choosing other tools or approaches. Strengthen the skill's `description` and instructions so the model keeps preferring it, or use [hooks](/en/hooks) to enforce behavior deterministically.

### Pre-approve tools for a skill

The `allowed-tools` field grants permission for the listed tools while the skill is active, so Claude can use them without prompting you for approval. It does not restrict which tools are available: every tool remains callable, and your [permission settings](/en/permissions) still govern tools that are not listed.

This skill lets Claude run git commands without per-use approval whenever you invoke it:

```yaml

---

name: safe-reader

description: Read files without making changes

allowed-tools: Read Grep Glob

name: commit

description: Stage and commit the current changes

disable-model-invocation: true

allowed-tools: Bash(git add *) Bash(git commit *) Bash(git status *)

---

```

To block a skill from using certain tools, add deny rules in your [permission settings](/en/permissions) instead.

### Pass arguments to skills

Both you and Claude can pass arguments when invoking a skill. Arguments are available via the `$ARGUMENTS` placeholder.

statusline +16 -1

@@ -894,7 +894,7 @@ echo "$dirname [$model]"

## Tips

- **Test with mock input**: `echo '{"model":{"display_name":"Opus"},"context_window":{"used_percentage":25}}' | ./statusline.sh`

- **Test with mock input**: `echo '{"model":{"display_name":"Opus"},"workspace":{"current_dir":"/home/user/project"},"context_window":{"used_percentage":25},"session_id":"test-session-abc"}' | ./statusline.sh`

- **Keep output short**: the status bar has limited width, so long output may get truncated or wrap awkwardly

- **Cache slow operations**: your script runs frequently during active sessions, so commands like `git status` can cause lag. See the [caching example](#cache-expensive-operations) for how to handle this.

@@ -926,8 +926,23 @@ Community projects like [ccstatusline](https://github.com/sirmalloc/ccstatusline

**OSC 8 links not clickable**

- Verify your terminal supports OSC 8 hyperlinks (iTerm2, Kitty, WezTerm)

- Terminal.app does not support clickable links

- If link text appears but isn't clickable, Claude Code may not have detected hyperlink support in your terminal. This commonly affects Windows Terminal and other emulators not in the auto-detection list. Set the `FORCE_HYPERLINK` environment variable to override detection before launching Claude Code:

```bash theme={null}

FORCE_HYPERLINK=1 claude

```

In PowerShell, set the variable in the current session first:

```powershell theme={null}

$env:FORCE_HYPERLINK = "1"; claude

```

- SSH and tmux sessions may strip OSC sequences depending on configuration

- If escape sequences appear as literal text like `\e]8;;`, use `printf '%b'` instead of `echo -e` for more reliable escape handling

**Display glitches with escape sequences**