5 ファイル変更 +21 -3

この更新の概要

Agent SDKのクイックスタートにおいて、TypeScriptとPythonの両方の実行手順が明確化されました。Amazon Bedrock利用時のWebSearchツールの制限に関する注意喚起が追加されています。Fast Modeのコスト構造とプロンプトキャッシュの仕組みについて詳細な解説が追加され、v2.1.86以降での動作仕様が明記されました。また、Gitリポジトリ設定における認証方法とセルフホストサービスへの対応についても追記されています。

agent-sdk/quickstart +8 -2

Agentの実行コマンドが更新され、TypeScript版のnpx tsxとPython版のuv runによる実行手順がそれぞれ追加されました。

@@ -178,11 +178,17 @@ This example uses streaming to show progress in real-time. If you don't need liv
Your agent is ready. Run it with the following command:
```bash theme={null}
python3 agent.py
npx tsx agent.ts
```
```bash theme={null}
npx tsx agent.ts
uv run agent.py
```
With your virtual environment still activated:
```bash theme={null}
python agent.py
```
After running, check `utils.py`. You'll see defensive code handling empty lists and null users. Your agent autonomously:
amazon-bedrock +1 -0

Amazon Bedrock経由で利用する場合、WebSearchツールが利用不可である旨の制限事項が追記されました。

@@ -209,6 +209,7 @@ When enabling Bedrock for Claude Code, keep the following in mind:
* `AWS_REGION` is a required environment variable. Claude Code does not read from the `.aws` config file for this setting.
* When using Bedrock, the `/logout` command is unavailable since authentication is handled through AWS credentials.
* The WebSearch tool is not available on Bedrock. See [WebSearch tool behavior](/en/tools-reference#websearch-tool-behavior).
* You can use settings files for environment variables like `AWS_PROFILE` that you don't want to leak to other processes. See [Settings](/en/settings) for more information.
### 4. Pin model versions
fast-mode +1 -1

Fast Modeを会話の途中で有効化した際の初回コスト発生の仕組みと、その後の切り替え時のキャッシュ挙動について説明が詳しくなりました。

@@ -59,7 +59,7 @@ Fast mode has higher per-token pricing than standard Opus, with the multiplier v
Fast mode pricing is flat across the full 1M token context window. For the standard Opus rate to compare against, see the [Claude pricing reference](https://platform.claude.com/docs/en/about-claude/pricing).
When you switch into fast mode mid-conversation, you pay the full fast mode uncached input token price for the entire conversation context. This costs more than if you had enabled fast mode from the start.
The first time you enable fast mode in a conversation, you pay the full fast mode uncached input token price for the entire conversation context. The deeper into a conversation you are, the more this costs, so enabling fast mode from the start is cheaper. The cost applies once per conversation, so toggling fast mode off and on again later does not repeat it. For the mechanism, see [how fast mode interacts with the prompt cache](/en/prompt-caching#turning-on-fast-mode).
## Decide when to use fast mode
prompt-caching +9 -0

Fast Mode有効化時のキャッシュキーの変化と、v2.1.86以降でトグル操作を行ってもキャッシュが維持される仕組みが解説されています。

@@ -53,6 +53,7 @@ These actions cause the next request to miss part or all of the cache. You see a
- [Switching models](#switching-models)
- [Changing effort level](#changing-effort-level)
- [Turning on fast mode](#turning-on-fast-mode)
- [Connecting or disconnecting an MCP server](#connecting-or-disconnecting-an-mcp-server)
- [Enabling or disabling a plugin](#enabling-or-disabling-a-plugin)
- [Denying an entire tool](#denying-an-entire-tool)
@@ -69,6 +70,14 @@ The [`opusplan` model setting](/en/model-config#opusplan-model-setting) resolves
The cache is keyed by [effort level](/en/model-config#adjust-effort-level) as well as model, so switching with `/effort` means the next request reads the entire conversation history with no cache hits. Once a conversation has started, Claude Code shows a confirmation dialog before applying an effort change that would invalidate the cache. A change that resolves to the same level already in effect, such as setting the model's default explicitly, skips the dialog and keeps the cache.
### Turning on fast mode
Enabling [fast mode](/en/fast-mode) adds a request header that is part of the cache key, so the next request reads the entire conversation history with no cache hits. Those uncached input tokens are billed at [fast mode rates](/en/fast-mode#understand-the-cost-tradeoff), which is why turning it on at the start of a session costs less than turning it on deep into a long one. Enabling fast mode from a non-Opus model also [switches your model](#switching-models), which starts a fresh cache on its own.
The cost applies once per conversation. After the first fast mode turn, Claude Code keeps sending the header and varies only the request's speed setting, which is not part of the cache key. Turning fast mode off, the [automatic fallback to standard speed](/en/fast-mode#handle-rate-limits) after a rate limit, and turning it back on later all keep the cache. `/clear` and `/compact` reset this, since they rebuild the cache at those points anyway.
Keeping the header across toggles requires Claude Code v2.1.86 or later. On earlier versions, every fast mode toggle and rate-limit fallback invalidates the cache.
### Connecting or disconnecting an MCP server
Tool definitions sit in the system prompt layer, so the cache invalidates when the set of tool definitions in the request changes between turns. Whether an [MCP server](/en/mcp) change does this depends on whether its tools are deferred by [tool search](/en/mcp#scale-with-mcp-tool-search) or loaded into the prefix:
settings +2 -0

Gitソースタイプにおいて、SSHキーや環境変数を利用した認証方法、およびセルフホストのGitLabやBitbucketへの対応について追記されました。

@@ -681,6 +681,8 @@ Defines additional marketplaces that should be made available for the repository
- `hostPattern`: regex pattern to match marketplace hosts (uses `hostPattern`)
- `settings`: inline marketplace declared directly in settings.json without a separate hosted repository (uses `name` and `plugins`)
The `git` source type works with any git hosting service, including self-hosted GitLab and Bitbucket. Claude Code clones the repository with the same authentication that `git clone` would use on that machine: configured credential helpers, SSH keys, or a host-specific token environment variable. See [Private repositories](/en/plugin-marketplaces#private-repositories) for setup details.
For `github` and `git` sources, set `"skipLfs": true` inside the `source` object (alongside `repo` or `url`) to skip Git LFS downloads when Claude Code clones or updates the marketplace repository. LFS pointer files remain as pointers instead of downloading their content. Use this when the repository contains large LFS objects unrelated to plugin content. Requires Claude Code v2.1.153 or later.
Each marketplace entry also accepts an optional `autoUpdate` Boolean. Set `"autoUpdate": true` alongside `source` to make Claude Code refresh that marketplace and update its installed plugins at startup. When omitted, official Anthropic marketplaces default to `true` and all other marketplaces default to `false`. See [Configure auto-updates](/en/discover-plugins#configure-auto-updates).