第1章:超越入门指南

  • Claude Code 是一个基于”读取-规划-执行-验证”循环的智能代理系统,其核心模式简单到只需350行代码即可实现——但它被包装在生产级的安全、权限和上下文管理机制中,这才是关键所在。

    Claude Code is an agentic loop (read-plan-act-verify) built on a pattern so simple it fits in 350 lines – but wrapped in production-grade safety, permissions, and context management that makes the difference.

  • 上下文窗口是最重要的单一约束——每次文件读取、命令输出和消息都会消耗有限的空间,当上下文填满时,性能会急剧下降。

    The context window is the single most important constraint – every file read, command output, and message consumes finite space, and performance degrades sharply when it fills.

  • “探索-规划-实现-提交”工作流程将廉价工作(探索、规划)前置,以减少昂贵工作(调试、回退)。

    The explore-plan-implement-commit workflow front-loads cheap work (exploration, planning) to reduce expensive work (debugging, reverting).

  • 权限模式是工作流程选择器:规划模式用于探索,自动接受用于可信实现,默认模式用于精确修改。使用 Shift+Tab 在会话期间循环切换。

    Permission modes are workflow selectors: use plan mode for exploration, auto-accept for trusted implementation, and default for surgical changes. Shift+Tab to cycle between them mid-session.

  • 研究表明开发者只能将0-20%的任务完全委托给AI;真正的价值在于迭代协作,而非”发射后不管”的自动化。

    Research shows developers can fully delegate only 0-20% of tasks; the real value is iterative collaboration, not fire-and forget automation.

  • 在组件层面评估Claude Code的适用性:前端脚手架和测试生成是立竿见影的成效;实时系统和安全关键逻辑需要人工协作。

    Assess Claude Code fit at the component level: frontend scaffolding and test generation are immediate wins; real-time systems and security-critical logic need human partnership.

  • 先尝试一次性完成,然后再协作——三分之一的成功率是一个特性而非缺陷,失败的首轮尝试会产生针对性指导所需的信息。

    Try one-shot first, then collaborate – the one-third immediate success rate is a feature, not a bug, and failed first attempts produce the information you need for targeted guidance.

  • 频繁提交,毫不犹豫地回退;回退到干净状态并重试几乎总是比修补错误方法更快。

    Commit frequently and revert without hesitation; reverting to a clean state and retrying is almost always faster than patching a wrong approach.

  • 明确约束Claude走向简洁;没有约束时,它会默认采用过度工程化的解决方案。

    Explicitly constrain Claude toward simplicity; without constraints, it defaults to over-engineered solutions.


第2章:权限与信任架构

  • 作用域层级是严格的——托管设置会覆盖一切,组织策略无法被绕过。

    The scope hierarchy is strict – Managed settings override everything, and there is no escape from organizational policy.

  • 权限规则的评估顺序为”拒绝→询问→允许”,采用首个匹配优先的语义,因此拒绝规则应写得更具体,允许规则可以写得更宽泛。

    Permission rules evaluate Deny before Ask before Allow, with first-match-wins semantics, so write deny rules narrow and allow rules broad.

  • 检查点覆盖直接文件编辑,但不包括bash命令的副作用——git才是你真正的安全网。

    Checkpoints cover direct file edits but not bash command side effects – git is your real safety net.

  • Devcontainer 提供网络隔离,但无法阻止恶意项目代码中的凭证泄露;采用分层防御,并将Anthropic的参考实现作为起点。

    Devcontainers provide network isolation but cannot prevent credential exfiltration from malicious project code; layer defenses, and use Anthropic’s reference implementation as a starting point.

  • 钩子系统包含13个事件,覆盖整个代理生命周期:PreToolUse、PostToolUse、PostToolUseFailure、PermissionRequest、Notification、SubagentStart、SubagentStop、Stop、TeammateIdle、TaskCompleted、PreCompact、SessionEnd等。

    The hook system has thirteen events covering the entire agentic lifecycle: PreToolUse, PostToolUse, PostToolUseFailure, PermissionRequest, Notification,SubagentStart, SubagentStop, Stop, TeammateIdle, TaskCompleted, PreCompact, SessionEnd, and more.

  • 三种钩子类型服务于不同的验证需求:命令钩子用于确定性检查,提示钩子用于LLM评估判断,代理钩子用于带文件访问的多轮调查。

    Three hook types serve different verification needs: command hooks for deterministic checks, prompt hooks for LLM evaluated judgments, and agent hooks for multi-turn investigation with file access.

  • 异步钩子在后台运行,不会阻塞Claude;用它们来运行测试套件和长时间验证。

    Async hooks run in the background without blocking Claude; use them for test suites and long-running validation.

  • 钩子会在启动时被快照——会话中修改会触发警告,并需要在 /hooks 菜单中审核后才能生效。

    Hooks are snapshotted at startup – mid-session modifications trigger warnings and require review in the /hooks menu before taking effect.

  • PreToolUse 钩子可以通过 updatedInput 在执行前修改工具输入,实现命令重写、安全标志注入和监控框架。

    PreToolUse hooks can modify tool input before execution via updatedInput , enabling command rewriting, safety flag injection, and monitoring harnesses.

  • MCP 服务器比原始bash命令提供更好的安全可见性;为你的敏感数据源构建MCP服务器。

    MCP servers provide better security visibility than raw bash commands for sensitive data access; build MCP servers for your sensitive data sources.

  • 智能工具正在使安全知识民主化;任何工程师现在都可以执行过去需要专家的安全审查,但攻击者也获得了同样的能力。

    Security knowledge is being democratized by agentic tools; any engineer can now perform reviews that once required specialists, but adversaries gain the same capabilities.

  • 从最大限制和模拟交易环境开始,然后在验证代理行为后逐步放宽约束。

    Start with maximum restriction and paper-trading environments, then relax constraints as you validate agent behavior


第3章:上下文工程

  • CLAUDE.md 的内容在每次请求时都会消耗上下文;保持其在500行以内,使用 /init 引导,并专注于Claude无法从代码中推断的信息。

    CLAUDE.md content costs context on every request; keep it under 500 lines, bootstrap with /init , and focus on information Claude cannot infer from your code.

  • 上下文成本对比表是你的预算指南:CLAUDE.md和MCP每次请求都消耗成本,技能成本较低(仅描述),子代理是隔离的,钩子免费。

    The context cost comparison table is your budget guide: CLAUDE.md and MCP cost every request, skills cost low (descriptions only), subagents are isolated, and hooks are free.

  • 将始终需要的规则放在CLAUDE.md中,任务特定的参考资料放在技能中;使用 disable-model-invocation: true 来实现零上下文成本的手动专用技能。

    Put always-needed rules in CLAUDE.md and task-specific reference material in skills; use disable-model-invocation: true for manual-only skills with zero context cost.

  • 使用 @path/to/import 语法保持CLAUDE.md精简,同时让Claude能够访问单独文件中的深层参考资料。

    Use @path/to/import syntax to keep CLAUDE.md lean while giving Claude access to deeper reference material in separate files.

  • CLAUDE.md 中的 Compact Instructions 在自动压缩后仍然保留;带 focus 指令的 /compact 指导保留内容;使用 /rewind 配合”Summarize from here”实现精准的上下文清理。

    Compact Instructions in CLAUDE.md survive auto-compaction; /compact with focus instructions directs what gets preserved; /rewind with “Summarize from here” enables surgical context cleanup.

  • /context 命令显示你的上下文预算去向——在抱怨Claude空间不足之前先审计它。

    The /context command shows where your context budget is going – audit it before blaming Claude for running out of space.

  • CLAUDE.md 适用于非代码领域:金融分析、数据科学、技术写作——任何跨会话工作且上下文需要持久化的领域。

    CLAUDE.md works for non-code domains: financial analysis, data science, technical writing – any domain where work spans sessions and context must persist.

  • 子代理的上下文隔离效果显著:14个子代理可以同时运行而不会耗尽主会话,因为它们的工作保留在独立的上下文窗口中。

    Subagent context isolation is dramatic: fourteen subagents can run without exhausting the main session because their work stays in separate context windows.

  • 会话结束时的 CLAUDE.md 更新会创建复利式的机构记忆;将其扩展为持续改进循环,同时捕获工作流程改进和知识。

    End-of-session CLAUDE.md updates create compounding institutional memory; extend this to a continuous improvement loop by capturing workflow improvements alongside knowledge.

  • 针对特定观察到的错误的CLAUDE.md补充比通用指令更有效。

    Targeted CLAUDE.md additions that address specific observed errors are more effective than general instructions.

  • 规格文档在上下文压缩和会话重启后仍然保留,使它们成为多会话项目的关键。

    Spec documents survive context compaction and session restarts, making them essential for multi-session projects.

  • 会话分叉(--fork-session)在保留有价值上下文的同时,让你能够将工作引向新方向。

    Session forking ( –fork-session ) preserves valuable context while letting you take work in a new direction.


第4章:多智能体编排

  • 子代理提供的更多是上下文隔离而非并行性——将冗长操作路由到子代理,以保持编排器的上下文精简。

    Subagents provide context isolation more than parallelism – route verbose operations through subagents to keep the orchestrator’s context lean.

  • 严格的两级层级(编排器和工作者,没有中层管理)是防止递归混乱的刻意设计选择。

    The strict two-level hierarchy (orchestrator and workers, no middle management) is a deliberate design choice that prevents recursive chaos.

  • 存在六种内置子代理类型(Explore、Plan、General-purpose、Bash、Claude Code Guide、statusline-setup),自定义子代理支持 maxTurns、mcpServers、技能预加载、持久内存和生命周期钩子。

    Six built-in subagent types exist (Explore, Plan, General-purpose, Bash, Claude Code Guide, statusline-setup), and custom subagents support maxTurns , mcpServers , skills preloading, persistent memory , and lifecycle hooks.

  • 代理团队提供七种协调原语(TeamCreate、TaskCreate、TaskUpdate、TaskList、Task、SendMessage、TeamDelete),包含四种消息类型和三种显示模式。

    Agent teams provide seven coordination primitives (TeamCreate, TaskCreate, TaskUpdate, TaskList, Task, SendMessage, TeamDelete) with four message types and three display modes.

  • 在并行化之前始终先规划;1万个token的规划可以防止50万个token的浪费性冲突执行。

    Always plan before parallelizing; 10,000 tokens of planning prevents 500,000 tokens of wasted, conflicting execution.

  • 对编排使用昂贵模型,对执行使用廉价模型——这种分工在经济效益上明显更有利。

    Use expensive models for orchestration and cheap models for execution – the economics heavily favor this split.

  • 多智能体个性冲突是真实存在的,有时通过简化回单智能体来解决;一个四层交易层级因复杂性适得其反而退化为一个。

    Multi-agent personality conflicts are real and sometimes resolve by simplifying back to a single agent; a four-agent trading hierarchy degraded to one because complexity was counterproductive.

  • 任务系统(Ctrl+T,存储为 .claude/tasks/ 中的JSON)通过 CLAUDE_CODE_TASK_LIST_ID 跨会话协调工作,无需完整代理团队即可实现多会话编排。

    The task system (Ctrl+T, stored as JSON in .claude/tasks/ ) coordinates work across sessions via CLAUDE_CODE_TASK_LIST_ID , enabling multi-session orchestration without full agent teams.

  • 从单智能体开始,仅当你有证据表明单智能体不足时才添加智能体。

    Start with a single agent and add agents only when you have evidence that the single agent is insufficient.

  • 最简单的多智能体模式——不同终端中的多个独立实例——往往是最有效的。

    The simplest multi-agent pattern – multiple independent instances in different terminals – is often the most effective


第5章:MCP——将Claude Code连接到一切

  • 每个MCP工具定义在会话开始时都会消耗上下文token;使用 /mcp 来测量,并使用工具搜索来推迟你暂时不需要的内容。

    Every MCP tool definition costs context tokens at session start; use /mcp to measure and tool search to defer what you do not need immediately.

  • MCP资源(@server:resource 语法)将外部数据与本地文件置于同等地位,支持在提示中直接引用。

    MCP resources ( @server:resource syntax) put external data on the same footing as local files, enabling direct references in prompts.

  • MCP连接静默失败——在会话开始时以及Claude行为异常时运行 /mcp

    MCP connections fail silently – run /mcp at session start and whenever Claude’s behavior changes unexpectedly.

  • 后台子代理无法使用MCP工具;相应设计工作流或在主代理中预取数据。

    Background subagents cannot use MCP tools; design workflows accordingly or pre-fetch data in the main agent.

  • mcp__server__tool 这样的钩子模式支持MCP工具调用的日志记录、验证和转换;双下划线命名是Claude Code使用的精确内部格式。

    Hook patterns like mcpservertool enable logging, validation, and transformation of MCP tool invocations; the double-underscore naming is the exact internal format Claude Code uses.

  • MCP服务器将凭证保留在Claude的上下文之外,使它们在敏感数据访问方面本质上比等效bash命令更安全。

    MCP servers keep credentials out of Claude’s context, making them categorically more secure than equivalent bash commands for sensitive data access.

  • 托管设置(enableAllProjectMcpServersenabledMcpjsonServersdisabledMcpjsonServers)在用户级别控制服务器批准;allowedMcpServersdenedMcpServers 在组织级别强制执行策略。

    Managed settings ( enableAllProjectMcpServers , enabledMcpjsonServers , disabledMcpjsonServers ) control server approval at the user level; allowedMcpServers and deniedMcpServers enforce policy at the organizational level.

  • .mcp.json 提交到版本控制,让一个人的配置工作惠及整个团队。

    Commit .mcp.json to version control so one person’s configuration effort benefits the entire team.

  • 三层架构(AI创建、可视化展示、无代码优化)通过将AI限制在构思阶段来最小化token成本,MCP在该阶段提供结构化数据访问。

    The three-layer architecture (AI creation, visual display, no-code refinement) minimizes token costs by restricting AI to the ideation phase, with MCP providing structured data access in that layer.

  • 在大规模(40+工具)时,按域分组服务器、延迟加载和严格命名约定可防止上下文膨胀和工具冲突。

    At scale (40+ tools), server-per-domain grouping, deferred loading, and strict naming conventions prevent context bloat and tool collisions.

  • 对于受监管环境,首先对数据进行分类,然后选择访问架构:敏感数据使用本地文件系统,受限系统集成使用MCP,非敏感研究使用直接上下文。

    For regulated environments, classify data first then choose the access architecture: local filesystem for sensitive data, MCP for restricted system integrations, direct context for non-sensitive research


第6章:CI/CD与无人值守自动化

  • -p 标志将Claude Code从交互式工具转变为可组合的Unix实用程序,它读取stdin、写入stdout,可与任何管道集成——cat error.log | claude -p 'explain' > output.txt 就是一个完整的诊断工作流。

    The -p flag transforms Claude Code from an interactive tool into a composable Unix utility that reads stdin, writes stdout, and integrates with any pipeline – cat error.log | claude -p ‘explain’ > output.txt is a complete diagnostic workflow.

  • 四个系统提示标志(--system-prompt--system-prompt-file--append-system-prompt--append-system-prompt-file)提供了从内联覆盖到版本控制附加提示的频谱——CI场景优先使用基于文件的变体。

    Four system prompt flags ( –system-prompt , –system-prompt-file , –append-system-prompt , –append system-prompt-file ) provide a spectrum from inline overrides to version-controlled additive prompts – prefer file-based variants for CI.

  • 使用 --allowedTools 的扇出模式实现安全的批处理:遍历文件,按调用限定权限,聚合JSON结果。

    The fan-out pattern with –allowedTools enables safe batch processing: loop through files, scope permissions per invocation, and aggregate JSON results.

  • 异步钩子在每次写入后后台运行测试;TaskCompleted钩子阻塞任务完成直到测试通过;基于代理的Stop钩子生成子代理在Claude完成前验证质量。

    Async hooks run tests in the background after every write; TaskCompleted hooks block task completion until tests pass; agent-based Stop hooks spawn subagents to verify quality before Claude finishes.

  • 预提交钩子(类型检查、lint、测试)创建自动化反向压力,使Claude在提交边界处自我纠正——你不再是质量瓶颈。

    Pre-commit hooks (type check, lint, test) create automated backpressure that makes Claude self-correct at the commit boundary – you stop being the quality bottleneck.

  • 带有网络隔离的Devcontainer和 --dangerously-skip-permissions 是CI中安全无人值守执行的标准模式。

    Devcontainers with network isolation and –dangerously-skip-permissions are the standard pattern for safe unattended execution in CI.

  • --from-pr 标志恢复与特定PR关联的会话,在审查周期中保留完整对话上下文。

    The –from-pr flag resumes sessions linked to a specific pull request, preserving full conversation context across review cycles.

  • CI自动修复——构建失败的自动修复——是最受期待的能力缺口,预计2026年中期推出。

    CI auto-fix – automatic remediation of build failures – is the most anticipated missing capability, expected in mid-2026.

  • 从交互式使用开始,识别重复模式,然后逐步将它们迁移到无人值守执行。

    Start with interactive usage, identify repetitive patterns, and progressively move them to headless execution


第7章:正确的IDE集成

  • Claude Code的引擎与表面无关——CLAUDE.md、设置、MCP服务器和权限在终端、编辑器扩展、桌面、Web和移动端的工作方式完全相同。

    Claude Code’s engine is surface-independent – CLAUDE.md, settings, MCP servers, and permissions work identically across terminal, editor extensions, desktop, web, and mobile.

  • 桌面应用提供带内联评论的视觉差异审查、带Git工作树隔离的并行会话、多种权限模式、SSH连接和外部服务连接器。

    The desktop application provides visual diff review with inline comments, parallel sessions with Git worktree isolation, multiple permission modes, SSH connections, and connectors to external services.

  • 云会话在托管VM上运行,关闭浏览器后仍持续存在,可从任何设备监控——使用 & 前缀将工作分派到云端,使用 /teleport 将其拉回。

    Cloud sessions run on managed VMs, persist after you close the browser, and can be monitored from any device – use the & prefix to dispatch work to the cloud and /teleport to pull it back.

  • 终端页脚显示带彩色下划线的实时PR审查状态(绿色=已批准,黄色=待处理,红色=请求更改)——每60秒更新。

    The terminal footer shows live PR review status with colored underlines (green=approved, yellow=pending, red=changes requested) – updated every 60 seconds.

  • 技能是跨代理标准:npx skills add 安装可在多种AI编码工具中使用的技能,防止工作流锁定。

    Skills are a cross-agent standard: npx skills add installs skills that work across multiple AI coding tools, preventing workflow lock-in.

  • 内联自动完成的差距是架构性的,不是偶然的;Claude Code用于代理工作、专用自动完成工具用于编辑的混合设置是稳定的最终状态。

    The inline autocomplete gap is architectural, not accidental; a hybrid setup with Claude Code for agentic work and a dedicated autocomplete tool for editing is the stable end state.

  • 终端优先的工作流在经验丰富的用户中占主导地位,因为终端更快、更灵活、与IDE无关。

    Terminal-first workflows dominate among experienced users because the terminal is faster, more flexible, and IDE-agnostic.

  • 首先将学习时间投入到终端工作流——它们可转移到每个表面和每个IDE。

    Invest learning time in terminal workflows first – they transfer to every surface and every IDE.


第8章:代理工具的提示工程

  • 验证标准(测试、预期输出、视觉目标)是杠杆率最高的单一提示技术——它们将一次性生成转换为迭代优化。

    Verification criteria (tests, expected output, visual targets) are the single highest-leverage prompting technique – they convert one-shot generation into iterative refinement.

  • 探索使用模糊提示,实现使用具体提示;最常见的失败模式是实现阶段的模糊提示。

    Use vague prompts for exploration and specific prompts for implementation; the most common failure mode is vague implementation prompts.

  • 你的lint配置、类型检查器和测试套件都是你提示的一部分——严格的工具创建自动改善Claude输出的反向压力。

    Your linting configuration, type checker, and test suite are part of your prompt – strict tooling creates backpressure that improves Claude’s output automatically.

  • 要求Claude在实现之前先规划;1万个token的规划可以防止50万个token的错误方向实现。

    Ask Claude to plan before implementing; a 10,000-token plan prevents a 500,000-token misdirected implementation.

  • 明确陈述约束——什么不能改变、权衡偏好、灵活性边界——因为隐式约束就是不可见约束。

    State constraints explicitly – what cannot change, trade-off preferences, and flexibility boundaries – because implicit constraints are invisible constraints.

  • 穷尽的提问(”开始前请详尽地问我问题”)可将复杂任务的首过质量提高30-50%。

    Exhaustive questioning (“ask me questions exhaustively before starting”) improves first-pass quality by 30-50% on complex tasks.

  • 在模糊的代码库中,使用精确文件路径和明确排除的极端特异性可防止错误组件修改。

    In ambiguous codebases, extreme specificity with exact file paths and explicit exclusions prevents wrong-component modifications


第9章:大型与遗留代码库的工作

  • Git工作树支持在同一仓库的不同分支上并行运行Claude Code会话,会话之间互不干扰。

    Git worktrees enable parallel Claude Code sessions on different branches of the same repository, with no interference between sessions.

  • Explore子代理(Haiku、只读、隔离上下文)足够便宜,可以广泛用于浏览不熟悉的代码。

    The Explore subagent (Haiku, read-only, isolated context) is cheap enough to use liberally for navigating unfamiliar code.

  • 并行研究子代理在几分钟内而非几小时内提供对大型系统的多视角理解。

    Parallel research subagents provide multi-perspective understanding of large systems in minutes instead of hours.

  • 使用 blocks/blockedBy 的任务依赖管理支持复杂迁移和重构的波浪式执行。

    Task dependency management with blocks / blockedBy enables wave-based execution for complex migrations and refactors.

  • 遗留重写成功的条件是:目标架构清晰、现有代码作为规范、工作是模块化的。

    Legacy rewrites succeed when the target architecture is clear, the existing code serves as the specification, and the work is modular.

  • Claude Code在大型代码库上的有效性受限于任务清晰度和上下文质量,而非代码库大小——一个记录在案的案例在1250万行代码库上达到了99.9%的数字精度。

    Claude Code’s effectiveness on large codebases is limited by task clarity and context quality, not by codebase size – one documented case achieved 99.9% numerical accuracy across a 12.5-million-line codebase.

  • 遗留系统的老语言障碍正在消失;代理可以读取、理解和翻译业务逻辑,而这是大多数现代开发者无法做到的。

    The language barrier for legacy systems written in older languages is disappearing; agents can read, understand, and translate business logic that most modern developers cannot.

  • 三输入工作区模式(原始数据导出、补充文本文件、详细目标提示)将文件系统驱动的分析转变为结构化、可重复的过程。

    The three-input workspace pattern (raw data exports, supplementary text file, detailed goal prompt) turns filesystem-driven analysis into a structured, repeatable process.

  • 规格驱动的重构——先研究,精确指定,并行执行——比直接跳到代码产生更好的结果,即使开发者本可以手动编写。

    Spec-driven refactoring – research first, specify precisely, execute in parallel – produces better results than jumping straight to code, even when the developer could have written it manually.

  • CLAUDE.md可以替代用于入职的传统数据目录,通过在每次会话结束时的持续改进循环来维护。

    CLAUDE.md can replace traditional data catalogs for onboarding, maintained through a continuous improvement loop at the end of each session.

  • 当代理将技术债务的经济性从”不值得花三周工程师时间”变为”三小时加审查”时,技术债务变得可以系统性地消除。

    Technical debt becomes systematically eliminable when agents change the economics from “not worth three weeks of an engineer’s time” to “three hours plus review.”


第10章:故障模式与恢复

  • 上下文耗尽是最常见的故障模式——注意重复建议、遗忘指令和连贯性下降等症状,这些都是上下文窗口已满的表现。

    Context exhaustion is the most common failure mode – watch for repeated suggestions, forgotten instructions, and degraded coherence as symptoms of a full context window.

  • Bash环境变量在命令之间不会持久;在每个命令中重新建立状态,或使用设置级别的环境配置。

    Bash environment variables do not persist between commands; re-establish state in each command or use settings-level environment configuration.

  • 检查点系统提供四种回退选项(恢复代码和对话、仅恢复对话、仅恢复代码、从此处总结),但检查点只跟踪直接文件编辑——不跟踪bash命令的文件操作;git提交是唯一可靠的通用回退机制。

    The checkpoint system offers four rewind options (restore code and conversation, restore conversation only, restore code only, summarize from here), but checkpoints track only direct file edits – not bash-command file operations; git commits are the only reliable universal rewind mechanism.

  • 五种命名反模式——厨房水槽会话、反复纠正、过度指定的CLAUDE.md、信任-验证差距、无限探索——每种都有具体的即时修复方法。

    The five named anti-patterns – kitchen sink session, correcting over and over, over-specified CLAUDE.md, trust-then-verify gap, infinite exploration – each have specific, immediate fixes.

  • 33天自主交易实验在一个连续过程中展示了每个类别的代理失败:过度自信、相关性风险、集中风险、治理悖论和自信输出的根本不可靠性。

    The 33-day autonomous trading experiment demonstrates every category of agent failure in one continuous arc: overconfidence, correlation risk, concentration risk, governance paradoxes, and the fundamental unreliability of confident output.

  • “氛围交易”将氛围编码陷阱扩展到非编码领域——对代理生成结果的虚假信心在反馈循环缓慢且赌注是财务的情况下更危险。

    “Vibe trading” extends the vibe coding trap to non-coding domains – false confidence in agent-generated results is more dangerous when feedback loops are slow and stakes are financial.

  • 投资于验证基础设施(测试、类型、lint)以提高首次尝试成功率——自动化反馈是杠杆率最高的改进。

    Invest in verification infrastructure (tests, types, linting) to increase first-attempt success rates – automated feedback is the highest-leverage improvement.

  • 对卡住的任务使用老虎机方法:提交、限时30分钟、接受或重启——从头开始通常比试图修复中途错误更好。

    Use the slot machine approach for stuck tasks: commit, time-box to thirty minutes, accept-or-restart – starting over often beats trying to fix mistakes mid-stream.

  • Claude默认过度复杂解决方案;提示和CLAUDE.md中的显式简洁约束会产生明显更好的输出。

    Claude defaults to over-complex solutions; explicit simplicity constraints in prompts and CLAUDE.md produce meaningfully better output.

  • 每种恢复策略都依赖于频繁提交——没有检查点纪律,你唯一的选择就是纠正,而纠正是最不可靠的路径。

    Every recovery strategy depends on frequent commits – without checkpoint discipline, your only option is correction, and correction is the least reliable path.


第11章:团队采用模式

  • 遵循引导式采用路径(问答、小修复、规划模式、完全自主),持续两到四周,而不是立即跳到完全自主。

    Follow the guided adoption path (Q&A, small fixes, plan mode, full autonomy) over two to four weeks rather than jumping to full autonomy immediately.

  • 将CLAUDE.md提交到版本控制;随着团队添加学习成果,其价值会复利增长,防止每个未来开发者的错误。

    Commit CLAUDE.md to version control; it compounds in value as the team adds learnings, preventing errors for every future developer.

  • 不同团队以根本不同的方式使用Claude Code:安全团队的”提交即走”自主模式看起来与产品开发团队对核心业务逻辑的同步监督完全不同。

    Different teams use Claude Code in fundamentally different ways: a security team’s “commit as you go” autonomous pattern looks nothing like a product development team’s synchronous supervision of core business logic.

  • 非技术团队(法律、营销、设计、财务)提取截然不同的价值:律师在一小时内构建自定义无障碍工具,营销团队将广告创建从两小时压缩到十五分钟,设计师直接在代码库中进行状态管理更改。

    Non-technical teams (legal, marketing, design, finance) extract categorically different value: a lawyer building a custom accessibility tool in one hour, a marketing team compressing ad creation from two hours to fifteen minutes, a designer making state management changes directly in the codebase.

  • 企业部署需要在按席位和云提供商计费之间选择,配置LLM网关以集中跟踪使用情况,并将组织范围的CLAUDE.md部署到系统目录。

    Enterprise deployment requires choosing between per-seat and cloud-provider billing, configuring LLM gateways for centralized usage tracking, and deploying organization-wide CLAUDE.md to system directories.

  • 维护到扩展的渐进过程(第一到第三个月用于MVP维护,第三到第六个月用于功能扩展,第六个月及以后用于加固和重构)既防止过早优化也防止过早放弃。

    The maintenance-to-scale progression (months one through three for MVP maintenance, three through six for feature expansion, six and beyond for hardening and refactoring) prevents both premature optimization and premature abandonment.

  • 动态激增人员配置——在不熟悉传统生产力下降的情况下将工程师转移到不熟悉的代码库——当Claude Code将入职时间从数周压缩到数小时时成为可能。

    Dynamic surge staffing – moving engineers onto unfamiliar codebases without the traditional productivity dip – becomes possible when Claude Code collapses onboarding time from weeks to hours.

  • 自定义斜杠命令将团队工作流编码为可发现、可复用的约定;一个安全团队占其组织整个单体仓库中所有自定义斜杠命令的一半。

    Custom slash commands encode team workflows as discoverable, reusable conventions; one security team accounts for half of all custom slash commands in their organization’s entire monorepo.

  • 通过VCS共享项目设置(.claude/settings.json),让克隆仓库就能配置整个Claude Code环境。

    Share project settings via VCS ( .claude/settings.json ) so that cloning the repo configures the entire Claude Code environment.


第12章:AI辅助开发的经济学与战略

  • 提示缓存通过每个会话仅收取一次全价,使丰富的上下文(CLAUDE.md、MCP工具、系统提示)在经济上可行。

    Prompt caching makes rich context (CLAUDE.md, MCP tools, system prompts) economically viable by charging full price only once per session.

  • 为任务匹配合适的模型:Opus用于架构,Sonnet用于实现,Haiku用于探索——并使用子代理在会话内混合模型。使用努力级别(低/中/高)和 CLAUDE_CODE_EFFORT_LEVEL 环境变量控制推理成本。

    Match models to tasks: Opus for architecture, Sonnet for implementation, Haiku for exploration – and use subagents to mix models within a session. Control reasoning costs with effort levels (low/medium/high) and the CLAUDE_CODE_EFFORT_LEVEL environment variable.

  • 三个复利乘数——代理能力、编排改进和积累的人类经验——产生阶跃式增益而非线性增益。

    Three compounding multipliers – agent capabilities, orchestration improvements, and accumulated human experience – produce step-function gains rather than linear ones.

  • 70-90%的时间线压缩改变项目可行性:八周的生产平台而非三到六个月,两到三天的功能而非两到三周。

    Timeline compression of 70-90% changes project viability: an eight-week production platform instead of three to six months, a two-to-three-day feature instead of two to three weeks.

  • 混合工具链(Claude Code用于仓库级推理,专用内联工具用于按键级自动完成)的表现优于单独使用任一工具,因为它们在不同交互尺度上运作。

    A hybrid toolchain (Claude Code for repo-scale reasoning, a dedicated inline tool for keystroke-level autocomplete) outperforms either tool alone because they operate at different interaction scales.

  • Claude Code的竞争优势是在大型代码库中进行仓库级规划和补丁生成;其他工具在内联自动完成、可视化多智能体管理和CI集成方面领先。

    Claude Code’s competitive strength is repo-scale planning and patch generation in large codebases; other tools lead in inline autocomplete, visual multi-agent management, and CI integration.

  • 每个人都变得更全栈:后端开发者在几小时内构建前端应用,设计师进行状态管理更改,非技术团队构建全新能力。

    Everyone becomes more full-stack: backend developers build frontend applications in hours, designers make state management changes, and non-technical teams build entirely new capabilities.

  • 当代理以压缩成本处理积压任务时,技术债务变得可以系统性地解决,处理那些以前从未超过优先级阈值的任务。

    Technical debt becomes systematically addressable when agents work through backlogs at compressed cost, tackling tasks that never cleared the priority threshold before.

  • 2026年的四个优先事项:掌握多智能体协调、通过自动化审查扩展监督、将代理编码扩展到工程之外、从第一天起嵌入安全架构。

    Four priorities for 2026: master multi-agent coordination, scale oversight through automated review, extend agentic coding beyond engineering, and embed security architecture from day one.