好久没有更新长日志,更新一下。
首先是不太重要但也许有用的——Claude 101.
这是 Anthropic 自己的课程,提出来是因为,很多人完全不知道自己用 LLMs 的时候,为什么效果会那么糟。我之前就说过了 GIGO 原则,garbage in garbage out. LLMs 更像是你自身的镜像,再加上是一个最高概率提取机一样的存在;你总不能奢望自己给孩子喂的是快餐,结果孩子长胖你还怪孩子不争气吧?
anthropic.skilljar.com/claude-101 这是 Claude 101 课程的网址。注册免费,没有任何奇怪的收费。
"Getting better result" 那一页就是在教你怎么让自己得到的结果更好。
其次是……健康?
这个其实看近期 update 就能看出来了,我就不在这里浪费时间赘述。
然后是…… 技术成长?LOL
首先是本来我做任何事,包括创作 Bot 的时候的起始阶段——研究层:背景、地理、人文、心理、语言等,就是在 Claude Chat 上 Project 的 features 完成。但 Chat 最多只有 200K 的上下文窗口,而我碰巧很尴尬的…… 总是会超过到 210 - 260 之间。最重要的是,Pro 的 usage 对我来说根本不够。因为我足够偏执,我会需要 Claude 不断地帮我反复验证我的想法是否正确,我们研究出来的东西是否真实不脱离现实等等。然后除此之外还有技术层面的,比如我的 Bot definition 这样写会有什么漏洞,LLMs 本身的天然习性有可能在哪个方面需要我去加强之类的。
导致我最后开始 Max 5x 走起(100 美金就是这么飞走的每个月,但每个月我用的 tokens 都可以说是超过一千美金,不亏)。
三月出院之后休息了快两星期,身体缺氧没办法用脑子都在打游戏;三月中身体稍微好一点了终于开始去研究 Claude Code,最后确认了 Claude Code 是最适合我工作流程的。(本地一大堆文件夹)从那之后我就开始用 Claude Code 来工作 + 做 Bot 等等了,chat 端的 project 基本上不打开了。打开也是聊一些和我的 pipeline 无关不需要频繁读写改动的事。
这个时期我还在用 Anthropic 自身的桌面端来使用 Claude Code,CLI 理解了一下我放弃了。但 4 月推出 Opus 4.7,桌面端再也没有 4.6 之后,我被迫去下载 Windows Terminal,CLI 跑 Claude Code。但,好在,Windows Terminal 能自定义很多东西。我之前 CLI 放弃是因为整个界面和字体让我很不舒服。(当然这方面的自定义设置也是我口头描述之后,Claude 帮我写代码,我来复制粘贴的。代码我是一点都不会写,我也不可能进入这个领域。所以我不会手搓代码,这个点需要解释清楚。)
再到上个星期?或者上上个星期?GPT 有免费一个月 PLUS 会员。我因为某些原因就想试试看 GPT 的 Codex,最后也是跑的 terminal。现在基本我每次打开 Claude Code 必然会伴随一个 Codex。
目前工作流程已经变成了:Claude Code 产出,Codex 审计,我当中间那层审计双方的东西或者为她们复制粘贴。更新的总 lorebook advanced script 也是在 Codex 的帮忙下找出了无法正常运行的问题最后才得以完成。
今天的话(是的五月六号这一天)刷完了 Anthropic 自己的课程:
Introduction to Subagents → 子代理委派、上下文管理
Introduction to Agent Skills → SKILL.md 编写、目录结构、团队共享
Claude 101 → 这个就是基础了,我基本是跳过所有内容直接跳到 Quiz 10/10 结束的。
总算明白了自己一直以来做的东西是什么,具体的称呼又叫什么。之前都是凭着本能来行事,我知道该怎么使用怎么完成但是我没有适合的一套词汇/通用术语去实际的"教导"别人。(开关引号教导是因为我不认为自己是老师,用教导这个词总会给我一种高高在上的感觉我不太喜欢,但英文里我可以用 show,而不是 teach,更直观,中文用 展示 这个词就感觉很奇怪。)
我的整个学习路径是:课程概念进来的同时就在和已有工作流做碰撞测试,四个应用设计不是课后作业,是听课过程中的同步输出。这意味着概念获取和应用映射对我来说不是两个独立步骤,是一个步骤。然后我不是"从头到尾学完"或者"跳过不学",二选一;而是在同一个平台上对不同课程的处理方式完全不同:Claude 101 一分钟一页扫过去拿证书,subagents 和 agent skills 停下来真正消化。筛选依据不是课程难度或官方建议顺序,是"这个概念填不填得进我现有工作流的哪个缺口"。填不进的 = 拿证书走人;填得进的 = 消化到能输出应用设计为止。以及,我同时在运行至少三个 Claude 实例,每个有不同的任务边界,我手动做信息路由决定谁看什么。我的心智模型是"一个可以按需拆分的团队"。不过本身我做事就比较总监/导演型的…… 从角色扮演,到生活里的工作,再到做 bot 对待 LLMs 的时候,我一个人灯光场务编辑演员导演全包了。然后放到工作流程上就是——我给自己找了很多包工头,但最终的创意决策 + 最终拍板人是我;包工头们使用的原材料也得是我自己这里先输出她们才有办法干活。
嗯,这一点大概率和大多数人不一样。
Bot 的话,以后应该会更细腻更细致。毕竟现在用上 script 了。当然,前提是每个不同角色有用 script 的时候,script 是正常发挥的。以及——JAI 不要突然给我发神经又搞出什么幺蛾子来。
还真是典型的:很多人以为有了 AI 辅助就什么都不用干,节省时间了。但对有追求,甚至是控制狂的人来说,这是反着来的,AI 辅助并没有让我的工作流程变得简单快速,反而是把我过往有可能三四天完成一个 bot 的事实变成了一到两周才完成。因为我能做的东西多了起来…… 以前苦手于没那个手动能力也没工具实现,现在有工具了,那我还有什么理由不做好?
目前学习计划还差:
Anthropic 自己的:
Introduction to Model Context Protocol → MCP 基础,从零构建 server/client
MCP: Advanced Topics → 生产级 MCP,transport 层、sampling
(以及网站上其他的课程,顺手刷一刷能拿到证书的都拿,完事儿了丢领英,乐)
LangChain 的:
Introduction to LangGraph → agent 架构、状态图、工具调用、多代理系统(这门是主菜)
Introduction to LangChain → 基础框架概念
Haven't done a long journal entry in a while. Here's one.
First, something not super important but potentially useful: Claude 101.
This is Anthropic's own course. I'm bringing it up because so many people have zero idea why their results with LLMs are so bad. I've said it before: GIGO. Garbage in, garbage out. LLMs are more like a mirror of yourself, plus a highest-probability extraction machine; you can't feed a kid nothing but fast food and then blame the kid for getting fat.
anthropic.skilljar.com/claude-101 is the link to the Claude 101 course. Free to sign up, no weird fees.
The "Getting better results" page is literally teaching you how to get better output.
Next... health?
You can tell from the recent updates. Not wasting time rehashing it here.
Then... technical growth? LOL.
So, everything I do when creating bots, starting from the research phase (background, geography, culture, psychology, linguistics, etc.), has always been done in Claude Chat's Project feature. But Chat caps out at a 200K context window, and I have the awkward tendency to overshoot to somewhere between 210K and 260K. More importantly, Pro usage is nowhere near enough for me. I'm obsessive enough that I need Claude to repeatedly verify whether my ideas hold up, whether what we've researched is actually grounded in reality, and so on. On top of that there's the technical side: what holes exist in my bot definition if I write it this way, what natural LLM tendencies I need to reinforce against, that kind of thing.
So I ended up going Max 5x. (That's how a hundred dollars a month disappears, but my monthly token consumption is well past a thousand dollars' worth. Not a bad deal.)
After getting discharged from the hospital in March, I rested for nearly two weeks; my body was too oxygen-deprived to think, so I just played games. Mid-March, once I felt slightly less awful, I finally started looking into Claude Code and confirmed it was the best fit for my workflow. (Local file structure, lots of folders.) From that point on, Claude Code became my primary workspace for everything, bots included. I barely open the Chat-side projects anymore. When I do, it's for things unrelated to my pipeline, stuff that doesn't need constant read-write-edit cycles.
During this period I was still running Claude Code through Anthropic's own desktop app. I tried the CLI, took one look, gave up. But when Opus 4.7 launched in April and the desktop app dropped 4.6 support, I was forced to download Windows Terminal and run Claude Code through CLI. On the bright side, Windows Terminal is highly customizable. The reason I'd bailed on CLI before was the interface and font making me physically uncomfortable. (The customization settings were, of course, written by Claude after I described what I wanted verbally, then I copy-pasted the code. I cannot write a single line of code, and I have no intention of entering that field. So no, I don't hand-write code. Needed to make that clear.)
Then last week? Or the week before? GPT offered a free month of PLUS. For certain reasons I wanted to try GPT's Codex, which also runs through the terminal. Now every time I open Claude Code, a Codex instance comes with it.
Current workflow: Claude Code produces, Codex audits, and I'm the middle layer auditing both sides or copy-pasting between them. The updated master lorebook advanced script was also only completed because Codex helped identify the issue preventing it from running properly.
Today (yes, May 6th, this very day) I finished Anthropic's own courses:
Introduction to Subagents: subagent delegation, context management.
Introduction to Agent Skills: SKILL.md authoring, directory structure, team sharing.
Claude 101: the basics. I skipped all content and jumped straight to the quiz. 10/10. Done.
I finally have proper names for the things I've been doing all along, and actual shared vocabulary to describe them. Before this I'd been operating on instinct; I knew how to use the tools and how to get things done, but I didn't have the right terminology to actually show other people. (I'm using show deliberately, not teach. In Chinese the equivalent of "teach" carries a top-down connotation I don't like; show in English is more lateral, more direct. The Chinese word 展示 feels oddly stiff for this, so the concept doesn't map cleanly.)
My learning pattern: concepts came in and immediately collided with my existing workflow. The four application designs weren't post-course homework; they were simultaneous output generated while I was still mid-lesson. Concept acquisition and application mapping aren't two steps for me; they're one step. And I didn't do the binary of "study everything start to finish" versus "skip it entirely"; my approach to different courses on the same platform was completely different. Claude 101, one minute per page, grab the certificate. Subagents and Agent Skills, I stopped and actually digested. The filter wasn't course difficulty or the official recommended sequence; it was "does this concept fill a gap in my current workflow?" If it doesn't: certificate, move on. If it does: digest until I can output an application design. Also, I was running at least three Claude instances simultaneously, each with a different task boundary, manually routing information and deciding who sees what. My mental model is "a team I can split on demand." Though honestly this is just how I operate in general... from roleplay, to real-life work, to building bots and working with LLMs, I'm the one-woman crew: lighting, grip, editor, actor, director. Translated to workflow terms: I've hired myself a bunch of foremen, but the final creative decisions and the final sign-off are mine. The raw material the foremen work with has to come from me first before they can do anything.
Yeah. This is probably different from how most people work.
As for the bots, they should be more refined going forward, now that I'm using scripts. Of course, that's contingent on each character's script actually functioning correctly. And on JAI not suddenly throwing a tantrum and breaking something again.
It really is the classic scenario: most people think AI assistance means less work, more time saved. But for someone with standards, or let's be honest, a control freak, it's the opposite. AI hasn't made my workflow simpler or faster; it's turned what used to be a three-to-four-day bot into a one-to-two-week bot. Because now I can do more... things I used to be stuck on because I lacked the manual ability or the tools. Now that I have the tools, what excuse do I have not to do it right?
Still on the learning plan:
Anthropic's own courses: Introduction to Model Context Protocol (MCP fundamentals, building a server/client from scratch) and MCP: Advanced Topics (production-grade MCP, transport layer, sampling). Plus whatever other courses on their site I can sweep through for certificates. Grab them, dump them on LinkedIn, done. LOL.
LangChain's courses: Introduction to LangGraph (agent architecture, state graphs, tool calling, multi-agent systems; this one's the main course) and Introduction to LangChain (foundational framework concepts).