Mythos is here, it's time to start tokenmaxxing
A 30-minute field report on burning $5,400 of subsidized AI inference in ten days — and what actually came out of it.
June 12thA 25-minute case for letting agents run their own loops — and how one 2:29 AM prompt produced four merged PRs by morning.
The highest-leverage move in AI-assisted coding is not writing better prompts — it is designing loops where agents orchestrate other agents, close their own feedback cycles, and handle every step that used to require a human messenger.
Most developers run agent loops manually: they prompt, read the output, copy it somewhere, paste it somewhere else, tell the agent what to do next. The insight here is that every one of those handoff steps can be included in the original prompt. The author's real example: a single late-night message asked the agent to (1) spin up a thread to make the PR, (2) spin up a reviewer when the PR landed, (3) keep the first thread in a loop addressing comments until approval, and (4) merge and trigger the next PR. Four stacked PRs were reviewed and merged by morning. The practical takeaway is to audit what you do after an agent finishes and ask whether the agent could do those steps too.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Opens with Pete Steinberger's viral tweet as a hook; host admits the memo didn't reach him and describes his old hand-holding workflow.

Started building more with loops — agents reviewing code, triggering re-reviews, watching PRs, using Hermes to bring context instead of hunting for it.

AI design tool that works within your existing code base; design system selector, Figma import, multi-model support, frame testing.

A Pete post about using Codex threads that spawn other Codex threads clicked in a way the earlier one didn't — specifically the orchestrator-skill pattern.

Uses Anthropic's published article to frame the historical progression: copy-paste → agent edits code → agent orchestrates agents.

Rejects the trend of assigning role-based identities to sub-agents (security reviewer, exploration agent) — the point is dynamic context, not hard-coded roles.

Describes setting up Claude Code to SSH into another machine, monitor a PR, and address CodeRabbit/Greptile/Macroscope comments as they arrive — ran for 6+ hours autonomously.

5.5 performance analysis of Lakebed reveals data architecture problems beyond runtime; dependency-aware validation, mutation coalescing, subscription batching all needed.

Model breaks work into 3 PRs, writes HTML plan files (a pattern credited to Thoric), and creates the first Codex thread. Host merges PR 1 then realizes he should loop harder.

Single message asks for a 4-step loop: spin PR thread -> reviewer thread -> address comments until approval -> merge and trigger next. Set at 2:29 AM, woke to 4 merged stacked PRs at 6:50 AM.

Contrasts the rigid agile sprint (work fits the shape) with dynamic loop design (shape fits the work). Extends to monitoring PRs, morning briefings, even a 5G hotspot deal-finder.

Concrete advice: list every step you take after the agent completes — run server, check it works, commit, push, file PR, copy review comments, address them — and hand each step to the agent.

If you read agent code before another agent reviewed it, you are doing the agent's job. Let peer-agent review happen first; by the time a human looks, only the hard stuff remains.

3 million tokens for one Opus loop addressing three comments. But on the $200 plan: 5 deep loops across several days = 29% of weekly budget. June: $8,600 of inference across machines on ~$600 subscription.

/goal primitive for single-thread never-ending tasks; Hermes Rust rewrite running 12+ hours. Unused subscription budget = money lost. Ask the agent to do the next step.
Every step you take after the agent finishes — running the server, checking output, filing the PR, copying review comments back in — is a step the agent could handle if you asked it to.
“The majority of your agent runs should probably not be running with prompts that you wrote. That is a crazy thing for me to say.”
“We are looking at the code too early. If you are reading the code your agent put out before another agent read it and gave feedback on it, you're wasting your own time.”
“My loops created loops, and they did a great job at it.”
“If you're on the expensive plan, you should be trying to get close to maxing it out because that's just money you're losing if you're not.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
Pete Steinberger's tweet landed like a memo no one forwarded. The host had been running his own loops for months — reading the plan, approving it, kicking off the next step, copy-pasting review comments back in — never quite noticing that the loop itself was the thing the agent could run.
Every post-completion action you take manually is a candidate to include in the original prompt. Walk the list once, then fold it into the agent's instructions.
A single orchestrating prompt that closes the entire PR cycle without human intervention at each step.
“Let me know how it goes and until next time. Peace, nerds.”
Soft close with no explicit subscribe ask — relies on implicit community follow-up
00:00
00:29
00:46
01:04
01:23
01:42
02:00
02:15
02:37
03:00
03:08
03:33
03:45
04:10
04:28
04:40
05:06
05:24
05:43
06:01
06:20
06:38
06:57
07:15
07:34
07:55
08:11
08:30
08:48
09:07
09:25
09:44
10:02
10:21
10:39
10:58
11:17
11:39
11:54
12:12
12:31
12:49
13:08
13:26
13:45
14:04
14:22
14:41
14:59
15:18
15:36
15:55
16:13
16:32
16:50
17:09
17:28
17:46
18:05
18:23
18:42
19:00
19:19
19:37
19:56
20:21
20:33
20:52
21:10
21:29
21:47
22:06
22:24
22:43
23:01
23:20
23:39
23:57
24:16
24:35A 30-minute field report on burning $5,400 of subsidized AI inference in ten days — and what actually came out of it.
June 12thA reluctant 28-minute tour of the Claude Code features every competing harness should steal.
June 17thTheo breaks down how Anthropic silently modified prompts, rewrote its system card, and built invisible safeguards into its most capable model - then got caught.
June 15thA live 14-minute breakdown of the US government export control directive that forced Anthropic to pull Fable 5 and Mythos 5 offline for all non-US citizens — including Anthropic's own employees.
June 13thA 23-minute supply-chain autopsy explaining why Elon's reckless GPU overbuy is now the most valuable compute position in the world.
June 9thA 33-minute first-take from a developer who spent $3,000 on inference in 24 hours — benchmarks, real demos, session math, and the hidden safety intervention that silently degrades the model without telling you.
June 11th