Summary

Lead
Overview of this part
Four blocks — recap
Three lessons from the series
Practical tips for adoption
Production — daily pipeline
Minimum starter kit (any theme)
Next steps for a personal blog
Related posts
Summary

Lead

Hi, I’m Pomarano.

This is Part 4 — the finale of my AI Agent Study Notes.

Series index Building Your Own AI Agents · Part 1 · Part 2 · Part 3

Part 1 covered concepts, Part 2 two-agent design, Part 3 with/without proofreading.
Here: series recap and practical tips for keeping agents running on a personal blog.

Code and specs live in pomarano/x_auto_writing. Use this as a map when you start your own small pipeline.

Japanese version: 個人運用でエージェントを続けるために（第4回）*

Overview of this part

flowchart LR
  P1["Part 1<br/>Concepts"]
  P2["Part 2<br/>Design"]
  P3["Part 3<br/>Compare"]
  P4["Part 4<br/>Summary"]
  OPS["Production<br/>daily pipeline"]

  P1 --> P2 --> P3 --> P4 --> OPS

  classDef meta fill:#eceff1,stroke:#607d8b,stroke-width:2px,color:#1a1a1a
  classDef concept fill:#e8f4fc,stroke:#3d7ea6,stroke-width:2px,color:#1a1a1a
  classDef agent fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px,color:#1a1a1a
  classDef output fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px,color:#1a1a1a
  class P1 concept
  class P2 concept
  class P3 agent
  class P4 meta
  class OPS output

Four blocks — recap

Block	Part	Takeaway	Post
What agents are	1	vs chat, three components, semi-auto	JP Part 1
Sample design	2	Copy + proofread, 6-item checklist	JP Part 2
Build & compare	3	40% → 93% pass; 8 min → 2 min edits	JP Part 3
Summary	This post	Ops tips and next steps	—

Throughout: semi-automatic Japanese X copy — agents write and proofread; humans review and post.

Replace JP links with English URLs after publish.

Three lessons from the series

2-1. Use chat and agents for different jobs

From Part 1: chat for ad-hoc advice; agents for fixed routines with spec.

Use case	Tool
Outline help, brainstorming	Chat
Same morning X draft + same checks	Agent

Do not agent-ify everything — put repeating patterns on agents.

2-2. Rules in spec; verification in another layer

“140 chars max” in spec still failed 5/5 runs in Condition A (Part 3).
Same as the harness article: polite requests alone are unstable.

Layer	Role	Example files
spec	Source of truth	`x-shuuchaku-agent-spec.md` / `x-proofread-agent-spec.md`
prompt	“Run today’s job per spec”	`automation/x-daily/prompt.md`
Proofread (verify)	Check and fix output	Proofreading agent

Change spec only; keep prompts thin — consistent since Part 2.

2-3. Two agents make results measurable

Copy-only mixes writing skill, rule following, and model quirks.
Splitting copy and proofread enables A/B comparison (Part 3).

Metric	Copy only	Copy + proofread
Checklist pass	40%	93%
Human edits	~8 min	~2 min
Post as-is	0 / 5	4 / 5

Proofreading does not replace humans for theme duplication or teaching accuracy.

Practical tips for adoption

3-1. Start small and measure

Tip	Detail
One agent OK; verify separately	Copy alone works; measure compliance via proofread or harness
Spec is canonical	Rule changes → spec only
Prefer semi-auto	Reviewable beats fully automatic for individuals
Compare with numbers	A/B, pass rate, edit minutes

3-2. When to split into multiple agents

Consider two or more when:

One prompt mixes “write” and “fix” and quality wobbles
Six+ check items and you edit by hand every time
You need to know which role failed

Copy + proofread enabled Part 3’s comparison.

3-3. Stumbles and fixes

Stumble	Fix
Length / format keeps failing	Proofreading agent or harness
Wrong or inappropriate content	Human final review
Manual start is tedious	GitHub Actions (X semi-auto)
Spec drift	One line in spec per week
Too many agents too soon	Stabilize one pipeline first

Production — daily pipeline

Run the two agents from the study notes every day.

flowchart TB
  T["Morning (fixed time)"]
  A1["① Copy agent<br/>research · draft"]
  A2["② Save draft md"]
  A3["③ Proofread agent<br/>6-item check"]
  H1["④ Human review"]
  H2{⑤ Edit?}
  H3["⑥ Post to X"]
  H4["⑦ status: posted (optional)"]

  T --> A1 --> A2 --> A3 --> H1 --> H2
  H2 -->|if needed| H1
  H2 -->|OK| H3 --> H4

  classDef agent fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px,color:#1a1a1a
  classDef human fill:#fff3e0,stroke:#e65100,stroke-width:2px,color:#1a1a1a
  class A1 agent
  class A2 agent
  class A3 agent
  class H1 human
  class H2 human
  class H3 human
  class H4 human

Stage	Owner	Work
Copy	Copy agent	One post per weekday rotation
Proofread	Proofread agent	140 chars, two-sentence shape, etc.
Review	Human	Email or GitHub → post
Log	Human (optional)	`status: posted`

Automating copy with GitHub Actions + Cursor SDK still leaves proofread and posting to humans — same design as Part 2.

Minimum starter kit (any theme)

Step	Action
1	One-sentence goal — e.g. “Save one X draft md every morning”
2	Write spec — format, bans, out-of-scope
3	Thin prompt — “Run today’s job per spec”
4	Five manual runs — note failure patterns
5	Add verification — proofread agent or harness
6	A/B measure — until numbers improve
7	Actions if needed — after ops feel stable

Template repo: pomarano/x_auto_writing

Next steps for a personal blog

Priority	Action
High	Daily rhythm — copy → proofread → human → X
High	From proofread logs, add top violations to copy spec
Medium	Actions for copy; proofread manual or chained
Low	Second pipeline (blog drafts, etc.) — same pattern

Relaunching the blog to record AI learning means agent ops are both subject and material.
Stabilize one pipeline before multiplying agents.

Post	Content
Index — AI Agent Study (JP)	Reading order
Part 1 (JP)	Concepts
Part 2 (JP)	Design
Part 3 (JP)	Metrics
Harness (EN)	JSON verification
X semi-auto (EN)	Actions · email

Generative AI for consultation; agents for fixed routines
spec + prompt + proofread brings rule compliance to a practical level
Copy + proofread: pass rate 40% → 93%, edits ~8 min → ~2 min (Part 3)
Sustain with semi-auto, spec as truth, measurable comparison
Study notes end here; daily two-agent ops is production

From Part 1‘s goal of putting AI on routines, through design, build, and compare — the arc is complete.
Try building just one small agent pipeline of your own.