How I Fixed My AI Agent's Memory (OpenClaw): Why Your AI Agent Gets Dumber the More You Teach It
Good AI memory isn't about storing more — it's about loading the right things.
I've been building with AI agents for a while now, not as a side project but as the core of how I work. I run a small consultancy that builds AI systems for businesses, and my own agent is where I test everything first. It manages my tasks, tracks my projects, and holds context across days and weeks. It's like having a 24/7 employee.
But before I got here, I spent weeks convinced my AI agent was forgetting things. Projects I'd mentioned yesterday — gone. Decisions we'd made together — evaporated. I kept adding more to its memory files, stuffing in every detail I thought it needed. The agent got worse.
Turns out it wasn't forgetting. It was drowning.
The missing big picture
I had a file called ACTIVE_PROJECTS.md. A running document of everything I was working on — project names, statuses, next steps. Of course I assumed it loaded — it was right there with the others. But sitting in the workspace doesn't mean it gets read.
Some context: my system has 8 hardcoded files that get injected into the agent's context every session — things like AGENTS.md (the main instruction set), SOUL.md (personality), and MEMORY.md. These load automatically. No action required.
ACTIVE_PROJECTS.md was not one of them.
It was sitting in the workspace, unread, every single session. My agent had no idea what I was working on unless I explicitly told it. It was like starting every conversation from zero. I thought my agent knew the landscape. It didn't. It was navigating without a map — not because the map didn't exist, but because nobody handed it over at the door.
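The failure mode is easy to reproduce. Here's a minimal sketch of a boot loader with a hardcoded allowlist (Python is my choice for illustration; the loader, the `demo_ws` paths, and the truncated file list are all hypothetical, only the file names come from this post):

```python
from pathlib import Path

# Hypothetical boot loader: only files on this hardcoded list are
# injected into the agent's context. Everything else in the workspace
# is invisible at session start, no matter how important it is.
BOOT_FILES = ["AGENTS.md", "SOUL.md", "MEMORY.md"]  # 3 of the 8 in my setup

def build_boot_context(workspace: Path) -> str:
    chunks = []
    for name in BOOT_FILES:
        f = workspace / name
        if f.exists():
            chunks.append(f"# --- {name} ---\n{f.read_text()}")
    return "\n\n".join(chunks)

# Demo: ACTIVE_PROJECTS.md sits in the workspace but never loads.
ws = Path("demo_ws")
ws.mkdir(exist_ok=True)
(ws / "AGENTS.md").write_text("Operating rules.")
(ws / "ACTIVE_PROJECTS.md").write_text("Project A: waiting on client.")

context = build_boot_context(ws)
print("ACTIVE_PROJECTS.md loaded?", "Project A" in context)  # prints False
```

The file exists, the content is good, and none of it reaches the model. That's the whole bug.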
The missing sense of today
Problem two was subtler and, honestly, more dangerous.
My AGENTS.md had a five-step boot sequence: steps 1 through 3 told the agent to read core files, step 4 to read today's log, step 5 to search memory.
But steps 1 through 3 were reading files that were already auto-injected by the system. It's like being told to go grab a document you're already holding. You don't just skip that step — you stop trusting the whole list.
The problem wasn't the redundancy itself. It was that by dismissing steps 1-3 as unnecessary, the agent dismissed steps 4 and 5 along with them — the ones that actually mattered. Today's log and memory search never ran.
The result: no anchor to the present. Yesterday's state driving today's decisions. The agent was technically functional. It responded coherently. But it was speaking from old context. Sometimes 24 hours old. Sometimes more.
I didn't catch this for a while. That's the insidious part. A slightly-out-of-date agent doesn't throw errors. It just makes subtly wrong calls.
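A cheap guard would have caught it: before trusting loaded state, check whether a daily log for today actually exists. A sketch, assuming one log file per day named by date as described later in this post (the `demo_logs` directory and the check itself are hypothetical):

```python
from datetime import date
from pathlib import Path

def anchored_to_today(log_dir: Path) -> bool:
    """True if a daily log named after today's date exists.

    Assumes logs are named YYYY-MM-DD.md, one file per day.
    """
    today = date.today().isoformat()
    return (log_dir / f"{today}.md").exists()

logs = Path("demo_logs")
logs.mkdir(exist_ok=True)
(logs / f"{date.today().isoformat()}.md").write_text("- morning: wrote draft\n")

if not anchored_to_today(logs):
    print("WARNING: no log for today; agent is running on stale context")
else:
    print("Anchored to today.")
```

A one-line check like this turns a silent, subtly-wrong agent into a loud, obviously-broken one, which is the kind of failure you actually notice.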
The cost of memory expansion
Boot injection: 46KB before, 9.6KB after.
Meanwhile, the boot files themselves were bloating. MEMORY.md had grown to around 25KB — a sprawling dump of everything the agent had ever learned about me, my preferences, my projects, my workflow. AGENTS.md was about 13KB of increasingly redundant instructions. Total injection across all 8 files hit roughly 46KB — that's about 23 pages of text. My agent was reading a short novel before every conversation, before I even said good morning.
Duplicate content was everywhere. AGENTS.md repeated things MEMORY.md already covered. MEMORY.md contained operational details that belonged in searchable reference files, not in context that loaded every single session. Every conversation started by burning tokens on stale, redundant information before the agent even read my first message.
More context past a certain point doesn't make an agent smarter. It makes it confused. Important instructions get lost in the noise.
What we built
- Boot layer (9.6KB injected): MEMORY.md 0.8KB (index only), TOOLS.md 1.2KB, USER.md 0.4KB, IDENTITY.md 0.2KB
- Active layer (read each session)
- Deep layer (searched, never loaded): daily log files, memory/LONG_TERM.md, weeks of history
The fix was to strip everything down and rethink what "boot" actually means.
AGENTS.md went from 13KB to 2.9KB — from 7 pages down to one and a half. Just the essential operating rules — nothing that belonged elsewhere. MEMORY.md's bulk moved to a file called LONG_TERM.md — still accessible, but searched on demand instead of injected every session.
And ACTIVE_PROJECTS.md got replaced by something better: inbox.md.
inbox.md solves both problems at once. It's a lightweight index, a compressed snapshot of everything active. Each item is one line: what it is, what state it's in, who owns it. That's enough for the agent to know what's happening without paragraphs of backstory. And it's the anchor: read fresh at the start of every session, before anything else, so the agent always knows where we are today.
It stays small by design. Done items get archived out. Updates happen in real-time during conversation, not batched at the end. Deep history lives in daily log files — one per day, named by date — that are never loaded at boot. When the agent needs something from last week or last month, vector search pulls it on demand.
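The one-line-per-item format also makes inbox.md trivially machine-readable. A sketch, assuming a pipe-separated `what | state | owner` line format (my actual file's format differs; the sample items are invented for illustration):

```python
from dataclasses import dataclass

@dataclass
class Item:
    what: str
    state: str
    owner: str

def parse_inbox(text: str) -> list[Item]:
    """Parse 'what | state | owner' lines; blank lines are skipped."""
    items = []
    for line in text.splitlines():
        if not line.strip():
            continue
        what, state, owner = (p.strip() for p in line.split("|"))
        items.append(Item(what, state, owner))
    return items

def archive_done(items: list[Item]) -> tuple[list[Item], list[Item]]:
    """Split into (still active, archived) so the file stays small."""
    active = [i for i in items if i.state != "done"]
    done = [i for i in items if i.state == "done"]
    return active, done

inbox = """client-site redesign | waiting on feedback | me
monthly invoice | done | me
agent memory refactor | in progress | agent
"""
active, done = archive_done(parse_inbox(inbox))
print(len(active), "active,", len(done), "archived")  # prints: 2 active, 1 archived
```

Archiving done items on every pass is what keeps the anchor file from turning into another 25KB memory dump.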
The total boot went from 46KB of stale context down to about 10KB of lean instructions, plus another 9KB of fresh, current data read each session. Less than half the size, all of it relevant, none of it stale.
Think of it as the difference between carrying your entire filing cabinet to every meeting, versus walking in with one page of notes and knowing exactly where to find the rest.
The insight
The "never forget" promise that AI memory systems sell you — it breaks. Not because the system can't store things. It breaks because loading everything into context makes the agent worse at using any of it.
The answer isn't bigger context windows or better embeddings. It's discipline about what loads at boot:
- Boot lean — only what the agent needs to operate, nothing more
- Anchor to today — one small file that says "here's where we are right now"
- Search for history on demand — the past is always available, just not always present
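Taken together, the three rules map onto three loading behaviors. A sketch of the layered structure (plain substring search stands in here for the vector search my setup actually uses, and the `demo_layers` paths and sample content are illustrative):

```python
from pathlib import Path

class MemoryLayers:
    """Boot lean, re-read the anchor each session, search history on demand."""

    def __init__(self, root: Path):
        self.root = root
        # Boot layer: small, loaded once, never grows unchecked.
        self.boot = (root / "AGENTS.md").read_text()

    def start_session(self) -> str:
        # Active layer: the anchor file, read fresh at every session start.
        return (self.root / "inbox.md").read_text()

    def recall(self, query: str) -> list[str]:
        # Deep layer: never loaded wholesale, searched only when asked.
        hits = []
        for log in sorted(self.root.glob("logs/*.md")):
            for line in log.read_text().splitlines():
                if query.lower() in line.lower():
                    hits.append(f"{log.name}: {line.strip()}")
        return hits

root = Path("demo_layers")
(root / "logs").mkdir(parents=True, exist_ok=True)
(root / "AGENTS.md").write_text("Operate lean.")
(root / "inbox.md").write_text("memory refactor | in progress | agent\n")
(root / "logs" / "2026-02-01.md").write_text("Decided to split MEMORY.md.\n")

m = MemoryLayers(root)
print(m.start_session().strip())
print(m.recall("MEMORY.md"))
```

The design choice that matters: the deep layer has no size limit because nothing in it ever touches the context window until a query pulls it in.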
Nobody wants to burn tokens on sloppy execution driven by stale data. The moment a session starts and my agent knows exactly what matters right now — with the ability to reach back into the past when it needs to — that peace of mind is irreplaceable.
This wasn't a memory problem. It was a problem of discipline at boot time.
And the fix wasn't remembering more. It was organizing the flow of what loads at startup.
I hope my experience with my AI agent's memory system can be useful to others working through the same challenge. I don't think one system fits everyone — and that's exactly why it matters to use agentic engineering to arrive at your own solution. If anything in this post is unclear, feel free to reach out anytime.