Tài liệu MemoryAI

Cài một lần. Bộ não của anh em chạy mãi mãi. Hướng dẫn đầy đủ cho mọi IDE, mọi ngôn ngữ.

Cài extension cho gọn — khỏi paste config thủ công

Kiro · Cursor · Windsurf · VS Code. Paste API key vào, xong. Extension tự ghi MCP server, hooks và rules cho anh em.

VS Marketplace Open VSX

Hoặc CLI: code --install-extension memoryai.memoryai-vsx · rồi chạy MemoryAI: Connect trong command palette.

— hoặc đọc hướng dẫn cài thủ công bên dưới (Claude Code / Antigravity / bot integrations) —

Anh em mới bắt đầu? Đọc 3 phần này thôi:

1. Lấy API key · 2. Cài cho IDE của anh em · 3. Dán rule auto-bootstrap. Xong. Mọi thứ tự động sau đó.

Cài 3 phút

Lấy API key miễn phí

Vào memoryai.dev/demo bấm "Tạo brain miễn phí". Hoặc gọi API trực tiếp:

curl -X POST https://memoryai.dev/v1/admin/provision \
  -H "Content-Type: application/json" \
  -d '{"name": "my-agent", "tos_accepted": true}'

Copy api_key trong response. Bắt đầu là plan Personal — 14 ngày dùng thử Pro miễn phí, không cần thẻ.

Cài MCP server

npx memoryai-mcp@latest

Hoặc cài global một lần: npm install -g memoryai-mcp. Yêu cầu Node.js 18+.

Dán config vào IDE

Xem mục Cài theo IDE bên dưới — chọn IDE của anh em, copy file config, thay YOUR_KEY bằng key vừa lấy.

Dán rule auto-bootstrap

Một file rule duy nhất để AI tự gọi memory mỗi session. Xem mục Auto-bootstrap rule.

Xong rồi đó.

Mở IDE lên, AI tự load brain, tự nhớ, tự bảo vệ context. Anh em không cần làm thêm gì.

Lấy API key

Có 3 cách:

Cách 1: Demo page (dễ nhất)

Vào /demo.html, bấm "Tạo brain miễn phí". Key tự lưu vào localStorage của browser. Dùng được luôn cho demo.

Cách 2: API call

curl -X POST https://memoryai.dev/v1/admin/provision \
  -H "Content-Type: application/json" \
  -d '{
    "name": "my-agent",
    "tos_accepted": true,
    "email": "[email protected]"
  }'

Response:

{
  "tenant_id": "...",
  "api_key": "hm_sk_...",
  "plan": "personal",
  "trial_pro_until": "2026-06-12T00:00:00Z"
}

Cách 3: Đăng ký + thanh toán

Xem bảng giá. Hỗ trợ VNPay, MoMo, chuyển khoản, hoá đơn VAT đầy đủ.

Bảo mật key

Key này quyền truy cập toàn bộ brain. Đừng commit lên git. Đừng share public. Nếu lộ — vào /v1/admin/rotate-key để đổi.

Cài theo IDE

Cấu trúc config giống nhau cho mọi IDE — chỉ khác đường dẫn file. Chọn IDE của anh em:

Claude Code (CLI) — Khuyến nghị

1 lệnh. Memory tự động hoàn toàn.

Package memoryai-claude cài 3 lifecycle hooks — memory tự recall trước mỗi prompt và tự save sau mỗi turn. Không cần agent quyết định gì.

npx memoryai-claude@latest install

Restart Claude Code 1 lần. Xong. CLI sẽ tự:

Bước	Làm gì
1	Hỏi API key — bỏ trống = tự tạo key free
2	Ghi 3 hooks vào `~/.claude/settings.json`: • SessionStart → load context đầu phiên • UserPromptSubmit → recall trước mỗi prompt • Stop → save sau mỗi turn
3	Thêm note vào `~/.claude/CLAUDE.md`
4	Backup file gốc trước khi sửa

Cập nhật lên bản mới

Chạy lại đúng lệnh install — nhớ kèm @latest để không dính bản npx đã cache cũ. Hook cũ sẽ được ghi đè (và backup), data trên server giữ nguyên.

npx memoryai-claude@latest install

Lệnh phụ:

npx memoryai-claude@latest doctor     # check hooks + ping server
npx memoryai-claude@latest status     # xem scope (user/project)
npx memoryai-claude@latest uninstall  # gỡ hooks, không xóa data

Cờ: --project (project scope) · --endpoint URL (self-host) · --key KEY (skip prompt) · MEMORYAI_NONINTERACTIVE=1 (CI)

Tùy chỉnh ngưỡng compact (tokens)

Mặc định MemoryAI bắt đầu lưu (compact) ở 160K tokens và ép compact ở trần cứng 192K (1.2× điểm compact). Áp dụng cho mọi model — Claude 200K, Gemini 1M, GPT-5 — để chi phí dự đoán được.

Bạn chỉ chọn 1 con số duy nhất: điểm compact. Trần cứng tự suy ra ở 1.2× nên không phải chỉnh gì thêm. Cài đặt lưu trên server theo API key, nên chỉ cần set 1 lần — áp dụng cho mọi IDE/host bạn dùng (Claude Code, Kiro, Cursor, Windsurf...).

npx memoryai-claude@latest config                       # xem ngưỡng hiện tại
npx memoryai-claude@latest config compact 200000        # compact 200K → trần 240K (1.2x)
npx memoryai-claude@latest config compact reset         # gỡ override → về mặc định
npx memoryai-claude@latest config reset                 # gỡ tất cả → về defaults

Lưu ý: đặt compact sẽ tự đặt trần cứng = 1.2× trong cùng bước, nên compact luôn fire trước khi chạm trần. Trên model context bé (32K, 16K) hệ thống tự cap để compact vẫn fire trước khi context bị cắt.

Cách khác: chỉ MCP config (ít tự động hơn)

File: ~/.claude/settings.json

{
  "mcpServers": {
    "memoryai": {
      "command": "npx",
      "args": ["-y", "memoryai-mcp"],
      "env": {
        "HM_ENDPOINT": "https://memoryai.dev",
        "HM_API_KEY": "YOUR_KEY"
      }
    }
  }
}

MCP-only = agent tự quyết khi nào gọi tools. CLI hooks = tự động hoàn toàn.

Restart Claude Code. Gõ /mcp để check — phải thấy memoryai connected.

Cursor

File: ~/.cursor/mcp.json

{
  "mcpServers": {
    "memoryai": {
      "command": "npx",
      "args": ["-y", "memoryai-mcp"],
      "env": {
        "HM_ENDPOINT": "https://memoryai.dev",
        "HM_API_KEY": "YOUR_KEY"
      }
    }
  }
}

Mở Settings → MCP, bật toggle memoryai. Restart Cursor.

Kiro

File: .kiro/settings/mcp.json (workspace) hoặc ~/.kiro/settings/mcp.json (global)

{
  "mcpServers": {
    "memoryai": {
      "command": "npx",
      "args": ["-y", "memoryai-mcp"],
      "env": {
        "HM_ENDPOINT": "https://memoryai.dev",
        "HM_API_KEY": "YOUR_KEY"
      },
      "disabled": false,
      "autoApprove": [
        "memory_bootstrap", "memory_recall", "memory_store",
        "context_guard_check", "context_guard_compact",
        "memory_pitfall_check", "l2_inject"
      ]
    }
  }
}

Workspace config sẽ override user config. autoApprove giúp Kiro không hỏi confirm mỗi lần gọi tool.

Windsurf

File: ~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "memoryai": {
      "command": "npx",
      "args": ["-y", "memoryai-mcp"],
      "env": {
        "HM_ENDPOINT": "https://memoryai.dev",
        "HM_API_KEY": "YOUR_KEY"
      }
    }
  }
}

VS Code (Copilot / Cline)

File: .vscode/mcp.json (workspace)

{
  "servers": {
    "memoryai": {
      "command": "npx",
      "args": ["-y", "memoryai-mcp"],
      "env": {
        "HM_ENDPOINT": "https://memoryai.dev",
        "HM_API_KEY": "YOUR_KEY"
      }
    }
  }
}

Note: VS Code dùng key servers, không phải mcpServers.

Antigravity

File: ~/.antigravity/mcp.json

{
  "mcpServers": {
    "memoryai": {
      "command": "npx",
      "args": ["-y", "memoryai-mcp"],
      "env": {
        "HM_ENDPOINT": "https://memoryai.dev",
        "HM_API_KEY": "YOUR_KEY"
      }
    }
  }
}

Claude Desktop

File:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "memoryai": {
      "command": "npx",
      "args": ["-y", "memoryai-mcp"],
      "env": {
        "HM_ENDPOINT": "https://memoryai.dev",
        "HM_API_KEY": "YOUR_KEY"
      }
    }
  }
}

Restart Claude Desktop hoàn toàn (Quit, không phải close window).

Auto-bootstrap rule

Sau khi cài MCP, anh em cần một file rule để AI tự gọi memory mỗi lần mở session. Mỗi IDE có kiểu rule khác nhau, nhưng nội dung giống hệt:

IDE	Đường dẫn file
Claude Code	`~/.claude/CLAUDE.md`
Cursor	`.cursor/rules/memoryai.mdc`
Kiro	`.kiro/steering/memoryai.md`
Windsurf	`.windsurfrules` (workspace root)
VS Code (Copilot)	`.github/copilot-instructions.md`
IDE khác	File system prompt / rules của IDE đó

Cách 1: Một dòng curl (nhanh nhất)

# Claude Code
curl -s https://memoryai.dev/install/SKILL.md > ~/.claude/CLAUDE.md

# Cursor (Linux/macOS)
mkdir -p .cursor/rules && curl -s https://memoryai.dev/install/SKILL.md > .cursor/rules/memoryai.mdc

# Kiro
mkdir -p .kiro/steering && curl -s https://memoryai.dev/install/SKILL.md > .kiro/steering/memoryai.md

Cách 2: Copy-paste manual

Copy nội dung dưới đây, dán vào file rule tương ứng:

# MemoryAI — Auto Memory + Context Guard

Bạn có MemoryAI — bộ não persistent nhớ qua mọi session.
Mọi thứ dưới đây là TỰ ĐỘNG. Tuân theo mỗi session.

## 1. Session start (LUÔN LUÔN làm đầu tiên)

Gọi `memory_bootstrap` ngay khi user gửi tin nhắn đầu tiên. Trả về:
- User identity + preferences (DNA-protected)
- Recent activity 24h
- Active goals
- Mood + persona

KHÔNG hỏi user lặp lại context cũ.

## 2. Trong session — auto-store (không cần hỏi)

| Khi nào | Tool | memory_type |
|---|---|---|
| User nói preference ("tao thích...") | memory_store | preference |
| Quyết định được chốt | memory_store | decision |
| User chia sẻ identity (role, công ty) | memory_store | identity |
| Fact quan trọng học được | memory_store | fact |
| Workflow chạy ổn định | memory_store | procedure |
| Lỗi cần tránh lần sau | memory_store | pitfall |

KHÔNG store: "ok", "thanks", debug output, thứ đã có trong code/git.

## 3. Recall TRƯỚC KHI làm

- Trước đề xuất kiến trúc → memory_recall trước
- User nói "you did this before" → memory_recall ngay, đừng đoán
- Trước action rủi ro (deploy, rm, push, migrate) → memory_pitfall_check

## 4. Context Guard (mỗi 15 messages)

Gọi context_guard_check với:
{"estimated_tokens": current_count, "max_tokens": model_window}

Response:
- safe → tiếp tục
- compact_soon → chuẩn bị summarize
- compact_now → gọi context_guard_compact NGAY với summary đầy đủ

## 5. Smart Context Inject — cho deep context

Khi cần composed context block trong budget:
l2_inject {query, context_budget_tokens: 2000}
→ Trả về DNA + decisions + facts + recent đã merge sẵn. Cache 10 phút.

## 6. Important

- Nếu MemoryAI down → tiếp tục bình thường (non-fatal)
- preferences/decisions/identity là DNA — không bao giờ expire
- Facts thường fade theo thời gian nếu không dùng

## Quick MCP tools

| Tool | Khi nào |
|---|---|
| memory_bootstrap | Session start |
| memory_recall | Cần context cũ |
| memory_store | Save preference/decision/identity/fact/pitfall |
| memory_pitfall_check | Trước action rủi ro |
| context_guard_check | Mỗi 15 messages |
| context_guard_compact | Context full |
| l2_inject | Cần composed context block |
| memory_changelog | Catch up sau khi away

Done. 2 bước thôi.

(1) Dán MCP config → MemoryAI connected. (2) Dán rule trên → AI biết cách dùng. Mở IDE — mọi thứ tự động.

Python SDK

pip install hmc-memory

Quick example

from memoryai import AsyncMemoryAI

async with AsyncMemoryAI(
    api_key="hm_sk_...",
    base_url="https://memoryai.dev"
) as mem:
    # Session start — load brain context
    ctx = await mem.bootstrap(task="building chatbot")

    # Store memory
    await mem.store(
        "User prefers Vietnamese for casual, English for code",
        memory_type="preference",
        zone="important"
    )

    # Recall (semantic + graph + FTS fusion)
    results = await mem.recall(
        "what language does user prefer?",
        depth="deep",
        limit=5
    )

    # Context guard
    guard = await mem.ide_guard_check(
        estimated_tokens=85000,
        max_tokens=200000
    )
    if guard.get("should_compact"):
        await mem.ide_guard_compact(
            content="conversation summary...",
            task_context="chat session"
        )

    # Smart context inject — composed context block in budget
    l2 = await mem.l2_inject(
        query="user preferences for stack",
        context_budget_tokens=2000
    )
    # l2["composed_context"] → paste into your prompt

Sync version

from memoryai import MemoryAI

mem = MemoryAI(api_key="hm_sk_...", base_url="https://memoryai.dev")
ctx = mem.bootstrap(task="my task")
results = mem.recall("query")
mem.close()

Node.js SDK

npm install @cortex-memory/memoryai

import { MemoryAI } from '@cortex-memory/memoryai';

const mem = new MemoryAI({
  apiKey: 'hm_sk_...',
  baseUrl: 'https://memoryai.dev'
});

// Session start
const ctx = await mem.bootstrap({ task: 'my project' });

// Store
await mem.store('User prefers dark mode', {
  memoryType: 'preference',
  zone: 'important'
});

// Recall
const results = await mem.recall('user preferences', {
  depth: 'deep',
  limit: 5
});

// Context Guard
const guard = await mem.ideGuardCheck({
  estimatedTokens: 85000,
  maxTokens: 200000
});

if (guard.should_compact) {
  await mem.ideGuardCompact({
    content: summarizeConversation(messages),
    taskContext: 'current task'
  });
}

// Smart Context Inject
const l2 = await mem.l2Inject({
  query: 'user stack preferences',
  contextBudgetTokens: 2000
});
console.log(l2.composed_context);

OpenAI Proxy

Dùng MemoryAI như drop-in replacement cho OpenAI API. Tự động inject memory vào mỗi request.

from openai import OpenAI

# Chỉ cần đổi base_url + thêm header
client = OpenAI(
    api_key="hm_sk_...",
    base_url="https://memoryai.dev/v1/openai-proxy",
    default_headers={"X-Memory-Inject": "true"}
)

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "what was my last decision?"}]
)
# Response sẽ có context được inject từ brain

Proxy chạy như nào?

Khi anh em gọi qua proxy, MemoryAI tự động: (1) recall context relevant cho query, (2) inject vào system prompt, (3) forward đến OpenAI/Anthropic/provider, (4) extract facts từ response, (5) store về brain. Round-trip thêm khoảng 100-200ms cho recall.

REST API

Mọi tính năng đều có REST endpoint. Dùng curl, Postman, hoặc bất kỳ HTTP client nào.

Base URL + Auth

Base: https://memoryai.dev
Auth: Authorization: Bearer hm_sk_...

Store memory

curl -X POST https://memoryai.dev/v1/store \
  -H "Authorization: Bearer hm_sk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "content": "User thích cà phê đen không đường",
    "memory_type": "preference",
    "zone": "important",
    "tags": ["coffee", "preference"]
  }'

Recall

curl -X POST https://memoryai.dev/v1/recall \
  -H "Authorization: Bearer hm_sk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "query": "user prefer drink",
    "depth": "deep",
    "limit": 5,
    "language": "auto"
  }'

language: "auto" — server tự detect tiếng Việt vs English (96% chính xác cho VN, 100% cho EN). Tiếng Việt được match qua Vietnamese FTS dictionary tự build.

Smart Context Inject

curl -X POST https://memoryai.dev/v1/l2/inject \
  -H "Authorization: Bearer hm_sk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "query": "what does user need for current task",
    "context_budget_tokens": 2000
  }'

Response có composed_context dán thẳng vào prompt.

Brain Export

curl -X POST https://memoryai.dev/v1/brain/export \
  -H "Authorization: Bearer hm_sk_..." \
  -H "Content-Type: application/json" \
  -d '{"scope": "full"}' \
  -o brain-bundle.json

JSON bundle theo Bundle Format v1 (CC BY 4.0). Anh em download, sau này import vào server self-host (lite-build) hoặc bất kỳ implementation nào tuân Protocol v1.

Public stats (no auth)

curl https://memoryai.dev/v1/internal-stats
curl https://memoryai.dev/v1/spec/info
curl https://memoryai.dev/v1/benchmark/pricing

Endpoints public, không cần auth. Dùng để monitor hệ thống hoặc embed live numbers vào landing page.

Đầy đủ API: /redoc · OpenAPI spec: /openapi.json

Context Guard

Context Guard tự bảo vệ session khỏi context overflow. Khi window đầy, AI tự compact conversation về brain — không mất data, không cần anh em can thiệp.

Ngưỡng mặc định

Trạng thái	% context	Hành động
SAFE	< 30%	Tiếp tục bình thường
COMPACT_SOON	30-50%	AI chuẩn bị summary
COMPACT_NOW	> 50%	AI gọi compact ngay, save về brain

Preset gợi ý

Preset	compact_pct	critical_pct	Phù hợp khi
Conservative (mặc định khi có MemoryAI)	0.20	0.30	Brain đã save tất cả ngoài → compact sớm an toàn
Balanced	0.30	0.50	Dùng chung, không có external memory
Aggressive	0.50	0.70	Session dài, tin AI compact nhanh

Khuyến nghị: dùng 20/30 với MemoryAI active

Memories đã được save bên ngoài rồi — không cần giữ mọi thứ trong context. Compact sớm, compact thường xuyên. Không mất gì cả.

API: check + compact

POST /v1/ide/guard/check
Authorization: Bearer YOUR_KEY

{
  "estimated_tokens": 100000,
  "max_tokens": 200000,
  "compact_pct": 0.20,
  "critical_pct": 0.30
}

Response:

{
  "recommendation": "compact_now",
  "urgency": "high",
  "should_compact": true,
  "usage_percent": 50.0,
  "dna_memories": 12,
  "bootstrap_ready": true
}

Compact

POST /v1/ide/guard/compact

{
  "content": "Summary đầy đủ: thảo luận pricing Kiro, chốt 2K plan giá $20/acc, margin 92.7%, Gemini 1.5 cho L1, Claude cho L2...",
  "task_context": "pricing decision"
}

Buffer Fallback

Nếu bot gửi summary ngắn hoặc useless ("context guard triggered"), server tự dùng buffer nội bộ (đã accumulate từ recall queries trước đó). Anh em không cần lo phải gửi summary perfect — nhưng summary tốt thì memory sinh ra cũng tốt hơn.

Tuỳ chỉnh per-tenant

v2.3.2+ dùng token tuyệt đối, không phải %. Anh em pick ngưỡng theo cost ceiling muốn giam:

# Set absolute thresholds via per-request override
curl -X POST https://memoryai.dev/v1/ide/guard/turn-check \
  -H "Authorization: Bearer hm_sk_..." \
  -H "Content-Type: application/json" \
  -d '{"turn_count": 30, "compact_at_tokens": 100000, "critical_at_tokens": 150000}'

Hoặc set qua MCP env:

"env": {
  "HM_COMPACT_AT": "100000",
  "HM_CRITICAL_AT": "150000"
}

Pick ngưỡng theo cost ceiling muốn giam — nó áp dụng đồng đều cho mọi model, không phụ thuộc model window. Tăng số lên cho session dài hơn (vd 400K/600K trên model 1M).

Priority: per-request override > env vars > server defaults.

Smart Context Inject

Khi anh em cần composed context block trong budget — DNA + decisions + facts + recent đã merge sẵn — gọi endpoint này. Server tự chọn relevant nhất, fit budget tokens, paste thẳng vào prompt của top model.

API

POST /v1/l2/inject

{
  "query": "what should I do for current task",
  "context_budget_tokens": 2000,
  "include_dna": true,
  "include_recent": true
}

Response:

{
  "composed_context": "## DNA\n- ...\n## Recent Decisions\n- ...\n## Insights\n- ...",
  "tokens_used": 1847,
  "components": ["dna:5", "decisions:3", "facts:8", "recent:4"],
  "cache_hit": false,
  "latency_ms": 142
}

Cách dùng

Paste composed_context thẳng vào system prompt hoặc đầu user message. AI nhận được context đã được pre-compose, không phải raw dump.

Cache 10 phút

Query lặp lại trong 10 phút = trả về từ cache (latency ~10ms). Phù hợp pattern: agent gọi nhiều lần trong cùng task.

Brain Export — Vendor Sovereignty

DNA #3: Brain thuộc về user, không phải vendor. Anh em export brain ra JSON bundle bất kỳ lúc nào, import vào server self-host hoặc implementation khác tuân Protocol v1.

Export

curl -X POST https://memoryai.dev/v1/brain/export \
  -H "Authorization: Bearer hm_sk_..." \
  -d '{"scope": "full"}' \
  -o brain-bundle.json

Scope options:

full — toàn bộ chunks + edges + entities + sessions + mood (mặc định)
dna_only — chỉ preferences/decisions/identity (lightweight identity portable)
since — incremental, cần thêm "since": "2026-05-01T00:00:00Z"

Bundle Format v1

{
  "format": "memoryai-bundle",
  "version": "1.0",
  "manifest": {
    "exported_at": "2026-05-29T...",
    "tenant_hash": "...",
    "checksum_sha256": "..."
  },
  "chunks": [...],
  "edges": [...],
  "entities": [...],
  "sessions": [...],
  "embedding_model": "text-embedding-3-small"
}

Import vào instance khác

curl -X POST https://your-instance.com/v1/brain/import \
  -H "Authorization: Bearer NEW_KEY" \
  -H "Content-Type: application/json" \
  --data-binary @brain-bundle.json

Idempotent — chunks dedup theo content_hash. Embeddings reused nếu model match, drop nếu khác (re-embed lazily).

Khi nào export?

(1) Trước migrate sang lite-build self-host. (2) Trước trial expire (giữ brain). (3) Backup định kỳ. (4) Audit / compliance (GDPR DSR, VN PDPA Decree 13/2023).

31 MCP Tools

Đầy đủ tools available qua MCP — gọi từ Claude Code, Cursor, Kiro, Windsurf, hoặc bất kỳ MCP client nào.

Core (mỗi session đều dùng)

Tool	Khi dùng
`memory_bootstrap`	Session start — load DNA + recent + persona
`memory_store`	Save preference / decision / identity / fact / pitfall / procedure
`memory_recall`	Search semantic + graph + FTS fusion
`memory_pitfall_check`	Trước action rủi ro (deploy, rm, migrate)
`memory_changelog`	Catch up sau khi away — see what brain learned overnight

Context Guard

Tool	Khi dùng
`context_guard_check`	Mỗi 15 messages — check pressure
`context_guard_compact`	Khi ≥ 50% — compact session về brain
`context_guard_bootstrap`	Full bootstrap với task context

Smart Context

Tool	Khi dùng
`l2_inject`	Lấy composed context block trong budget
`l2_status`	Check cache + stats
`reasoning_store`	Deep reasoning memory (Pro+)
`reasoning_recall`	Recall reasoned insights (Pro+)

Brain ownership (DNA #3)

Tool	Khi dùng
`brain_export`	Export portable JSON bundle
`brain_import`	Restore bundle (idempotent)
`spec_info`	Protocol v1 contract reference

Multi-agent (DNA #1)

Tool	Khi dùng
`inherit_brain`	Một brain → nhiều agents
`fed_query`	Federated query qua nhiều brains
`trust_agents`	Reputation leaderboard (team+)
`trust_chunk`	Per-chunk trust info
`twin_respond`	Cognitive Twin respond (promax+)
`twin_status`	Twin readiness check

Insights + benchmark

Tool	Khi dùng
`memory_explore`	Explore neural graph từ chunk_id
`memory_clusters`	View topic clusters
`memory_recover`	Recover session sau break
`memory_predict`	Pre-cache likely next memories
`memory_feedback`	Rate recall → brain learns
`memory_cognitive_profile`	Self-model: persona + mood + goals
`benchmark_recall_vs_full`	Smart recall vs full-context dump
`benchmark_pricing`	Public pricing reference
`memory_plan_save`	Save work state cho session sau
`memory_plan_resume`	Restore work state
`memory_goal_track`	Track goals qua sessions

Cách hoạt động

Anh em không cần đọc phần này để dùng. Đọc nếu muốn hiểu sao memory persist được, sao recall ra đúng thứ cần.

Cách brain tổ chức memory

Recent memory
       Conversations, raw chunks, fresh decisions
       Searched via hybrid recall

Distilled memory
       Patterns and summaries auto-built in the background
       Returns near-instant on repeat queries

Long-term archive
       Compressed sessions, DNA preserved forever
       Bootstrap pulls "who is the user" on demand

Flow một session bình thường

Open IDE
  → MCP auto-connects
  → AI reads rule → calls memory_bootstrap
  → Loads: DNA + recent activity + preferences + entities + persona
  → AI biết bạn là ai và đang làm gì

Trong session:
  → AI auto-stores quyết định + preferences quan trọng
  → Memories tagged + embedded + linked vào graph
  → Smart fusion 4 nguồn cho recall

Context đầy:
  → AI gọi context_guard_check → "compact_now"
  → AI gọi context_guard_compact → save session về brain
  → Mọi thứ preserved

Background (24/7, không cần user):
  → Sleep consolidation (LLM phân tích insights overnight)
  → association strengthening (memories co-recalled gắn chặt hơn)
  → Decay (memories không dùng fade dần)
  → Community detection (tìm topic clusters)
  → DNA-protected memories KHÔNG bao giờ expire

Session sau:
  → memory_bootstrap load lại tất cả
  → Bao gồm insights mới từ overnight consolidation
  → Cycle tiếp tục. Brain càng dùng càng smart hơn.

Recall fusion

Khi anh em gọi memory_recall, server merge nhiều nguồn thành một bảng xếp hạng:

Vector similarity (semantic — meaning match)
Full-text search (lexical — exact word match, có Vietnamese tokenizer)
Neural graph traversal (memories liên quan)
Recency boost + DNA priority boost

Có 4 depth modes:

Depth	Latency	Khi dùng
`instant`	~50ms	Vector only — quick lookup
`fast`	~100ms	FTS only — exact match
`deep`	~200ms	Full fusion (mặc định)
`exhaustive`	~400ms	Deep + thêm results

Brain hoạt động như thế nào

MemoryAI mô phỏng cơ chế bộ não thật:

Encoding — chuyển input thành memory chunk có embedding + tags
Consolidation — định kỳ trích insight từ raw chunks lúc rảnh
Natural fade — memories không dùng phai dần
DNA protection — preferences / decisions / identity / pitfalls / procedures không bao giờ expire
Association linking — memories co-recalled gắn chặt hơn qua edges
Spreading activation — recall lan toả qua graph
Mood tracking — adapt response style theo trạng thái user
Community detection — auto-cluster topics
Pattern extraction — tìm recurring themes
Self-model — persona + goals + entities + procedures

Open by design

Cách brain hành xử (không phải code internal) là public — anh em có thể đọc, viết clone. Cái không public: tham số fine-tune, prompt cụ thể, weight per-tenant. Đó là sản phẩm của hàng nghìn iteration trên brain thật.

Khắc phục lỗi

Vấn đề	Nguyên nhân	Cách fix
MCP tools không hiện	IDE chưa restart sau edit config	Restart IDE hoàn toàn (Quit, không chỉ close window)
Connection refused	Server unreachable hoặc URL sai	Check `HM_ENDPOINT=https://memoryai.dev`
401 Unauthorized	API key sai hoặc hết hạn	Check `HM_API_KEY`, gọi `/v1/admin/whoami` để verify
403 Forbidden	Plan không support tool đó	Check pricing — một số tool yêu cầu Pro+ / Team+
AI không auto-bootstrap	Thiếu rule file	Tạo file rule (xem section auto-rule)
npx not found	Node.js chưa cài	Cài Node 18+: nodejs.org
Memory không persist	Endpoint sai hoặc plan free hết quota	Verify endpoint, check `/v1/admin/usage`
Context Guard không trigger	Rule thiếu hoặc `estimated_tokens=0`	Add `context_guard_check` vào rule. Truyền `estimated_tokens > 0`
Recall scores all 1.0	Scores capped sau layer weighting (bình thường)	Dùng field `layer` để phân biệt DNA vs sessions vs archive
Tiếng Việt không match	Recall query bằng English, content tiếng Việt	Thêm `"language": "auto"` hoặc `"vi"`
Smart context inject quá chậm	Cache miss + budget lớn	Giảm `context_budget_tokens`, query đơn giản hơn
Brain Export thiếu data	Scope `since` với date sai	Dùng `"scope": "full"` để export tất cả

Debug tips

Check connection: curl https://memoryai.dev/v1/health
Check key: curl -H "Authorization: Bearer YOUR_KEY" https://memoryai.dev/v1/admin/whoami
Check usage: curl -H "Authorization: Bearer YOUR_KEY" https://memoryai.dev/v1/admin/usage
Check MCP version: npx memoryai-mcp --version
Force update MCP: npm install -g memoryai-mcp@latest

MemoryAI down?

Khi server không reachable, AI sẽ tiếp tục làm việc bình thường (graceful degradation). Bạn không bị block. Khi server up lại, memory tự sync. Để check status real-time: /v1/health.

MemoryAI Protocol v1

Protocol v1 là spec mở (CC BY 4.0) — anyone có thể implement. Reference implementation MIT-licensed.

3 conformance levels

Level	Yêu cầu
Producer	Có thể export bundle theo Bundle Format v1
Consumer	Có thể import bundle, dedup theo content_hash
Bidirectional	Cả Producer + Consumer

Public spec endpoints (no auth)

Endpoint	Trả về
`GET /v1/spec`	Markdown spec đầy đủ (CC BY 4.0)
`GET /v1/spec/info`	JSON contract: format, version, conformance levels, endpoints, memory types
`GET /v1/internal-stats`	Live system stats (memories, calls, uptime)
`GET /v1/benchmark/pricing`	Assumed $/1M-token pricing per LLM

Đầy đủ: memoryai.dev/spec/v1

Hỏi đáp

Brain của tôi thuộc về ai?

Của anh em. Brain Export trả về JSON bundle vendor-neutral — anh em download bất kỳ lúc nào, import vào server self-host (lite-build) hoặc bất kỳ implementation nào tuân Protocol v1. Đây là DNA #3, không thương lượng.

Có lock-in không?

Không. Brain Export = JSON spec mở (CC BY 4.0). MCP package open source. SDK mã nguồn mở. Server code self-hostable (Q3 2027 ra lite-build). Lock-in chỉ xảy ra khi vendor giữ format proprietary — MemoryAI không làm vậy.

Trial Pro 14 ngày tự renew không?

Không. Hết 14 ngày, tự về plan Personal (free). Anh em không bị charge bất ngờ. Nếu muốn nâng cấp, vào pricing-vn chọn plan, thanh toán qua VNPay/MoMo/chuyển khoản.

Có hỗ trợ tiếng Việt thật sự không?

Có. MemoryAI có Vietnamese FTS dictionary tự build (PostgreSQL vietnamese_simple dict). Test detection: 96% chính xác cho VN, 100% cho EN. Recall query tiếng Việt match content tiếng Việt qua semantic + lexical fusion.

Server đặt ở đâu? Data ở đâu?

Server primary: VN. Data tuân VN PDPA Decree 13/2023, full DSR support qua public API (export/forget/delete). Privacy-preserving audit log (sha256 hashing). SOC 2 Type II observation period bắt đầu 29/05/2026, certified Q1 2028.

Một brain dùng được nhiều agent không?

Có. DNA #1: One brain. ∞ agents. Forever. Plan Personal trở lên unlimited agents/tenant. Một API key chia cho 10-50 AI agents khác nhau (chat, code, doctor, lawyer, robot — tương lai). Tất cả share một brain.

Khi tôi chết, brain đi đâu?

Theo will của anh em. Plan Pro+ có inheritance feature — designate người kế thừa brain. Person đó nhận access qua brain_export bundle. Đây là DNA #1 ở scale lifetime.

Khác vector store / RAG ở đâu?

MemoryAI hành xử như cortex thật: DNA-protected memory không phai, memory không dùng tự fade, association tự strengthen, brain tự consolidation. Trong khi vector store chỉ là database. Tự test trên brain của bạn tại /demo.html.

Tôi muốn self-host?

Lite-build sẽ có Q3 2027 (sau khi go-world). Hiện tại Enterprise plan unlock self-host docker compose. Liên hệ [email protected].

API rate limit như nào?

Plan	Calls/phút	Calls/ngày
Personal	30	1.000
Pro	120	10.000
Team	300	50.000
Business	600	200.000
Enterprise	Custom	Custom

Vượt rate limit → 429 với Retry-After header. Không charge thêm — chỉ throttle.

Tôi gặp bug?

GitHub issues: github.com/CH270035/memoryai/issues
Email: [email protected]
Telegram (founder): @tonyadam01

Sẵn sàng dùng?

Brain miễn phí 30 giây. Không thẻ tín dụng. Không lock-in.

Tạo brain miễn phí Xem giá

Trang chủ · Giá · Demo · English · Protocol v1 · GitHub

Đã thông báo Bộ Công Thương