# Madeinoz Knowledge System
A persistent personal knowledge management system powered by the Graphiti knowledge graph. It automatically extracts entities, relationships, and temporal context from conversations, documents, and ideas.
## What It Does
The Madeinoz Knowledge System transforms your AI conversations into a permanent, searchable knowledge base:
- Automatically Learns: Extracts entities and relationships as you work
- Connects Concepts: Maps how ideas relate over time
- Semantic Search: Finds relevant knowledge using natural language
- Builds Context: Compounds knowledge across sessions
- Never Forgets: Persistent storage with temporal tracking
Core principle: Work normally, knowledge handles itself.
## System Architecture
The knowledge graph sits at the center, automatically organizing your conversations, documents, code, and notes into searchable entities, episodes, facts, and relationships. As memories age, they transition through lifecycle states (ACTIVE → DORMANT → ARCHIVED → EXPIRED) based on importance and stability scores, while Prometheus metrics and Grafana dashboards provide real-time observability.
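As a rough illustration of the decay model, the state transitions can be pictured as a score threshold check. The sketch below is hypothetical: the states match the system, but the scoring formula and thresholds are invented for illustration (the real model is documented in Memory Decay & Lifecycle).

```typescript
// Hypothetical sketch of lifecycle transitions. States mirror the system;
// the scoring formula and thresholds here are invented for illustration.
type LifecycleState = "ACTIVE" | "DORMANT" | "ARCHIVED" | "EXPIRED";

interface Memory {
  importance: number; // 0..1: how central the memory is
  stability: number;  // 0..1: how resistant it is to decay
  ageDays: number;    // days since last access
}

function nextState(m: Memory): LifecycleState {
  // Combine importance and stability, then decay the score with age.
  const score = (m.importance + m.stability) / 2 - m.ageDays / 365;
  if (score > 0.5) return "ACTIVE";
  if (score > 0.2) return "DORMANT";
  if (score > 0) return "ARCHIVED";
  return "EXPIRED";
}
```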
## Quick Start
New to the system? Follow this path:
1. Overview - What the system does and your first steps (5 min)
2. Installation Guide - Step-by-step setup instructions (15 min)
3. Basic Usage - How to capture and search knowledge (10 min)
4. Quick Reference - Commands at a glance
Total time to get started: 30 minutes
## Documentation Sections
### Getting Started
Start here if you're new:
- Overview - What the system does and quick start
- Quick Reference - Commands and natural language triggers
### Installation
Set up the system:
- Installation Guide - Complete setup instructions
- Requirements - Prerequisites and dependencies
- Verification - Confirm everything works
### Usage
Learn how to use the system:
- Basic Usage - Capturing and searching knowledge
- Advanced Usage - Bulk import, backup, multiple graphs
- Memory Decay & Lifecycle - Memory prioritization, decay scoring, and lifecycle management
- Monitoring - Prometheus and Grafana dashboards
### Concepts
Understand how it works:
- Architecture - System design and components
- Knowledge Graph - Episodes, entities, facts explained
### Troubleshooting
Fix common issues:
- Common Issues - Solutions to frequent problems
### Reference
Detailed specifications:
- CLI Reference - Command-line interface
- Configuration - Environment variables and settings
- Observability & Metrics - Prometheus metrics, monitoring, caching
- Model Guide - Ollama and LLM configuration
- Benchmarks - Model performance comparisons
## Natural Language Commands
The system responds to natural conversation:
| Say This | System Does |
|---|---|
| "Remember that..." | Captures knowledge with entity extraction |
| "What do I know about X?" | Searches knowledge base semantically |
| "How are X and Y related?" | Finds relationships between concepts |
| "What did I learn today?" | Temporal search - filter by date |
| "What did I learn recently?" | Shows recent knowledge additions |
| "Knowledge status" | Displays system health and statistics |
## Database Backends
Two graph database options:
| Backend | Best For | Web UI |
|---|---|---|
| Neo4j (default) | CTI/OSINT data, rich queries | localhost:7474 |
| FalkorDB | Simple setup, lower resources | localhost:3000 |
## Key Features
### Prompt Caching (Gemini via OpenRouter)
Reduce LLM costs by an estimated 15-20% with prompt caching for Gemini models via OpenRouter:
```bash
# Enable in ~/.claude/.env (disabled by default)
MADEINOZ_KNOWLEDGE_PROMPT_CACHE_ENABLED=true
MADEINOZ_KNOWLEDGE_MODEL_NAME=google/gemini-2.0-flash-001
MADEINOZ_KNOWLEDGE_OPENAI_API_KEY=sk-or-v1-your-key
MADEINOZ_KNOWLEDGE_OPENAI_BASE_URL=https://openrouter.ai/api/v1
```
Note: Prompt caching is disabled by default and must be explicitly enabled. The system uses explicit cache_control markers, similar to Anthropic's approach, rather than implicit caching. OpenRouter manages the cache lifecycle automatically once caching is enabled. See Observability & Metrics for details.
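For orientation, an explicit cache breakpoint on a message part looks roughly like the sketch below. The payload shape is illustrative (the server builds its own requests internally); only the cache_control marker format follows the Anthropic-style convention described above.

```typescript
// Sketch of a chat request with an explicit cache_control breakpoint.
// Illustrative payload, not the server's internal code: the marker asks
// OpenRouter to cache everything up to and including this content part.
const request = {
  model: "google/gemini-2.0-flash-001",
  messages: [
    {
      role: "system",
      content: [
        {
          type: "text",
          text: "<large, stable extraction prompt goes here>",
          cache_control: { type: "ephemeral" }, // cache up to this point
        },
      ],
    },
    { role: "user", content: "Episode text to extract entities from..." },
  ],
};
```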
### Observability & Metrics
Monitor API usage, costs, and performance via Prometheus:
```bash
# View metrics
curl http://localhost:9091/metrics | grep graphiti_

# Key metrics available:
# - graphiti_api_cost_total (USD spent)
# - graphiti_llm_request_duration_seconds (latency)
# - graphiti_cache_hit_rate (caching effectiveness)
```
See Observability & Metrics for full documentation.
## Common Commands
### Server Management
```bash
# Check status
bun run server-cli status

# Start server (production mode)
bun run server-cli start

# Stop server
bun run server-cli stop

# Restart server
bun run server-cli restart

# View logs
bun run server-cli logs

# View logs with options
bun run server-cli logs --mcp --tail 50
```
### Development Mode
For developing and testing changes to the server code:
```bash
# Start in development mode (uses dev ports and env files)
bun run server-cli start --dev

# Restart in development mode
bun run server-cli restart --dev

# Dev mode differences:
# - Neo4j Browser: http://localhost:7475 (instead of 7474)
# - MCP Server: http://localhost:8001/mcp/ (instead of 8000)
# - Uses /tmp/madeinoz-knowledge-*-dev.env files
# - Safe for development without affecting production
```
### Memory Sync
```bash
# Manual sync (from installed location)
bun run ~/.claude/hooks/sync-memory-to-knowledge.ts

# Dry run (see what would sync)
bun run ~/.claude/hooks/sync-memory-to-knowledge.ts --dry-run
```
## Need Help?
- Check the Troubleshooting Guide
- Review Key Concepts
- Look for examples in the Usage Guide
## Quick Reference Card
### MCP Tools
| Tool | Purpose | Example |
|---|---|---|
| `add_memory` | Store knowledge | `{"name": "Note", "episode_body": "...", "group_id": "main"}` |
| `search_memory_nodes` | Find entities | `{"query": "Python frameworks", "limit": 10}` |
| `search_memory_facts` | Find relationships | `{"query": "how X relates to Y"}` |
| `get_episodes` | Temporal retrieval | `{"group_id": "main", "last_n": 10}` |
| `get_status` | Health check | `{}` |
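As a worked example, a raw add_memory call against the production MCP endpoint might look like the sketch below. This assumes the streamable-HTTP transport accepts a plain JSON-RPC POST; in practice most clients go through an MCP SDK instead.

```typescript
// Hypothetical direct JSON-RPC call to the MCP server's add_memory tool.
// Most clients use an MCP SDK; this only illustrates the tool arguments.
const response = await fetch("http://localhost:8000/mcp/", {
  method: "POST",
  headers: {
    "Content-Type": "application/json",
    Accept: "application/json, text/event-stream",
  },
  body: JSON.stringify({
    jsonrpc: "2.0",
    id: 1,
    method: "tools/call",
    params: {
      name: "add_memory",
      arguments: {
        name: "Note",
        episode_body: "Bun is a fast all-in-one JavaScript runtime.",
        group_id: "main",
      },
    },
  }),
});
console.log(await response.json());
```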
### Environment Variables (Essential)
```bash
# LLM (required)
MADEINOZ_KNOWLEDGE_LLM_PROVIDER=openai
MADEINOZ_KNOWLEDGE_MODEL_NAME=google/gemini-2.5-flash
MADEINOZ_KNOWLEDGE_OPENAI_API_KEY=sk-or-v1-...
MADEINOZ_KNOWLEDGE_OPENAI_BASE_URL=https://openrouter.ai/api/v1

# Embeddings (required)
MADEINOZ_KNOWLEDGE_EMBEDDER_PROVIDER=ollama
MADEINOZ_KNOWLEDGE_EMBEDDER_MODEL=mxbai-embed-large
MADEINOZ_KNOWLEDGE_EMBEDDER_DIMENSIONS=1024
MADEINOZ_KNOWLEDGE_EMBEDDER_BASE_URL=http://host.docker.internal:11434/v1

# Caching (optional, recommended)
MADEINOZ_KNOWLEDGE_PROMPT_CACHE_ENABLED=true
MADEINOZ_KNOWLEDGE_PROMPT_CACHE_METRICS_ENABLED=true
```
### Ports
| Port | Service | Environment |
|---|---|---|
| 8000 | MCP Server | Production |
| 8001 | MCP Server | Development |
| 7474 | Neo4j Browser | Production |
| 7475 | Neo4j Browser | Development |
| 9090 | Prometheus Metrics | Production |
| 9091 | Prometheus Metrics | Development |
### Key Metrics
| Metric | What It Measures |
|---|---|
| `graphiti_api_cost_total` | Total USD spent on LLM API |
| `graphiti_total_tokens_total` | Total tokens consumed |
| `graphiti_cache_hit_rate` | Cache effectiveness (%) |
| `graphiti_llm_request_duration_seconds` | Request latency |
| `graphiti_llm_errors_total` | API error count |
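For a quick programmatic spot-check of a single metric, something like the following sketch works: naive line matching against the Prometheus text format rather than a metrics client, with the port taken from the Ports table above.

```typescript
// Hypothetical spot-check: fetch the metrics endpoint and print the
// current API cost counter using naive line matching.
const text = await (await fetch("http://localhost:9090/metrics")).text();
const line = text
  .split("\n")
  .find((l) => l.startsWith("graphiti_api_cost_total"));
console.log(line ?? "metric not found");
```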
### Limits & Constraints
| Limit | Default | Configurable | Notes |
|---|---|---|---|
| Rate limit | 60 req/60s per IP | Yes | RATE_LIMIT_MAX_REQUESTS, RATE_LIMIT_WINDOW_SECONDS |
| Concurrent LLM requests | 10 | Yes | SEMAPHORE_LIMIT |
| Search results (nodes) | 10 | Yes | max_nodes parameter |
| Search results (facts) | 10 | Yes | max_facts parameter |
| Search results (episodes) | 10 | Yes | max_episodes parameter |
| Cache minimum tokens | 1024 | No | Requests < 1024 tokens skip caching |
| Episode body size | No limit | N/A | Limited only by LLM context window |
Note: Episode body size is not explicitly limited. Very large episodes (>100KB) may cause slow processing or LLM context overflow. For bulk imports, consider chunking large documents.
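A minimal chunking sketch for such bulk imports is shown below. The paragraph-based splitting and the 50,000-character budget are assumptions, not system requirements; pick a budget that fits comfortably inside your model's context window.

```typescript
// Hypothetical helper: split a large document into episode-sized chunks
// on paragraph boundaries. The 50,000-character budget is an assumption,
// not a system limit; each chunk becomes one add_memory episode.
// (A single oversized paragraph still becomes its own chunk.)
function chunkDocument(text: string, maxChars = 50_000): string[] {
  const paragraphs = text.split(/\n\s*\n/);
  const chunks: string[] = [];
  let current = "";
  for (const p of paragraphs) {
    if (current && current.length + p.length + 2 > maxChars) {
      chunks.push(current);
      current = "";
    }
    current = current ? `${current}\n\n${p}` : p;
  }
  if (current) chunks.push(current);
  return chunks;
}
```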
## Credits & Acknowledgments
This system is built on Graphiti by Zep AI, with graph database support from Neo4j and FalkorDB. It is part of the Personal AI Infrastructure (PAI) ecosystem.
See Acknowledgments for full credits to projects, research, and the community that inspired this system.

