Open Source Projects
Tools and libraries we've built and released to the community. All MIT licensed, all actively maintained.
soul.py
Featured · Persistent identity and memory for any LLM agent
Your AI forgets everything when the conversation ends. soul.py fixes that, scaling from simple markdown injection to full RAG + RLM hybrid retrieval. v0.1 uses pure markdown files. v2.0 indexes those memories and automatically routes each query to semantic search (RAG) or exhaustive reasoning (RLM) based on what the question needs. Human-readable files you can edit and git-version, with intelligent retrieval under the hood.
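The v0.1 "markdown injection" idea fits in a few lines. This is an illustrative sketch, not soul-agent's actual API; the file path and prompt layout are assumptions:

```python
from pathlib import Path

def build_system_prompt(base_prompt: str, memory_path: str = "soul.md") -> str:
    """Prepend a persistent markdown memory file to the system prompt.

    Sketch of the v0.1 approach: the agent's identity and memories live in
    a human-readable, git-versionable markdown file.
    """
    memory_file = Path(memory_path)
    if not memory_file.exists():
        return base_prompt
    return f"{base_prompt}\n\n## Persistent memory\n{memory_file.read_text()}"
```

Because the prompt is rebuilt from the current file on every turn, edits you make by hand (or roll back via git) take effect immediately.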
pip install soul-agent
🔥 Community Response
soul.py hit #1 on r/ollama with 50,000+ views in under 48 hours.
soul.py β Persistent memory for any LLM in 10 lines (works with Ollama, no database)
by u/the_ai_scientist in r/ollama
📖 The Book: Now on Amazon!
Soul: Building AI Agents That Remember Who They Are is the complete guide to persistent AI memory. It covers identity vs. memory architecture, RAG + RLM hybrid retrieval, multi-agent coordination, and Darwinian evolution of agent identity, with working code in every chapter.
→ Get on Amazon | → Gumroad Bundle (PDF + setup wizard + cheatsheets)
🤖 Meet Darwin, the AI companion built with soul.py that helps you explore the book. A living demonstration of everything it teaches.
🗺️ Roadmap
Planned features and improvements. PRs welcome!
Vector Database Support
- ✅ Qdrant (current)
- ✅ ChromaDB (local, zero-config) v0.1.2
- 🔜 RuVector: self-learning vector DB (GNN improves search over time, tamper-proof audit chain, graph queries, MIT/free forever), github.com/ruvnet/ruvector
- 🔲 pgvector (PostgreSQL)
- 🔲 FAISS (local, fast)
- 🔲 Pinecone (cloud)
- 🔲 Weaviate
Embedding Providers
- ✅ Azure OpenAI (current)
- ✅ OpenAI direct v0.1.2
- 🔲 Cohere
- 🔲 Local embeddings (sentence-transformers)
- 🔲 Ollama embeddings
CLI & Developer Experience
- ✅ `soul init` wizard
- ✅ `soul chat` interactive CLI v0.1.2
- ✅ `soul status` memory stats v0.1.2
- ✅ Graceful Ollama/local handling in CLI v0.1.3
- ✅ `soul modulize` memory segmentation v0.2.0
- ✅ `soul modules` list/reindex v0.2.0
- 🔲 `config.yaml` file support
- 🔲 VSCode extension
Memory Features
- ✅ Timestamped conversation logging
- ✅ RAG + RLM hybrid routing
- ✅ Modulizer: auto-segment large memory into indexed modules v0.2.0
- ✅ Two-phase retrieval: read index, fetch relevant modules only v0.2.0
- 🔲 Automatic memory summarization
- 🔲 Memory importance scoring
- 🔲 Tiered memory (hot/warm/cold)
- 🔲 Archive-before-prune: index to vector DB before deleting old files
- 🔲 Frozen storage: S3/GCS backup for disaster recovery
- 🔲 Memory export/import
- 🔜 Typed memory structures: classify memories as World Facts, Experiences, Opinions, or Mental Models for more precise retrieval
- 🔜 Confidence scores: attach a confidence weight to stored beliefs; memories reinforced over time carry higher weight in retrieval
- 🔜 `reflect()` operation: periodic synthesis of raw memories into higher-order mental models; makes soul.py compound knowledge over time, not just accumulate it
- 🔜 Temporal decay weighting: recent memories score higher in retrieval; configurable decay curve
Retrieval Enhancements
- 🔲 LLM reranking: score/filter RAG results before generation
- 🔜 Hybrid retrieval (semantic + BM25): parallel keyword + vector search with score fusion; catches exact terms that drift in embedding space
- 🔲 Query expansion: LLM rewrites the query for better recall
- 🔲 Dynamic snippet extraction: context windows around matches
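One standard way to fuse the keyword and vector rankings mentioned above is reciprocal rank fusion (RRF). Whether soul.py will use RRF or weighted score fusion is not specified, so treat this as a generic sketch:

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse multiple ranked result lists (e.g. BM25 and vector search).

    Each document earns 1 / (k + rank) from every list it appears in;
    k=60 is the value commonly used in the RRF literature.
    """
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```

RRF needs no score normalization, which is why it is popular for fusing BM25 (unbounded scores) with cosine similarity (bounded) without tuning.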
Integrations
- ✅ Anthropic Claude
- ✅ OpenAI
- ✅ Ollama / OpenAI-compatible
- ✅ Google Gemini v0.1.6
- ✅ LangChain memory backend (langchain-soul) v0.1.1
- ✅ LlamaIndex integration (llamaindex-soul) v0.1.1
- 🔲 n8n node (official)
litecrew
Multi-agent orchestration in ~150 lines. No magic.
CrewAI has 15,000 lines of code. LangGraph requires a PhD to debug. litecrew is ~150 lines you can read during lunch. It does less, and that's the feature. Define agents, hand off between them, track tokens. When you need more, graduate to CrewAI + crewai-soul.
pip install litecrew
✨ What It Does
- ✅ Agent: model + tools + system prompt
- ✅ Sequential: A → B → C handoffs
- ✅ Parallel: fan out to multiple agents
- ✅ Tools: OpenAI function-calling format
- ✅ Tokens: built-in usage tracking
- ✅ Memory: optional soul-agent integration
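The sequential handoff above is just a loop that threads each agent's output into the next. The `Agent` dataclass here is an illustrative stand-in, not litecrew's actual API:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Agent:
    # Stand-in for an LLM-backed agent: a name plus a text -> text
    # function (a real agent would call a model here).
    name: str
    run: Callable[[str], str]

def sequential(agents: list[Agent], task: str) -> str:
    """Pass the output of each agent as input to the next (A -> B -> C)."""
    result = task
    for agent in agents:
        result = agent.run(result)
    return result

researcher = Agent("researcher", lambda t: f"notes on: {t}")
writer = Agent("writer", lambda t: f"draft from {t}")
print(sequential([researcher, writer], "persistent memory"))
```

Parallel fan-out is the same idea with a list comprehension (or a thread pool) instead of the loop.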
🚫 What It Doesn't (By Design)
- No hierarchical agent management
- No complex state machines
- No streaming, callbacks, or YAML config
- No human-in-the-loop workflows
If you need these, fork it or graduate to CrewAI.
🔍 SoulSearch
v0.3 · AI browser extension with private memory, Ollama support, and browser automation
SoulSearch brings persistent memory and identity directly into Chrome. v0.3 adds Ollama support: run local LLMs with no API keys. Agent mode browses pages, fills forms, and searches the web. Memory is stored in a private Git repo you control.
git clone https://github.com/menonpg/soulsearch && git checkout feat/ollama-support
soul-legacy
Your digital estate vault: encrypted, AI-queryable, with a dead man's switch
When someone dies, their family spends months hunting for documents. soul-legacy fixes that. Store your assets, insurance, wills, debts, beneficiaries, and final wishes in one encrypted vault. Upload documents and ask questions in plain English. Configure a dead man's switch to automatically grant scoped access to your designated inheritors when the time comes.
pip install soul-legacy
soul-schema
Auto-document your data warehouse in 3 minutes
You inherit 100 tables. Zero docs. Columns named cust_ltv, flg_b2b, reg_cd.
The person who knew what they meant left in 2019. soul-schema connects to any database, reads the schema,
and uses an LLM to generate human-readable descriptions for every table and column. Corrections are
"locked" β future runs won't overwrite your edits. The semantic layer learns over time.
pip install soul-schema
crewai-soul
The soul ecosystem for CrewAI agents
CrewAI's built-in memory is a black box. crewai-soul stores memories in human-readable markdown files you can edit and git-version. Same drop-in API, full RAG+RLM hybrid retrieval under the hood via soul-agent. Choose local (file-based) or managed (SoulMate API); same great memory either way.
pip install crewai-soul
✨ What's Included
- soul-agent: RAG + RLM hybrid memory (required dep)
- soul-schema: Database semantic layers (required dep)
- SoulMateMemory: Drop-in managed cloud backend
- SchemaMemory: Database context for Text-to-SQL agents
langchain-soul
The soul ecosystem for LangChain
Drop-in persistent memory for LangChain. Same soul-agent RAG+RLM, same SoulMate cloud option, same SchemaMemory for database intelligence. Works with ConversationChain, RunnableWithMessageHistory, and any LangChain component that uses memory.
pip install langchain-soul
llamaindex-soul
The soul ecosystem for LlamaIndex
Drop-in chat storage for LlamaIndex. Uses soul-agent's hybrid RAG+RLM retrieval under the hood. Works with ChatMemoryBuffer, FunctionAgent, and any LlamaIndex component that uses chat stores. Same file-based or SoulMate cloud options as the rest of the ecosystem.
pip install llamaindex-soul
🐳 soul-stack
New · One Docker command to give n8n persistent memory
n8n is stateless by design: every workflow execution starts fresh. soul-stack fixes that. A single Docker container running n8n + soul.py + Jupyter Lab. Your workflows can now remember previous interactions, build context over time, and make intelligent decisions based on history. Works with Anthropic, OpenAI, or 100% local with Ollama.
docker run -d -p 8000:8000 -p 8888:8888 -p 5678:5678 -e ANTHROPIC_API_KEY=sk-ant-... pgmenon/soul-stack:latest
✨ Features (v0.1.3)
- Multi-provider: Anthropic, OpenAI, or Ollama (100% local)
- Backend selection: BM25 (default), ChromaDB, or Qdrant via `SOUL_BACKEND`
- OpenAI embeddings: direct support, not just Azure
- CLI tools: `soul chat` and `soul status` with graceful Ollama handling
Looking for Enterprise?
SoulMate brings soul.py to production at scale: HIPAA-compliant healthcare, telecom support for millions of customers, financial services personalization. The commercial embodiment of persistent AI memory.
Licensing: The SoulMate API backend (soulmate-api) is source-available under BSL 1.1. Self-host freely, deploy on your own cloud; you just can't resell it as a competing hosted service. It automatically converts to MIT on March 4, 2030. Also available on Docker Hub.
Learn About SoulMate →
Want to Contribute?
All projects welcome PRs. Check the GitHub issues for good first contributions, or open a discussion if you have ideas.