knowledge-base

Author	SHA1	Message	Date
Pratik Narola	06875473b2	feat: enable reranker, automate backups, tune extraction prompt Three S-effort wins from the post-migration audit: #1 Enable Cohere reranker on both Memory.search call sites (rerank=True), over-fetch top_k=max(limit*3, 30) to give the reranker a 30-50 candidate pool, then truncate to the caller's limit. Bump reranker config to rerank-v3.5 (4096 ctx, multilingual — matters for Hindi/Hinglish traffic) and top_n 10 → 50 so the output cap doesn't truncate below typical over-fetch sizes. Cohere was configured but never invoked; this is the single biggest quality lift the audit surfaced. #2 Add scripts/backup_qdrant.sh and scripts/restore_test.sh. Daily snapshot of both collections back-to-back, docker cp to local YYYY-MM-DD dir, optional rclone off-host, prune local >14d, emit Prometheus textfile metric. Weekly restore_test.sh restores into a transient collection and asserts point count parity. Closes the zero-automated-backup gap. #3 Add CUSTOM_FACT_EXTRACTION_INSTRUCTIONS, wired via MemoryConfig's custom_instructions field. mem0 appends this as its own '## Custom Instructions' section in the additive-extraction user prompt (verified against generate_additive_extraction_prompt) — does not replace mem0's role/format guidance. Re-prioritizes the default consumer-organizer few-shots toward work/projects/ relationships/recurring context, the actual usage pattern here.	2026-05-23 19:53:59 +05:30
Pratik Narola	7cc8fc5112	deploy: route embedder through OpenAI-compat proxy instead of Ollama The custom OpenAI-compatible endpoint (LiteLLM) serves the same qwen3-embedding model and is reachable from the container in all deployments; direct Ollama may not be. Vectors stay compatible because the underlying model is the same. Captured from a beast production hotfix.	2026-05-23 15:03:02 +05:30
Pratik Narola	0f0addb36b	chore: migrate to mem0ai v2.0.2 (V3 memory pipeline) Pin mem0ai[nlp]==2.0.2 and fastembed for the new hybrid-search pipeline. Drop OSS graph memory (removed upstream in 2.0.0, PR #4805): remove Neo4j service, env vars, volumes, and driver deps; mark /graph/relationships deprecated. Rewrite Memory.search/get_all/chat/health call sites to use the v2 filters={} + top_k API (entity IDs at top level now raise ValueError). Tighten MCP remove_memory ownership check to O(1) verify_memory_ownership so it doesn't silently truncate at the new top_k=20 default. Downgrade base image to python:3.12-slim for spaCy. Adds scripts/migrate_qdrant_to_v3.py (scroll+upsert with per-user count parity check) and docs/MIGRATION_RUNBOOK.md covering snapshot, dump, collection rebuild, cutover, and rollback procedures.	2026-05-23 14:49:45 +05:30
Pratik Narola	82accabc73	fix: properly log settings as structlog kwargs instead of dropping them	2026-01-16 00:40:53 +05:30
Pratik Narola	6f9b545c15	fix: remove invalid positional arg from structlog call	2026-01-16 00:38:41 +05:30
Pratik Narola	5bcecf4649	fix: use structlog instead of logging in mem0_manager for kwargs support	2026-01-16 00:34:51 +05:30
Pratik Narola	a228780146	production improvements: configurable embeddings, v1.1, O(1) ownership, retries - Make Ollama URL configurable via OLLAMA_BASE_URL env var - Add version: v1.1 to Mem0 config (required for latest features) - Make embedding model and dimensions configurable - Fix ownership check: O(1) lookup instead of fetching 10k records - Add tenacity retry logic for database operations	2026-01-15 23:01:18 +05:30
Pratik Narola	50edce2d3c	security hardening: add auth, rate limiting, fix info disclosure - Add auth to /models and /users endpoints - Add rate limiting to all endpoints (10-120/min based on operation type) - Fix 11 info disclosure issues (detail=str(e) -> generic message) - Fix 2 silent except blocks with proper logging - Fix 7 raise e -> raise for proper exception chaining - Fix health check to not expose exception details - Update tests with X-API-Key headers and security tests	2026-01-15 22:41:24 +05:30
Pratik Narola	997865283f	added auth	2025-10-23 22:22:07 +05:30
Pratik Narola	e7b839810b	added reranker	2025-10-21 14:25:51 +05:30
Pratik Narola	f625f8f556	Migrated to Qdrant	2025-09-09 12:54:46 +05:30
Pratik Narola	28a8953ac5	Stable checkpoint with chat interface	2025-09-04 17:07:49 +05:30
Pratik Narola	f929165a89	updated versions fixed issues	2025-09-02 23:40:10 +05:30
Pratik Narola	aa7742c5ad	Added support for run id agent id	2025-09-02 17:28:41 +00:00
Pratik Narola	c864fe8895	small chang	2025-08-12 16:25:43 +05:30
Pratik Narola	7689409950	Initial commit: Production-ready Mem0 interface with monitoring - Complete Mem0 OSS integration with hybrid datastore - PostgreSQL + pgvector for vector storage - Neo4j 5.18 for graph relationships - Google Gemini embeddings integration - Comprehensive monitoring with correlation IDs - Real-time statistics and performance tracking - Production-grade observability features - Clean repository with no exposed secrets	2025-08-10 17:34:41 +05:30

16 commits