nextcloud-mcp-server

Author	SHA1	Message	Date
github-actions[bot]	f44bf3e8f2	bump: version 0.30.0 → 0.31.0 nextcloud-mcp-server-0.31.0 v0.31.0	2025-11-10 07:02:49 +00:00
Chris Coutinho	37141003d8	Merge pull request #283 from cbcoutinho/feat/adr-010-webhook-vector-sync docs: Add ADR-010 for webhook-based vector sync	2025-11-10 08:02:22 +01:00
Chris Coutinho	c787abf2f3	fix: add retry logic for ETag conflicts in category change test The test_attachments_category_change_handling test was failing in CI with HTTP 412 Precondition Failed errors. This is caused by the background vector scanner (runs every 10 seconds) modifying notes between when the test fetches the ETag and when it attempts to update the category. Solution: Added retry logic (up to 3 attempts) that refetches the latest ETag and retries the update operation when encountering 412 errors. This handles the race condition gracefully while still catching genuine errors.	2025-11-10 07:41:02 +01:00
Chris Coutinho	b32324cb76	feat: skip tracing for health and metrics endpoints Health check and metrics endpoints are frequently polled and don't provide meaningful trace data. This change skips OpenTelemetry span creation for: - /health/* (liveness, readiness checks) - /metrics (Prometheus metrics) These endpoints still record Prometheus metrics (request count, latency, in-flight requests) but no longer create trace spans, reducing tracing noise and storage costs. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 07:24:27 +01:00
Chris Coutinho	640a7818f9	fix: optimize Notes API pagination with pruneBefore parameter The Nextcloud Notes API intentionally returns all note IDs (with only 'id' field) in the last chunk to enable deletion detection. Without using the pruneBefore parameter, this causes duplicates - all notes appear with full data in chunks, then again with minimal data in the last chunk. This commit implements proper pruneBefore support: - NotesClient.get_all_notes() now accepts prune_before timestamp parameter - Scanner calculates max(indexed_at) from Qdrant to use as prune threshold - Only notes modified after this timestamp are sent with full data - Deduplication logic handles the API's deletion detection pattern - Significantly reduces data transfer for incremental syncs The behavior is documented in Notes API v1 spec - this is not an API bug, but a feature we weren't utilizing correctly. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 07:19:26 +01:00
Chris Coutinho	8e5d0b5df1	Merge pull request #276 from cbcoutinho/renovate/pin-dependencies chore(deps): pin qdrant/qdrant docker tag to 0fb8897	2025-11-10 06:48:01 +01:00
Chris Coutinho	851d21f56e	Merge pull request #284 from cbcoutinho/renovate/lock-file-maintenance chore(deps): lock file maintenance	2025-11-10 06:47:35 +01:00
renovate-bot-cbcoutinho[bot]	fb1af697f7	chore(deps): lock file maintenance	2025-11-10 05:13:55 +00:00
renovate-bot-cbcoutinho[bot]	bf4eed6007	chore(deps): pin qdrant/qdrant docker tag to 0fb8897	2025-11-10 05:12:36 +00:00
Chris Coutinho	3a41860d27	docs: Add ADR-010 for webhook-based vector sync Add architecture decision record for integrating Nextcloud webhooks into the vector database synchronization system. Key features: - Webhook endpoint at /webhooks/nextcloud receives push notifications - Complements existing polling (ADR-007) without replacing it - Optional authentication via WEBHOOK_SECRET - Simple architecture: webhooks are just another DocumentTask producer - Administrators can reduce polling frequency when webhooks are configured Benefits: - Reduced latency: seconds to minutes instead of up to 1 hour - Lower API load: ~95% reduction when polling frequency is increased - Better scalability: only process changed documents - No changes required to scanner or processor components 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 05:28:36 +01:00
github-actions[bot]	126b5a7626	bump: version 0.29.2 → 0.30.0 nextcloud-mcp-server-0.30.0 v0.30.0	2025-11-10 02:50:11 +00:00
Chris Coutinho	4d3ff1abe1	Merge pull request #282 from cbcoutinho/feat/multi-embedding-model-support feat(vector): Support multiple embedding models with auto-generated collection names	2025-11-10 03:49:48 +01:00
Chris Coutinho	d80e54ff97	feat(helm): Add document chunking configuration Add support for configurable document chunking parameters to Helm chart to match docker-compose and application capabilities. Changes: 1. values.yaml: - Add documentChunking section with chunkSize (512) and chunkOverlap (50) - Include comprehensive comments explaining chunking strategies - Positioned between vectorSync and qdrant sections 2. templates/deployment.yaml: - Add DOCUMENT_CHUNK_SIZE and DOCUMENT_CHUNK_OVERLAP env vars - Always set (not conditional), used by vector sync processor - Environment variables follow same pattern as config.py defaults 3. README.md: - Add documentChunking parameter table in Vector Search section - Document chunking strategies (small/medium/large chunks) - Explain overlap recommendations (10-20% of chunk size) Validation: - helm lint: Passes - helm template: Environment variables correctly generated - Custom values: Work as expected (tested with chunkSize=1024) - Always present: Not conditional on vectorSync.enabled This maintains feature parity between Helm and docker-compose deployments, allowing users to tune chunking for their embedding models and use cases. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 03:34:16 +01:00
Chris Coutinho	157e433d65	fix: Support in-memory Qdrant for CI testing Changes to make tests work without external qdrant/ollama dependencies: 1. docker-compose.yml (mcp service): - Switch from QDRANT_URL (network mode) to QDRANT_LOCATION=":memory:" - Comment out QDRANT_URL and QDRANT_API_KEY (not needed for in-memory) - Keep OLLAMA_BASE_URL commented out (use SimpleEmbeddingProvider fallback) 2. nextcloud_mcp_server/vector/qdrant_client.py: - Fix collection creation bug in in-memory mode - Previously: All ValueError exceptions were re-raised - Now: Only dimension mismatch ValueError is re-raised - Allows "Collection not found" ValueError to trigger auto-creation 3. tests/integration/test_sampling.py: - Update test to handle all sampling unsupported cases - Check for multiple fallback search_method values - Skip test gracefully when sampling unavailable This configuration enables: - CI testing without external services (qdrant, ollama) - In-memory vector database (ephemeral but sufficient for tests) - SimpleEmbeddingProvider for embeddings (feature hashing, 384 dims) - Automatic collection creation on first use Test result: test_semantic_search_answer_successful_sampling now passes (skipped with appropriate message when sampling unsupported) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 03:21:27 +01:00
Chris Coutinho	94d16092c0	ci: Add qdrant profile to docker compose up command	2025-11-10 03:09:50 +01:00
Chris Coutinho	cb39b3fca4	feat(vector): Add configurable chunk size and overlap for document embedding Enable users to tune document chunking parameters to match their embedding model and content type by adding DOCUMENT_CHUNK_SIZE and DOCUMENT_CHUNK_OVERLAP environment variables. - config.py: Added `document_chunk_size` (default: 512) and `document_chunk_overlap` (default: 50) configuration fields with validation: - Ensures overlap < chunk_size - Warns if chunk_size < 100 words - Prevents negative overlap values - processor.py: Updated DocumentChunker instantiation to use config settings instead of hardcoded values (line 174-177) - tests/unit/test_config.py: Added TestChunkConfigValidation class with 9 tests covering: - Default values - Valid configurations - Validation errors (overlap >= chunk_size, negative overlap) - Warning for small chunk sizes - Environment variable loading - docs/configuration.md: Added comprehensive "Document Chunking Configuration" section with: - Chunk size selection guidance (256-384 vs 512 vs 768-1024 words) - Overlap recommendations (10-20% of chunk size) - Configuration examples for different use cases - Added env vars to reference table - docs/semantic-search-architecture.md: Added "Document Chunking Strategy" section with: - Chunking process explanation - Example showing sliding window behavior - Search behavior with chunks - Tuning recommendations - env.sample: Added complete "Semantic Search & Vector Sync Configuration" section with: - Vector sync settings - Qdrant configuration (3 modes) - Ollama embedding service - Document chunking configuration - docker-compose.yml: Added commented examples for DOCUMENT_CHUNK_SIZE and DOCUMENT_CHUNK_OVERLAP with usage notes \`\`\`bash DOCUMENT_CHUNK_SIZE=512 DOCUMENT_CHUNK_OVERLAP=50 \`\`\` 1. \`overlap\` must be less than \`chunk_size\` 2. \`overlap\` cannot be negative 3. Warning issued if \`chunk_size\` < 100 words Precise matching (small notes, specific queries): \`\`\`bash DOCUMENT_CHUNK_SIZE=256 DOCUMENT_CHUNK_OVERLAP=25 \`\`\` Balanced (default, general purpose): \`\`\`bash DOCUMENT_CHUNK_SIZE=512 DOCUMENT_CHUNK_OVERLAP=50 \`\`\` Contextual (long documents, broader topics): \`\`\`bash DOCUMENT_CHUNK_SIZE=1024 DOCUMENT_CHUNK_OVERLAP=100 \`\`\` ✅ User control - Tune chunking to match embedding model capabilities ✅ Experimentation - Test different chunk sizes for optimal results ✅ Model alignment - Match chunk size to embedding context window ✅ Backward compatible - Defaults maintain existing behavior ✅ Well validated - Comprehensive tests prevent misconfiguration All 22 config validation tests pass (9 new tests for chunking): - Default values work correctly - Validation prevents invalid configurations - Environment variables load properly - Warning system works as expected With configurable chunk sizes, users can now experiment with different Ollama embedding models and tune chunk parameters for optimal semantic search quality. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 02:47:57 +01:00
Chris Coutinho	f3050e9b45	chore: Remove /health and /metrics endpoints from logging	2025-11-10 02:07:45 +01:00
Chris Coutinho	e575c8e57b	feat(vector): Support multiple embedding models with auto-generated collection names This PR enables safe switching between embedding models and multi-server deployments by implementing auto-generated Qdrant collection names based on deployment ID and model name. ## Problem Previously, all deployments used a single hardcoded collection name "nextcloud_content", which caused two critical issues: 1. Dimension mismatches when switching models: Changing OLLAMA_EMBEDDING_MODEL (e.g., nomic-embed-text at 768D → all-minilm at 384D) would cause runtime errors as vectors couldn't be inserted into a collection with incompatible dimensions. 2. Collection collisions in multi-server setups: Multiple MCP servers sharing a single Qdrant instance would overwrite each other's data, making horizontal scaling impossible. ## Solution ### Auto-Generated Collection Naming Collections are now automatically named using the pattern: \`{deployment-id}-{model-name}\` Deployment ID: Uses \`OTEL_SERVICE_NAME\` if configured (and not default value), otherwise falls back to \`hostname\` for simple Docker deployments. Model Name: From \`OLLAMA_EMBEDDING_MODEL\` with path separators sanitized. Examples: - \`my-mcp-server-nomic-embed-text\` (with OTEL_SERVICE_NAME=my-mcp-server) - \`mcp-container-all-minilm\` (simple Docker, hostname=mcp-container) Override: Users can still set \`QDRANT_COLLECTION\` explicitly to bypass auto-generation for backward compatibility. ### Dimension Validation Added startup validation that checks collection dimensions match the embedding service. If a mismatch is detected, the server fails fast with a clear error message explaining: - Expected vs actual dimensions - Likely cause (model change) - Solutions (delete collection, use different name, or revert model) ### Improved Sampling Error Handling Enhanced MCP sampling rejection handling to treat user rejections as normal behavior rather than errors: - User rejections ("rejected", "denied") → INFO log, no traceback - Unsupported clients → INFO log, no traceback - Other MCP errors → WARNING log, no traceback - Unexpected errors → ERROR log WITH traceback This aligns with the MCP specification where clients SHOULD prompt users for approval/denial of sampling requests. ## Changes ### Core Implementation - nextcloud_mcp_server/config.py: Added \`get_collection_name()\` method with deployment ID detection and model name sanitization - nextcloud_mcp_server/vector/qdrant_client.py: Dimension validation on collection open with helpful error messages - nextcloud_mcp_server/vector/{scanner,processor}.py: Updated to use \`get_collection_name()\` - nextcloud_mcp_server/auth/userinfo_routes.py: Vector sync status uses \`get_collection_name()\` - nextcloud_mcp_server/server/semantic.py: - Updated semantic search tools to use \`get_collection_name()\` - Improved sampling rejection error handling (McpError vs Exception) ### Documentation - docs/semantic-search-architecture.md: New comprehensive architecture document (557 lines) covering background sync, semantic search flow, RAG implementation, and deployment modes - docs/configuration.md: Added detailed "Qdrant Collection Naming" section with examples and multi-server deployment guidance - docker-compose.yml: Added comments explaining collection naming behavior - README.md: Updated semantic search descriptions to clarify experimental status, Notes-only support, and infrastructure requirements ## Migration Guide For existing single-server deployments: Option 1 (Recommended): Use explicit collection name for continuity \`\`\`bash QDRANT_COLLECTION=nextcloud_content # Keep existing collection \`\`\` Option 2: Allow auto-generation and re-embed \`\`\`bash # Remove QDRANT_COLLECTION override # New collection will be created based on deployment ID + model # Requires re-embedding all documents (may take time) \`\`\` For new multi-server deployments: Set unique OTEL service names per server: \`\`\`bash # Server 1 OTEL_SERVICE_NAME=mcp-prod OLLAMA_EMBEDDING_MODEL=nomic-embed-text # → Collection: "mcp-prod-nomic-embed-text" # Server 2 OTEL_SERVICE_NAME=mcp-staging OLLAMA_EMBEDDING_MODEL=nomic-embed-text # → Collection: "mcp-staging-nomic-embed-text" \`\`\` ## Benefits ✅ Safe model switching: Each model gets its own collection, preventing dimension mismatch errors ✅ Multi-server support: Multiple MCP servers can share one Qdrant instance without conflicts ✅ Clear ownership: Collection names show which deployment and model owns the data ✅ Better error messages: Dimension validation provides actionable guidance ✅ Backward compatible: Existing deployments can continue using \`QDRANT_COLLECTION\` override ## Testing Validated with: - Single-server deployments (default hostname-based naming) - Multi-server deployments (OTEL service name-based naming) - Model switching scenarios (dimension validation) - Collection override scenarios (backward compatibility) Next steps: Testing various Ollama embedding models to investigate optimal chunk sizes and performance characteristics. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 01:18:30 +01:00
github-actions[bot]	a0576aa9a2	bump: version 0.29.1 → 0.29.2 nextcloud-mcp-server-0.29.2 v0.29.2	2025-11-09 18:28:34 +00:00
Chris Coutinho	4a6c60113b	fix(helm): Set default strategy to Recreate	2025-11-09 19:27:55 +01:00
Chris Coutinho	a0cb1ac9fe	Merge pull request #281 from cbcoutinho/renovate/qdrant-1.x chore(deps): update helm release qdrant to v1	2025-11-09 18:38:22 +01:00
renovate-bot-cbcoutinho[bot]	de4f1032aa	chore(deps): update helm release qdrant to v1	2025-11-09 17:08:13 +00:00
Chris Coutinho	178be5da6d	Merge pull request #279 from cbcoutinho/renovate/ollama-1.x chore(deps): update helm release ollama to v1.34.0	2025-11-09 18:04:08 +01:00
Chris Coutinho	61d8c851c9	Merge pull request #272 from cbcoutinho/renovate/softprops-action-gh-release-2.x chore(deps): update softprops/action-gh-release action to v2.4.2	2025-11-09 17:02:19 +01:00
Chris Coutinho	a8c63c8379	Merge pull request #278 from cbcoutinho/renovate/azure-setup-helm-4.x chore(deps): update azure/setup-helm action to v4.3.1	2025-11-09 17:01:59 +01:00
renovate-bot-cbcoutinho[bot]	3147180ccd	chore(deps): update helm release ollama to v1.34.0	2025-11-09 11:08:18 +00:00
renovate-bot-cbcoutinho[bot]	380578dd2e	chore(deps): update softprops/action-gh-release action to v2.4.2	2025-11-09 11:07:57 +00:00
renovate-bot-cbcoutinho[bot]	10c5557aea	chore(deps): update azure/setup-helm action to v4.3.1	2025-11-09 11:07:52 +00:00
github-actions[bot]	7772b1ac2e	bump: version 0.29.0 → 0.29.1 nextcloud-mcp-server-0.29.1 v0.29.1	2025-11-09 08:54:26 +00:00
Chris Coutinho	0513bec105	Merge pull request #275 from cbcoutinho/feature/observability-monitoring fix(observability): isolate metrics endpoint to dedicated port	2025-11-09 09:54:00 +01:00
Chris Coutinho	4e89e92b65	fix(observability): isolate metrics endpoint to dedicated port Security fix: Move Prometheus metrics endpoint from main HTTP port to dedicated port 9090 to prevent external exposure of metrics data. Changes: - Use prometheus_client.start_http_server() for dedicated metrics server - Remove /metrics route from main application routes - Metrics now only accessible on port 9090 (configurable via METRICS_PORT) - Main application port no longer serves /metrics endpoint This follows security best practice of isolating monitoring endpoints from application traffic. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 09:53:36 +01:00
github-actions[bot]	af96378cb6	bump: version 0.28.0 → 0.29.0 nextcloud-mcp-server-0.29.0 v0.29.0	2025-11-09 08:29:53 +00:00
Chris Coutinho	c5da11aa4c	Merge pull request #274 from cbcoutinho/feature/observability-monitoring feature/observability monitoring	2025-11-09 09:29:25 +01:00
Chris Coutinho	5e4667a643	fix(readiness): Only check external Qdrant in network mode The readiness probe incorrectly tried to connect to an external Qdrant service even when using memory or persistent mode (embedded Qdrant). This caused pods to never become ready in Kubernetes deployments using the default configuration. Root cause: - In memory/persistent modes, QDRANT_URL env var is NOT set - Readiness check used default 'http://qdrant:6333' anyway - Tried to connect to non-existent service - Connection failed -> 503 -> pod stuck in not-ready state Fix: - Only check external Qdrant health if QDRANT_URL is explicitly set (network mode) - For embedded modes (memory/persistent), report status as 'embedded' without blocking - Background scanner tasks don't block readiness (already non-blocking via anyio.start_soon) This allows pods to become ready immediately when using embedded Qdrant, while still validating external Qdrant connectivity in network mode. Fixes: Kubernetes pods failing readiness check with default Qdrant configuration 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 09:28:09 +01:00
Chris Coutinho	093ac5b5ba	feat(helm): Add observability support with ServiceMonitor and Grafana dashboard Add comprehensive observability configuration to Helm chart: Helm Values: - Add observability configuration section for metrics, tracing, and logging - Add serviceMonitor configuration (disabled by default) - Add prometheusRule configuration (disabled by default) Templates: - Update deployment to include observability environment variables - Update deployment to expose metrics port (9090) - Update service to expose metrics port - Add ServiceMonitor template for Prometheus Operator - Add PrometheusRule template with critical and warning alerts Dashboards: - Add comprehensive Grafana dashboard JSON with 6 panels: - Request Rate (by method and endpoint) - Error Rate (5xx errors percentage) - Request Latency (P50/P95 by endpoint) - Top MCP Tools (by invocation volume) - Nextcloud API Latency (by app) - Vector Sync Queue Size - Add dashboard README with import instructions Alert Rules: - Critical: Server down, high error rate (>5%), high latency (>1s), dependency down - Warning: Token validation errors (>1%), vector sync queue high (>100), Qdrant slow (>500ms) All features are opt-in via values.yaml configuration. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 09:10:11 +01:00
github-actions[bot]	ae81f0334e	bump: version 0.27.3 → 0.28.0 nextcloud-mcp-server-0.28.0 v0.28.0	2025-11-09 08:04:06 +00:00
Chris Coutinho	23f3a231a5	Merge pull request #273 from cbcoutinho/feature/observability-monitoring Feature/observability monitoring	2025-11-09 09:03:40 +01:00
Chris Coutinho	7be40a33e1	fix(vector): Handle missing 'modified' field in notes gracefully The vector scanner crashed when encountering notes without a 'modified' field, causing KeyError and preventing initial sync from completing. Changes: - Use dict.get() with fallback value (0) instead of direct key access - Log warnings for notes missing 'modified' field - Apply fix to both initial sync and incremental sync code paths This ensures the scanner continues processing all notes even if some have missing metadata fields, preventing scanner crashes that could affect deployment readiness. Fixes: Notes without 'modified' field causing scanner crash and readiness check failure 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 09:03:05 +01:00
Chris Coutinho	578de4d7d6	feat(observability): Add comprehensive monitoring with Prometheus and OpenTelemetry - Add Prometheus metrics for HTTP, MCP tools, Nextcloud API, OAuth, vector sync, and DB operations - Add OpenTelemetry distributed tracing with OTLP export - Add structured JSON logging with trace context correlation - Add ObservabilityMiddleware for automatic HTTP instrumentation - Add app_name attribute to all client classes for per-app metrics - Add configuration for metrics, tracing, and logging via environment variables - Add documentation in docs/observability.md - Fix graceful degradation when tracing is disabled (default state) - Fix uvicorn logging configuration to use observability formatters 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 08:54:04 +01:00
github-actions[bot]	8f0f989c6d	bump: version 0.27.2 → 0.27.3 nextcloud-mcp-server-0.27.3 v0.27.3	2025-11-09 06:52:31 +00:00
Chris Coutinho	f8a2935c22	fix(ci): Use helm dependency build instead of update to use Chart.lock	2025-11-09 07:52:00 +01:00
github-actions[bot]	137dc80075	bump: version 0.27.1 → 0.27.2 nextcloud-mcp-server-0.27.2 v0.27.2	2025-11-09 06:45:44 +00:00
Chris Coutinho	725ac65e6a	fix(helm): update Qdrant dependency condition to match new mode structure The Qdrant subchart was being included by default even in memory/persistent modes. Changed the dependency condition from `qdrant.enabled` to `qdrant.networkMode.deploySubchart` to align with the three-mode structure. Now the Qdrant subchart is ONLY deployed when: - qdrant.mode: "network" - qdrant.networkMode.deploySubchart: true Verified all three modes: - Memory mode (:memory:): No subchart, QDRANT_LOCATION=:memory: - Persistent mode (path): No subchart, QDRANT_LOCATION=/app/data/qdrant, PVC created - Network mode (subchart): Qdrant subchart deployed, QDRANT_URL=http://...:6333 - Network mode (external): No subchart, QDRANT_URL=<external-url> 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:45:06 +01:00
github-actions[bot]	f51edff25d	bump: version 0.27.0 → 0.27.1 nextcloud-mcp-server-0.27.1 v0.27.1	2025-11-09 06:22:00 +00:00
Chris Coutinho	50ba6ccc88	fix(ci): add Helm repository setup to chart release workflow The chart-releaser was failing because it couldn't resolve the dependencies (Qdrant and Ollama subcharts) when packaging. Changes: - Add azure/setup-helm action to install Helm v3.16.0 - Add step to add Qdrant and Ollama Helm repositories - Run helm dependency update before chart-releaser runs This fixes the error: "Error: no repository definition for https://qdrant.github.io/qdrant-helm, https://otwld.github.io/ollama-helm" 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:21:17 +01:00
github-actions[bot]	538bbc375e	bump: version 0.26.1 → 0.27.0 v0.27.0	2025-11-09 06:15:27 +00:00
Chris Coutinho	d4c686eba7	Merge pull request #271 from cbcoutinho/docs/adr-007-background-vector-sync feat: implement ADR-007 background vector sync and semantic search	2025-11-09 07:15:00 +01:00
Chris Coutinho	167e49788e	feat(helm): add Qdrant local mode support with three deployment options [skip ci] Add support for three Qdrant deployment modes in Helm chart: 1. In-memory mode (:memory:) - Default, zero-config, ephemeral storage 2. Persistent local mode (path-based) - File-based storage with PVC 3. Network mode (URL-based) - Dedicated Qdrant service or external instance Changes: - Restructured qdrant configuration in values.yaml with mode selector - Added conditional environment variable logic in deployment.yaml - Created PVC template for persistent local mode with optional existingClaim - Added qdrantPvcName helper template in _helpers.tpl - Updated README.md with Helm registry URL (https://cbcoutinho.github.io/nextcloud-mcp-server) Breaking change: Default changed from requiring qdrant.enabled to using in-memory mode (:memory:) when no Qdrant configuration is provided. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:14:19 +01:00
Chris Coutinho	857d8f2152	feat: add Qdrant local mode support with in-memory and persistent storage Adds flexible Qdrant deployment modes to reduce infrastructure requirements for local development and smaller deployments: Configuration Changes: - Add QDRANT_LOCATION environment variable (mutually exclusive with QDRANT_URL) - Three modes: network (URL), in-memory (:memory:, default), persistent (file path) - Settings dataclass validation via __post_init__ ensures mutual exclusivity - API key warning when set in local mode (ignored, only for network mode) Client Initialization: - Auto-detect mode: network (url + api_key) vs local (:memory: or path=) - In-memory: AsyncQdrantClient(":memory:") - zero config default - Persistent: AsyncQdrantClient(path="/app/data/qdrant") - file storage - Network: AsyncQdrantClient(url, api_key) - production mode Docker Compose Updates: - Qdrant service moved to optional profile (--profile qdrant) - MCP service uses QDRANT_LOCATION=:memory: by default - Added mcp-data volume for persistent storage (/app/data) - No hard dependency on qdrant service Documentation: - Comprehensive configuration guide in docs/configuration.md - All three modes documented with pros/cons - Docker Compose examples for each mode - Environment variable reference table Tests: - 13 new config validation tests (mutual exclusivity, defaults, warnings) - Persistent mode integration test (create, close, reopen, verify persistence) - All 82 unit tests + 5 smoke tests pass Breaking Change: - Default changed from QDRANT_URL=http://qdrant:6333 to QDRANT_LOCATION=:memory: - Simplifies local development (no external service needed) - Production deployments: explicitly set QDRANT_URL or QDRANT_LOCATION Related: ADR-007 background vector sync implementation 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:07:07 +01:00
Chris Coutinho	72232f937a	refactor: migrate vector sync from asyncio.Queue to anyio memory object streams Replace asyncio.Queue with anyio.create_memory_object_stream() throughout the vector sync system for better library consistency and improved shutdown semantics. ## Changes Made scanner.py: - Changed parameter type from `asyncio.Queue` to `MemoryObjectSendStream[DocumentTask]` - Replaced all `await document_queue.put()` calls with `await send_stream.send()` - Wrapped scanner loop in `async with send_stream:` context manager for automatic cleanup - Updated log messages: "Queued" → "Sent" - Removed `import asyncio` (no longer needed) processor.py: - Changed parameter type from `asyncio.Queue` to `MemoryObjectReceiveStream[DocumentTask]` - Replaced `asyncio.wait_for(document_queue.get(), timeout=1.0)` with `anyio.fail_after(1.0)` + `await receive_stream.receive()` - Removed all `document_queue.task_done()` calls (not needed with streams) - Added `anyio.EndOfStream` exception handling for graceful shutdown when scanner closes - Removed `import asyncio` (no longer needed) app.py: - Removed `import asyncio` from top-level imports - Added `from anyio.streams.memory import MemoryObjectReceiveStream, MemoryObjectSendStream` - Updated AppContext dataclass: - Replaced `document_queue: Optional[asyncio.Queue]` with: - `document_send_stream: Optional[MemoryObjectSendStream]` - `document_receive_stream: Optional[MemoryObjectReceiveStream]` - Updated `app_lifespan_basic()`: - Replaced `asyncio.Queue(maxsize=...)` with `anyio.create_memory_object_stream(max_buffer_size=...)` - Pass `send_stream` to scanner_task - Pass `receive_stream.clone()` to each processor_task (enables multiple consumers) - Updated AppContext yield to include both streams - Updated `starlette_lifespan()`: - Same changes as app_lifespan_basic for streamable-http transport - Removed `import asyncio as asyncio_module` (no longer needed) - Updated app.state storage to use send_stream and receive_stream semantic.py: - Updated `nc_get_vector_sync_status()` tool: - Access `document_receive_stream` instead of `document_queue` from lifespan context - Use `stream_stats.current_buffer_used` instead of `queue.qsize()` for pending count - More reliable metrics (qsize() was not guaranteed accurate) ## Benefits 1. Library Consistency: Pure anyio throughout codebase (was mixing asyncio.Queue with anyio.Event and anyio.create_task_group) 2. Graceful Shutdown: `async with send_stream:` automatically closes stream on exit, signaling EndOfStream to all processors 3. Better Timeout Handling: `anyio.fail_after()` is more idiomatic than `asyncio.wait_for()` 4. Stream Cloning: Easy to add multiple consumers via `receive_stream.clone()` 5. Better Statistics: `.statistics()` provides accurate buffer metrics (qsize() was unreliable) 6. Type Safety: Separate send/receive types prevent accidental misuse 7. No task_done() tracking: Streams handle completion automatically ## Testing - ✅ All 69 unit tests passing - ✅ All 5 smoke tests passing - ✅ No regressions in functionality - ✅ Graceful shutdown behavior improved ## References - https://anyio.readthedocs.io/en/stable/why.html#queue-fix - https://anyio.readthedocs.io/en/stable/streams.html#memory-object-streams 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 06:43:44 +01:00

1 2 3 4 5 ...

975 Commits