nextcloud-mcp-server

Author	SHA1	Message	Date
Chris Coutinho	da65155cde	Merge pull request #293 from cbcoutinho/fix/grafana-folder-label-validation fix: Move grafana_folder from labels to annotations	2025-11-13 13:10:15 +01:00
Chris Coutinho	4e43d15153	fix: Move grafana_folder from labels to annotations Fixes Kubernetes label validation error when deploying dashboard ConfigMap. Problem: - Kubernetes labels cannot contain spaces (validation regex: [A-Za-z0-9][-A-Za-z0-9_.]*[A-Za-z0-9]) - Previous implementation had grafana_folder: "Nextcloud MCP" as a label - Deployment failed with: "Invalid value: 'Nextcloud MCP'" Solution: - Move grafana_folder from labels to annotations (annotations allow spaces) - Keep grafana_dashboard="1" as label for ConfigMap discovery - Grafana sidecar reads folder name from folderAnnotation parameter Changes: - dashboard-configmap.yaml: Move grafana_folder to annotations section - dashboards/README.md: Fix kubectl commands to use annotations - values.yaml: Update comments to clarify annotation usage This follows the standard kube-prometheus-stack pattern where: - Labels are used for ConfigMap discovery (strict validation) - Annotations are used for metadata like folder names (relaxed validation) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 13:08:45 +01:00
github-actions[bot]	15951c38fa	bump: version 0.32.1 → 0.33.0 nextcloud-mcp-server-0.33.0 v0.33.0	2025-11-13 10:58:05 +00:00
Chris Coutinho	2de0590839	Merge pull request #292 from cbcoutinho/feat/grafana-dashboard-and-vector-metrics feat: Add Grafana dashboard and vector sync metric instrumentation	2025-11-13 11:57:40 +01:00
Chris Coutinho	4ea5ed72d4	feat: Add Grafana dashboard and vector sync metric instrumentation Implement comprehensive observability for vector database synchronization with Grafana dashboard and Prometheus metrics. ## Part 1: Grafana Dashboard Created all-in-one operations dashboard with 7 rows and 34 panels: ### Dashboard Structure: - Overview Row: Request rate, error rate, P95 latency, active requests - HTTP Metrics (RED): Request/error rates by endpoint, latency percentiles - MCP Tools: Call volume, error rates, execution duration by tool - Nextcloud API: API calls/latency by app, retry patterns - OAuth & Authentication: Token validations, exchanges, cache hit rate - Dependencies & Health: Status for Nextcloud/Qdrant/Keycloak/Unstructured - Vector Sync: Processing throughput, queue depth, Qdrant operations ### Helm Chart Integration: - Added dashboard-configmap.yaml template for automatic provisioning - Configured Grafana sidecar auto-discovery (label: grafana_dashboard="1") - Added dashboards configuration section in values.yaml (opt-in) - Updated Chart.yaml with dashboard annotations - Enhanced NOTES.txt with dashboard deployment instructions - Comprehensive documentation in dashboards/README.md Dashboard supports dynamic filtering via variables: - datasource: Prometheus data source selection - namespace: Filter by Kubernetes namespace - pod: Multi-select pod filtering - interval: Query interval (1m/5m/10m/30m/1h) ## Part 2: Vector Sync Metric Instrumentation Implemented metric recording throughout vector sync pipeline: ### metrics.py: Added convenience functions: - record_vector_sync_scan() - Track documents per scan - record_vector_sync_processing() - Track processing duration/status - record_qdrant_operation() - Track database operations - update_vector_sync_queue_size() - Track queue depth ### scanner.py: - Record number of documents found in each scan - Enables monitoring of scan throughput ### processor.py: - Record processing duration for each document - Track success/failure status with timing - Record Qdrant upsert/delete operations - Handle all code paths (success, deletion, error) ### semantic.py: - Wrap Qdrant query_points with try/except - Record search operation success/failure ## Metrics Exposed: - mcp_vector_sync_documents_scanned_total - mcp_vector_sync_documents_processed_total{status} - mcp_vector_sync_processing_duration_seconds (histogram) - mcp_vector_sync_queue_size (gauge) - mcp_qdrant_operations_total{operation,status} This enables monitoring of: - Scan and processing throughput - Processing latency (P50/P95/P99) - Error rates for processing and Qdrant operations - Queue depth trends - Complete observability of vector sync pipeline ## Testing: Verified locally that metrics are recorded correctly: - 36 documents scanned - 3 documents processed (avg 7.5s each) - 3 successful Qdrant upsert operations - Search operations tracked ## Deployment: Enable dashboard provisioning in Helm values: ```yaml dashboards: enabled: true grafanaFolder: "Nextcloud MCP" ``` 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 11:49:20 +01:00
Chris Coutinho	d1829fbbd6	Merge pull request #291 from cbcoutinho/renovate/ghcr.io-astral-sh-uv-0.x chore(deps): update ghcr.io/astral-sh/uv docker tag to v0.9.9	2025-11-13 08:02:35 +01:00
renovate-bot-cbcoutinho[bot]	8332542959	chore(deps): update ghcr.io/astral-sh/uv docker tag to v0.9.9	2025-11-12 23:11:29 +00:00
Chris Coutinho	619ba5684d	build: Add ./worktrees to .gitignore	2025-11-12 08:27:33 +01:00
github-actions[bot]	747d297008	bump: version 0.32.0 → 0.32.1 nextcloud-mcp-server-0.32.1 v0.32.1	2025-11-12 02:16:57 +00:00
Chris Coutinho	ba8486b73b	Merge pull request #289 from cbcoutinho/fix/dynamic-embedding-dimensions fix: add dynamic dimension detection for Ollama embedding models	2025-11-12 03:16:29 +01:00
Chris Coutinho	6812e1aca7	fix: add dynamic dimension detection for Ollama embedding models This fixes dimension mismatch errors when using embedding models with non-standard dimensions (e.g., qwen3-embedding:4b produces 2560-dim vectors instead of the hardcoded 768). Changes: - OllamaEmbeddingProvider: Detect dimensions dynamically by generating test embedding instead of hardcoding to 768 - qdrant_client: Call dimension detection before collection creation - app.py: Initialize Qdrant collection before starting background tasks in streamable-http transport path - tests: Fix integration tests to properly mock EmbeddingService wrapper Fixes dimension mismatch error: "could not broadcast input array from shape (2560,) into shape (768,)" All integration tests passing (6/6). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-12 02:46:30 +01:00
github-actions[bot]	49a9dd43c6	bump: version 0.31.1 → 0.32.0 nextcloud-mcp-server-0.32.0 v0.32.0	2025-11-11 23:54:43 +00:00
Chris Coutinho	f6656fee06	Merge pull request #288 from cbcoutinho/feat/webhook-testing-validation feat: webhook-based vector sync with management UI and validation	2025-11-12 00:54:20 +01:00
Chris Coutinho	7e93097137	feat(ollama): Pull model on startup if not available in ollama	2025-11-12 00:37:26 +01:00
Chris Coutinho	0eae33a918	ci: Fix logging warning and cli mock	2025-11-11 23:42:00 +01:00
Chris Coutinho	3430b2409d	build: Set default logging to text	2025-11-11 23:19:37 +01:00
Chris Coutinho	adde0e5623	fix: improve webapp tab UI with CSS Grid and viewport-filling container Fixes layout issues on the webhooks admin tab: - Add min-height to container to fill viewport consistently - Use CSS Grid to overlay tab panes without jumpiness - Add smooth htmx fade transitions for content swaps - Adjust vector sync polling interval from 3s to 10s - Add .playwright-mcp/ to gitignore for test screenshots The CSS Grid approach allows tabs to overlay without absolute positioning, preventing content cutoff while maintaining smooth transitions without container resizing jumps. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 23:07:44 +01:00
Chris Coutinho	12c96af819	feat: add dynamic vector sync status updates with htmx polling Implement real-time vector sync status updates in the /app UI without requiring page refreshes. The status (indexed documents, pending documents, sync state) now updates automatically every 3 seconds. Changes: - Add vector_sync_status_fragment() endpoint that returns HTML fragment with current vector sync status - Modify user_info_html() to use htmx loading for vector sync section with hx-trigger="load" on initial render - Status fragment includes hx-trigger="every 3s" for continuous polling - Add /app/vector-sync/status route to browser_routes The implementation uses htmx (already loaded on page) to poll the status endpoint, providing near real-time updates with minimal overhead. The endpoint queries Qdrant for indexed count and reads from memory streams for pending count, returning only the status HTML fragment. Pattern follows existing webhook management UI which also uses htmx for dynamic loading. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 21:04:31 +01:00
Chris Coutinho	d86a185e04	refactor: move webapp from /user/page to /app Simplified the webapp routing structure by consolidating the admin UI to a single clean endpoint. Changes: - Moved webapp from /user/page to /app (root of mount) - Removed /user JSON endpoint (no longer needed) - Updated mount point from /user to /app in app.py - Updated all route path checks (3 locations) - Updated OAuth redirects to point to /app - Updated all HTMX endpoint references - Updated documentation (ADR-007, CHANGELOG) - Added redirect from /app to /app/ for trailing slash handling New Route Structure: - /app - Main webapp (HTML UI with tabs) - /app/revoke - Revoke background access - /app/webhooks - Webhook management UI - /app/webhooks/enable/{preset_id} - Enable webhook preset - /app/webhooks/disable/{preset_id} - Disable webhook preset Breaking Change: Existing bookmarks to /user or /user/page will no longer work. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 20:53:43 +01:00
Chris Coutinho	f4759e424d	feat: add webhook management UI and BeforeNodeDeletedEvent support Added comprehensive webhook management capabilities including: Webhook Client & API: - Added WebhooksClient for Nextcloud webhooks API integration - Create, list, update, and delete webhooks programmatically - Support for event filters in webhook registration Webhook Presets: - Added preset system for common webhook configurations - notes_sync: BeforeNodeDeletedEvent for Notes file operations - calendar_sync: Calendar events (create, update, delete) - deck_sync: Deck card operations - files_sync: File system changes - forms_sync: Form submissions (conditional) - Filter presets by installed apps Admin UI: - Added multi-pane app view with tabs (User Info, Vector Sync, Webhooks) - Webhooks tab for admin users only - Enable/disable preset webhooks via UI - View currently registered webhooks - Uses htmx for dynamic loading and Alpine.js for tab state - Admin permission checking via OCS API CLI Improvements: - Refactored CLI to separate module (cli.py) - Updated entry point in pyproject.toml BeforeNodeDeletedEvent Fix: - Updated ADR-010 to document NodeDeletedEvent issue - BeforeNodeDeletedEvent includes node.id before deletion - NodeDeletedEvent lacks node.id (file already deleted) - Implemented per Nextcloud maintainer recommendation Testing: - Added comprehensive webhook client tests - Added webhook preset filtering tests - Added admin permission tests Configuration: - Updated docker-compose.yml Qdrant settings 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 20:35:08 +01:00
Chris Coutinho	1bced88c97	refactor: consolidate database storage for webhooks and OAuth tokens Refactored the storage system to use a unified SQLite database for both webhook tracking and OAuth token storage, available in both BasicAuth and OAuth modes. Changes: - Renamed refresh_token_storage.py → storage.py - Made TOKEN_ENCRYPTION_KEY optional (only required for OAuth token ops) - Added registered_webhooks table with schema versioning - Added webhook storage methods (store, get, delete, list, clear) - Initialize storage in both BasicAuth and OAuth modes - Updated webhook routes to persist registrations in database - Database-first pattern for webhook status checks (performance) - Updated all imports across codebase Storage Behavior: - Database created automatically at startup if needed - Existing databases detected and reused - Server fails fast if database initialization fails - No migrations needed (OAuth feature is experimental) Testing: - Added 13 comprehensive unit tests for webhook storage - All 118 unit tests pass - All 5 smoke tests pass - Verified fail-fast behavior on initialization errors 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-11 20:01:49 +01:00
Chris Coutinho	b58e7238ae	feat: validate Nextcloud webhook schemas and document findings Manual testing of Nextcloud webhook_listeners app to validate webhook payloads against ADR-010 expected schemas and document implementation requirements for webhook-based vector synchronization. ## Changes - Add test webhook endpoint at /webhooks/nextcloud in app.py - Captures and logs webhook payloads for analysis - Returns 200 OK immediately for webhook delivery confirmation - Create webhook-testing-findings.md with comprehensive test results - Captured payloads for 5/6 webhook event types - Critical findings: missing node.id in deletions, type mismatches - Implementation recommendations with code examples - Update ADR-010 with Appendix A: Manual Webhook Testing Results - Document actual vs expected webhook behavior - Update event mapping table with tested webhook status - Add 6 specific implementation recommendations - Include testing implications for future development ## Testing Results ✅ NodeCreatedEvent - fires correctly, includes node.id (integer) ✅ NodeWrittenEvent - fires correctly, includes node.id (integer) ✅ NodeDeletedEvent - fires but missing node.id field (path only) ✅ CalendarObjectCreatedEvent - fires correctly with full iCal ✅ CalendarObjectUpdatedEvent - fires correctly with full iCal ❌ CalendarObjectDeletedEvent - does not fire (potential NC bug) ## Key Findings 1. NodeDeletedEvent missing node.id field - requires path-based fallback 2. node.id returns integer not string - needs casting for consistency 3. Multiple webhooks fire per operation - needs deduplication logic 4. Calendar deletion webhooks don't fire - reported as issue #53497 5. Calendar webhooks include full iCal content - enables rich parsing ## GitHub Issues - Created issue #56371: NodeDeletedEvent missing node.id field - Commented on issue #53497: CalendarObjectDeletedEvent not firing Closes #283 --- _This commit was generated with the help of AI, and reviewed by a Human_	2025-11-11 12:13:20 +01:00
Chris Coutinho	0005e0dce0	Merge pull request #286 from cbcoutinho/renovate/docker.io-library-mariadb-lts chore(deps): update docker.io/library/mariadb:lts docker digest to 404ebf2	2025-11-11 09:17:23 +01:00
Chris Coutinho	636e5105c3	Merge pull request #287 from cbcoutinho/renovate/astral-sh-setup-uv-7.x chore(deps): update astral-sh/setup-uv action to v7.1.3	2025-11-11 09:17:16 +01:00
renovate-bot-cbcoutinho[bot]	ee7080afb3	chore(deps): update astral-sh/setup-uv action to v7.1.3	2025-11-10 23:10:10 +00:00
renovate-bot-cbcoutinho[bot]	b52f482a51	chore(deps): update docker.io/library/mariadb:lts docker digest to 404ebf2	2025-11-10 23:10:04 +00:00
github-actions[bot]	ce666934f2	bump: version 0.31.0 → 0.31.1 nextcloud-mcp-server-0.31.1 v0.31.1	2025-11-10 22:21:48 +00:00
Chris Coutinho	cdf69b3ea8	Merge pull request #285 from cbcoutinho/feat/otel-tracing-improvements refactor: simplify OpenTelemetry tracing configuration	2025-11-10 23:21:18 +01:00
Chris Coutinho	a6e5f3d8ff	refactor: simplify OpenTelemetry tracing configuration Simplifies the OpenTelemetry tracing setup by removing the redundant OTEL_ENABLED flag and using the presence of OTEL_EXPORTER_OTLP_ENDPOINT to determine if tracing should be enabled. This follows the standard OpenTelemetry environment variable conventions more closely. Changes: - Remove OTEL_ENABLED/tracing_enabled flag in favor of checking if OTEL_EXPORTER_OTLP_ENDPOINT is set - Add OTEL_EXPORTER_VERIFY_SSL configuration option for OTLP endpoints with self-signed certificates (defaults to false for development) - Move HTTPXClientInstrumentor initialization to module level to ensure httpx calls are traced across all Nextcloud API requests - Add tracing spans to vector sync operations (scan_user_documents) - Fix authorization header logging to only warn about missing headers in OAuth mode (BasicAuth mode doesn't use Authorization headers) - Update observability documentation to reflect simplified configuration - Refactor Dockerfile to use --no-editable flag for uv sync Breaking changes: - OTEL_ENABLED environment variable is removed - Tracing is now automatically enabled when OTEL_EXPORTER_OTLP_ENDPOINT is set Migration guide: - Remove OTEL_ENABLED=true from environment configuration - Tracing will be enabled automatically if OTEL_EXPORTER_OTLP_ENDPOINT is configured 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 22:48:37 +01:00
github-actions[bot]	f44bf3e8f2	bump: version 0.30.0 → 0.31.0 nextcloud-mcp-server-0.31.0 v0.31.0	2025-11-10 07:02:49 +00:00
Chris Coutinho	37141003d8	Merge pull request #283 from cbcoutinho/feat/adr-010-webhook-vector-sync docs: Add ADR-010 for webhook-based vector sync	2025-11-10 08:02:22 +01:00
Chris Coutinho	c787abf2f3	fix: add retry logic for ETag conflicts in category change test The test_attachments_category_change_handling test was failing in CI with HTTP 412 Precondition Failed errors. This is caused by the background vector scanner (runs every 10 seconds) modifying notes between when the test fetches the ETag and when it attempts to update the category. Solution: Added retry logic (up to 3 attempts) that refetches the latest ETag and retries the update operation when encountering 412 errors. This handles the race condition gracefully while still catching genuine errors.	2025-11-10 07:41:02 +01:00
Chris Coutinho	b32324cb76	feat: skip tracing for health and metrics endpoints Health check and metrics endpoints are frequently polled and don't provide meaningful trace data. This change skips OpenTelemetry span creation for: - /health/* (liveness, readiness checks) - /metrics (Prometheus metrics) These endpoints still record Prometheus metrics (request count, latency, in-flight requests) but no longer create trace spans, reducing tracing noise and storage costs. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 07:24:27 +01:00
Chris Coutinho	640a7818f9	fix: optimize Notes API pagination with pruneBefore parameter The Nextcloud Notes API intentionally returns all note IDs (with only 'id' field) in the last chunk to enable deletion detection. Without using the pruneBefore parameter, this causes duplicates - all notes appear with full data in chunks, then again with minimal data in the last chunk. This commit implements proper pruneBefore support: - NotesClient.get_all_notes() now accepts prune_before timestamp parameter - Scanner calculates max(indexed_at) from Qdrant to use as prune threshold - Only notes modified after this timestamp are sent with full data - Deduplication logic handles the API's deletion detection pattern - Significantly reduces data transfer for incremental syncs The behavior is documented in Notes API v1 spec - this is not an API bug, but a feature we weren't utilizing correctly. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 07:19:26 +01:00
Chris Coutinho	8e5d0b5df1	Merge pull request #276 from cbcoutinho/renovate/pin-dependencies chore(deps): pin qdrant/qdrant docker tag to 0fb8897	2025-11-10 06:48:01 +01:00
Chris Coutinho	851d21f56e	Merge pull request #284 from cbcoutinho/renovate/lock-file-maintenance chore(deps): lock file maintenance	2025-11-10 06:47:35 +01:00
renovate-bot-cbcoutinho[bot]	fb1af697f7	chore(deps): lock file maintenance	2025-11-10 05:13:55 +00:00
renovate-bot-cbcoutinho[bot]	bf4eed6007	chore(deps): pin qdrant/qdrant docker tag to 0fb8897	2025-11-10 05:12:36 +00:00
Chris Coutinho	3a41860d27	docs: Add ADR-010 for webhook-based vector sync Add architecture decision record for integrating Nextcloud webhooks into the vector database synchronization system. Key features: - Webhook endpoint at /webhooks/nextcloud receives push notifications - Complements existing polling (ADR-007) without replacing it - Optional authentication via WEBHOOK_SECRET - Simple architecture: webhooks are just another DocumentTask producer - Administrators can reduce polling frequency when webhooks are configured Benefits: - Reduced latency: seconds to minutes instead of up to 1 hour - Lower API load: ~95% reduction when polling frequency is increased - Better scalability: only process changed documents - No changes required to scanner or processor components 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 05:28:36 +01:00
github-actions[bot]	126b5a7626	bump: version 0.29.2 → 0.30.0 nextcloud-mcp-server-0.30.0 v0.30.0	2025-11-10 02:50:11 +00:00
Chris Coutinho	4d3ff1abe1	Merge pull request #282 from cbcoutinho/feat/multi-embedding-model-support feat(vector): Support multiple embedding models with auto-generated collection names	2025-11-10 03:49:48 +01:00
Chris Coutinho	d80e54ff97	feat(helm): Add document chunking configuration Add support for configurable document chunking parameters to Helm chart to match docker-compose and application capabilities. Changes: 1. values.yaml: - Add documentChunking section with chunkSize (512) and chunkOverlap (50) - Include comprehensive comments explaining chunking strategies - Positioned between vectorSync and qdrant sections 2. templates/deployment.yaml: - Add DOCUMENT_CHUNK_SIZE and DOCUMENT_CHUNK_OVERLAP env vars - Always set (not conditional), used by vector sync processor - Environment variables follow same pattern as config.py defaults 3. README.md: - Add documentChunking parameter table in Vector Search section - Document chunking strategies (small/medium/large chunks) - Explain overlap recommendations (10-20% of chunk size) Validation: - helm lint: Passes - helm template: Environment variables correctly generated - Custom values: Work as expected (tested with chunkSize=1024) - Always present: Not conditional on vectorSync.enabled This maintains feature parity between Helm and docker-compose deployments, allowing users to tune chunking for their embedding models and use cases. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 03:34:16 +01:00
Chris Coutinho	157e433d65	fix: Support in-memory Qdrant for CI testing Changes to make tests work without external qdrant/ollama dependencies: 1. docker-compose.yml (mcp service): - Switch from QDRANT_URL (network mode) to QDRANT_LOCATION=":memory:" - Comment out QDRANT_URL and QDRANT_API_KEY (not needed for in-memory) - Keep OLLAMA_BASE_URL commented out (use SimpleEmbeddingProvider fallback) 2. nextcloud_mcp_server/vector/qdrant_client.py: - Fix collection creation bug in in-memory mode - Previously: All ValueError exceptions were re-raised - Now: Only dimension mismatch ValueError is re-raised - Allows "Collection not found" ValueError to trigger auto-creation 3. tests/integration/test_sampling.py: - Update test to handle all sampling unsupported cases - Check for multiple fallback search_method values - Skip test gracefully when sampling unavailable This configuration enables: - CI testing without external services (qdrant, ollama) - In-memory vector database (ephemeral but sufficient for tests) - SimpleEmbeddingProvider for embeddings (feature hashing, 384 dims) - Automatic collection creation on first use Test result: test_semantic_search_answer_successful_sampling now passes (skipped with appropriate message when sampling unsupported) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 03:21:27 +01:00
Chris Coutinho	94d16092c0	ci: Add qdrant profile to docker compose up command	2025-11-10 03:09:50 +01:00
Chris Coutinho	cb39b3fca4	feat(vector): Add configurable chunk size and overlap for document embedding Enable users to tune document chunking parameters to match their embedding model and content type by adding DOCUMENT_CHUNK_SIZE and DOCUMENT_CHUNK_OVERLAP environment variables. - config.py: Added `document_chunk_size` (default: 512) and `document_chunk_overlap` (default: 50) configuration fields with validation: - Ensures overlap < chunk_size - Warns if chunk_size < 100 words - Prevents negative overlap values - processor.py: Updated DocumentChunker instantiation to use config settings instead of hardcoded values (line 174-177) - tests/unit/test_config.py: Added TestChunkConfigValidation class with 9 tests covering: - Default values - Valid configurations - Validation errors (overlap >= chunk_size, negative overlap) - Warning for small chunk sizes - Environment variable loading - docs/configuration.md: Added comprehensive "Document Chunking Configuration" section with: - Chunk size selection guidance (256-384 vs 512 vs 768-1024 words) - Overlap recommendations (10-20% of chunk size) - Configuration examples for different use cases - Added env vars to reference table - docs/semantic-search-architecture.md: Added "Document Chunking Strategy" section with: - Chunking process explanation - Example showing sliding window behavior - Search behavior with chunks - Tuning recommendations - env.sample: Added complete "Semantic Search & Vector Sync Configuration" section with: - Vector sync settings - Qdrant configuration (3 modes) - Ollama embedding service - Document chunking configuration - docker-compose.yml: Added commented examples for DOCUMENT_CHUNK_SIZE and DOCUMENT_CHUNK_OVERLAP with usage notes \`\`\`bash DOCUMENT_CHUNK_SIZE=512 DOCUMENT_CHUNK_OVERLAP=50 \`\`\` 1. \`overlap\` must be less than \`chunk_size\` 2. \`overlap\` cannot be negative 3. Warning issued if \`chunk_size\` < 100 words Precise matching (small notes, specific queries): \`\`\`bash DOCUMENT_CHUNK_SIZE=256 DOCUMENT_CHUNK_OVERLAP=25 \`\`\` Balanced (default, general purpose): \`\`\`bash DOCUMENT_CHUNK_SIZE=512 DOCUMENT_CHUNK_OVERLAP=50 \`\`\` Contextual (long documents, broader topics): \`\`\`bash DOCUMENT_CHUNK_SIZE=1024 DOCUMENT_CHUNK_OVERLAP=100 \`\`\` ✅ User control - Tune chunking to match embedding model capabilities ✅ Experimentation - Test different chunk sizes for optimal results ✅ Model alignment - Match chunk size to embedding context window ✅ Backward compatible - Defaults maintain existing behavior ✅ Well validated - Comprehensive tests prevent misconfiguration All 22 config validation tests pass (9 new tests for chunking): - Default values work correctly - Validation prevents invalid configurations - Environment variables load properly - Warning system works as expected With configurable chunk sizes, users can now experiment with different Ollama embedding models and tune chunk parameters for optimal semantic search quality. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 02:47:57 +01:00
Chris Coutinho	f3050e9b45	chore: Remove /health and /metrics endpoints from logging	2025-11-10 02:07:45 +01:00
Chris Coutinho	e575c8e57b	feat(vector): Support multiple embedding models with auto-generated collection names This PR enables safe switching between embedding models and multi-server deployments by implementing auto-generated Qdrant collection names based on deployment ID and model name. ## Problem Previously, all deployments used a single hardcoded collection name "nextcloud_content", which caused two critical issues: 1. Dimension mismatches when switching models: Changing OLLAMA_EMBEDDING_MODEL (e.g., nomic-embed-text at 768D → all-minilm at 384D) would cause runtime errors as vectors couldn't be inserted into a collection with incompatible dimensions. 2. Collection collisions in multi-server setups: Multiple MCP servers sharing a single Qdrant instance would overwrite each other's data, making horizontal scaling impossible. ## Solution ### Auto-Generated Collection Naming Collections are now automatically named using the pattern: \`{deployment-id}-{model-name}\` Deployment ID: Uses \`OTEL_SERVICE_NAME\` if configured (and not default value), otherwise falls back to \`hostname\` for simple Docker deployments. Model Name: From \`OLLAMA_EMBEDDING_MODEL\` with path separators sanitized. Examples: - \`my-mcp-server-nomic-embed-text\` (with OTEL_SERVICE_NAME=my-mcp-server) - \`mcp-container-all-minilm\` (simple Docker, hostname=mcp-container) Override: Users can still set \`QDRANT_COLLECTION\` explicitly to bypass auto-generation for backward compatibility. ### Dimension Validation Added startup validation that checks collection dimensions match the embedding service. If a mismatch is detected, the server fails fast with a clear error message explaining: - Expected vs actual dimensions - Likely cause (model change) - Solutions (delete collection, use different name, or revert model) ### Improved Sampling Error Handling Enhanced MCP sampling rejection handling to treat user rejections as normal behavior rather than errors: - User rejections ("rejected", "denied") → INFO log, no traceback - Unsupported clients → INFO log, no traceback - Other MCP errors → WARNING log, no traceback - Unexpected errors → ERROR log WITH traceback This aligns with the MCP specification where clients SHOULD prompt users for approval/denial of sampling requests. ## Changes ### Core Implementation - nextcloud_mcp_server/config.py: Added \`get_collection_name()\` method with deployment ID detection and model name sanitization - nextcloud_mcp_server/vector/qdrant_client.py: Dimension validation on collection open with helpful error messages - nextcloud_mcp_server/vector/{scanner,processor}.py: Updated to use \`get_collection_name()\` - nextcloud_mcp_server/auth/userinfo_routes.py: Vector sync status uses \`get_collection_name()\` - nextcloud_mcp_server/server/semantic.py: - Updated semantic search tools to use \`get_collection_name()\` - Improved sampling rejection error handling (McpError vs Exception) ### Documentation - docs/semantic-search-architecture.md: New comprehensive architecture document (557 lines) covering background sync, semantic search flow, RAG implementation, and deployment modes - docs/configuration.md: Added detailed "Qdrant Collection Naming" section with examples and multi-server deployment guidance - docker-compose.yml: Added comments explaining collection naming behavior - README.md: Updated semantic search descriptions to clarify experimental status, Notes-only support, and infrastructure requirements ## Migration Guide For existing single-server deployments: Option 1 (Recommended): Use explicit collection name for continuity \`\`\`bash QDRANT_COLLECTION=nextcloud_content # Keep existing collection \`\`\` Option 2: Allow auto-generation and re-embed \`\`\`bash # Remove QDRANT_COLLECTION override # New collection will be created based on deployment ID + model # Requires re-embedding all documents (may take time) \`\`\` For new multi-server deployments: Set unique OTEL service names per server: \`\`\`bash # Server 1 OTEL_SERVICE_NAME=mcp-prod OLLAMA_EMBEDDING_MODEL=nomic-embed-text # → Collection: "mcp-prod-nomic-embed-text" # Server 2 OTEL_SERVICE_NAME=mcp-staging OLLAMA_EMBEDDING_MODEL=nomic-embed-text # → Collection: "mcp-staging-nomic-embed-text" \`\`\` ## Benefits ✅ Safe model switching: Each model gets its own collection, preventing dimension mismatch errors ✅ Multi-server support: Multiple MCP servers can share one Qdrant instance without conflicts ✅ Clear ownership: Collection names show which deployment and model owns the data ✅ Better error messages: Dimension validation provides actionable guidance ✅ Backward compatible: Existing deployments can continue using \`QDRANT_COLLECTION\` override ## Testing Validated with: - Single-server deployments (default hostname-based naming) - Multi-server deployments (OTEL service name-based naming) - Model switching scenarios (dimension validation) - Collection override scenarios (backward compatibility) Next steps: Testing various Ollama embedding models to investigate optimal chunk sizes and performance characteristics. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-10 01:18:30 +01:00
github-actions[bot]	a0576aa9a2	bump: version 0.29.1 → 0.29.2 nextcloud-mcp-server-0.29.2 v0.29.2	2025-11-09 18:28:34 +00:00
Chris Coutinho	4a6c60113b	fix(helm): Set default strategy to Recreate	2025-11-09 19:27:55 +01:00
Chris Coutinho	a0cb1ac9fe	Merge pull request #281 from cbcoutinho/renovate/qdrant-1.x chore(deps): update helm release qdrant to v1	2025-11-09 18:38:22 +01:00

1 2 3 4 5 ...

1004 Commits