nextcloud-mcp-server

Author	SHA1	Message	Date
Chris Coutinho	c5da11aa4c	Merge pull request #274 from cbcoutinho/feature/observability-monitoring feature/observability monitoring	2025-11-09 09:29:25 +01:00
Chris Coutinho	5e4667a643	fix(readiness): Only check external Qdrant in network mode The readiness probe incorrectly tried to connect to an external Qdrant service even when using memory or persistent mode (embedded Qdrant). This caused pods to never become ready in Kubernetes deployments using the default configuration. Root cause: - In memory/persistent modes, QDRANT_URL env var is NOT set - Readiness check used default 'http://qdrant:6333' anyway - Tried to connect to non-existent service - Connection failed -> 503 -> pod stuck in not-ready state Fix: - Only check external Qdrant health if QDRANT_URL is explicitly set (network mode) - For embedded modes (memory/persistent), report status as 'embedded' without blocking - Background scanner tasks don't block readiness (already non-blocking via anyio.start_soon) This allows pods to become ready immediately when using embedded Qdrant, while still validating external Qdrant connectivity in network mode. Fixes: Kubernetes pods failing readiness check with default Qdrant configuration 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 09:28:09 +01:00
Chris Coutinho	093ac5b5ba	feat(helm): Add observability support with ServiceMonitor and Grafana dashboard Add comprehensive observability configuration to Helm chart: Helm Values: - Add observability configuration section for metrics, tracing, and logging - Add serviceMonitor configuration (disabled by default) - Add prometheusRule configuration (disabled by default) Templates: - Update deployment to include observability environment variables - Update deployment to expose metrics port (9090) - Update service to expose metrics port - Add ServiceMonitor template for Prometheus Operator - Add PrometheusRule template with critical and warning alerts Dashboards: - Add comprehensive Grafana dashboard JSON with 6 panels: - Request Rate (by method and endpoint) - Error Rate (5xx errors percentage) - Request Latency (P50/P95 by endpoint) - Top MCP Tools (by invocation volume) - Nextcloud API Latency (by app) - Vector Sync Queue Size - Add dashboard README with import instructions Alert Rules: - Critical: Server down, high error rate (>5%), high latency (>1s), dependency down - Warning: Token validation errors (>1%), vector sync queue high (>100), Qdrant slow (>500ms) All features are opt-in via values.yaml configuration. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 09:10:11 +01:00
github-actions[bot]	ae81f0334e	bump: version 0.27.3 → 0.28.0 nextcloud-mcp-server-0.28.0 v0.28.0	2025-11-09 08:04:06 +00:00
Chris Coutinho	23f3a231a5	Merge pull request #273 from cbcoutinho/feature/observability-monitoring Feature/observability monitoring	2025-11-09 09:03:40 +01:00
Chris Coutinho	7be40a33e1	fix(vector): Handle missing 'modified' field in notes gracefully The vector scanner crashed when encountering notes without a 'modified' field, causing KeyError and preventing initial sync from completing. Changes: - Use dict.get() with fallback value (0) instead of direct key access - Log warnings for notes missing 'modified' field - Apply fix to both initial sync and incremental sync code paths This ensures the scanner continues processing all notes even if some have missing metadata fields, preventing scanner crashes that could affect deployment readiness. Fixes: Notes without 'modified' field causing scanner crash and readiness check failure 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 09:03:05 +01:00
Chris Coutinho	578de4d7d6	feat(observability): Add comprehensive monitoring with Prometheus and OpenTelemetry - Add Prometheus metrics for HTTP, MCP tools, Nextcloud API, OAuth, vector sync, and DB operations - Add OpenTelemetry distributed tracing with OTLP export - Add structured JSON logging with trace context correlation - Add ObservabilityMiddleware for automatic HTTP instrumentation - Add app_name attribute to all client classes for per-app metrics - Add configuration for metrics, tracing, and logging via environment variables - Add documentation in docs/observability.md - Fix graceful degradation when tracing is disabled (default state) - Fix uvicorn logging configuration to use observability formatters 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 08:54:04 +01:00
github-actions[bot]	8f0f989c6d	bump: version 0.27.2 → 0.27.3 nextcloud-mcp-server-0.27.3 v0.27.3	2025-11-09 06:52:31 +00:00
Chris Coutinho	f8a2935c22	fix(ci): Use helm dependency build instead of update to use Chart.lock	2025-11-09 07:52:00 +01:00
github-actions[bot]	137dc80075	bump: version 0.27.1 → 0.27.2 nextcloud-mcp-server-0.27.2 v0.27.2	2025-11-09 06:45:44 +00:00
Chris Coutinho	725ac65e6a	fix(helm): update Qdrant dependency condition to match new mode structure The Qdrant subchart was being included by default even in memory/persistent modes. Changed the dependency condition from `qdrant.enabled` to `qdrant.networkMode.deploySubchart` to align with the three-mode structure. Now the Qdrant subchart is ONLY deployed when: - qdrant.mode: "network" - qdrant.networkMode.deploySubchart: true Verified all three modes: - Memory mode (:memory:): No subchart, QDRANT_LOCATION=:memory: - Persistent mode (path): No subchart, QDRANT_LOCATION=/app/data/qdrant, PVC created - Network mode (subchart): Qdrant subchart deployed, QDRANT_URL=http://...:6333 - Network mode (external): No subchart, QDRANT_URL=<external-url> 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:45:06 +01:00
github-actions[bot]	f51edff25d	bump: version 0.27.0 → 0.27.1 nextcloud-mcp-server-0.27.1 v0.27.1	2025-11-09 06:22:00 +00:00
Chris Coutinho	50ba6ccc88	fix(ci): add Helm repository setup to chart release workflow The chart-releaser was failing because it couldn't resolve the dependencies (Qdrant and Ollama subcharts) when packaging. Changes: - Add azure/setup-helm action to install Helm v3.16.0 - Add step to add Qdrant and Ollama Helm repositories - Run helm dependency update before chart-releaser runs This fixes the error: "Error: no repository definition for https://qdrant.github.io/qdrant-helm, https://otwld.github.io/ollama-helm" 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:21:17 +01:00
github-actions[bot]	538bbc375e	bump: version 0.26.1 → 0.27.0 v0.27.0	2025-11-09 06:15:27 +00:00
Chris Coutinho	d4c686eba7	Merge pull request #271 from cbcoutinho/docs/adr-007-background-vector-sync feat: implement ADR-007 background vector sync and semantic search	2025-11-09 07:15:00 +01:00
Chris Coutinho	167e49788e	feat(helm): add Qdrant local mode support with three deployment options [skip ci] Add support for three Qdrant deployment modes in Helm chart: 1. In-memory mode (:memory:) - Default, zero-config, ephemeral storage 2. Persistent local mode (path-based) - File-based storage with PVC 3. Network mode (URL-based) - Dedicated Qdrant service or external instance Changes: - Restructured qdrant configuration in values.yaml with mode selector - Added conditional environment variable logic in deployment.yaml - Created PVC template for persistent local mode with optional existingClaim - Added qdrantPvcName helper template in _helpers.tpl - Updated README.md with Helm registry URL (https://cbcoutinho.github.io/nextcloud-mcp-server) Breaking change: Default changed from requiring qdrant.enabled to using in-memory mode (:memory:) when no Qdrant configuration is provided. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:14:19 +01:00
Chris Coutinho	857d8f2152	feat: add Qdrant local mode support with in-memory and persistent storage Adds flexible Qdrant deployment modes to reduce infrastructure requirements for local development and smaller deployments: Configuration Changes: - Add QDRANT_LOCATION environment variable (mutually exclusive with QDRANT_URL) - Three modes: network (URL), in-memory (:memory:, default), persistent (file path) - Settings dataclass validation via __post_init__ ensures mutual exclusivity - API key warning when set in local mode (ignored, only for network mode) Client Initialization: - Auto-detect mode: network (url + api_key) vs local (:memory: or path=) - In-memory: AsyncQdrantClient(":memory:") - zero config default - Persistent: AsyncQdrantClient(path="/app/data/qdrant") - file storage - Network: AsyncQdrantClient(url, api_key) - production mode Docker Compose Updates: - Qdrant service moved to optional profile (--profile qdrant) - MCP service uses QDRANT_LOCATION=:memory: by default - Added mcp-data volume for persistent storage (/app/data) - No hard dependency on qdrant service Documentation: - Comprehensive configuration guide in docs/configuration.md - All three modes documented with pros/cons - Docker Compose examples for each mode - Environment variable reference table Tests: - 13 new config validation tests (mutual exclusivity, defaults, warnings) - Persistent mode integration test (create, close, reopen, verify persistence) - All 82 unit tests + 5 smoke tests pass Breaking Change: - Default changed from QDRANT_URL=http://qdrant:6333 to QDRANT_LOCATION=:memory: - Simplifies local development (no external service needed) - Production deployments: explicitly set QDRANT_URL or QDRANT_LOCATION Related: ADR-007 background vector sync implementation 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 07:07:07 +01:00
Chris Coutinho	72232f937a	refactor: migrate vector sync from asyncio.Queue to anyio memory object streams Replace asyncio.Queue with anyio.create_memory_object_stream() throughout the vector sync system for better library consistency and improved shutdown semantics. ## Changes Made scanner.py: - Changed parameter type from `asyncio.Queue` to `MemoryObjectSendStream[DocumentTask]` - Replaced all `await document_queue.put()` calls with `await send_stream.send()` - Wrapped scanner loop in `async with send_stream:` context manager for automatic cleanup - Updated log messages: "Queued" → "Sent" - Removed `import asyncio` (no longer needed) processor.py: - Changed parameter type from `asyncio.Queue` to `MemoryObjectReceiveStream[DocumentTask]` - Replaced `asyncio.wait_for(document_queue.get(), timeout=1.0)` with `anyio.fail_after(1.0)` + `await receive_stream.receive()` - Removed all `document_queue.task_done()` calls (not needed with streams) - Added `anyio.EndOfStream` exception handling for graceful shutdown when scanner closes - Removed `import asyncio` (no longer needed) app.py: - Removed `import asyncio` from top-level imports - Added `from anyio.streams.memory import MemoryObjectReceiveStream, MemoryObjectSendStream` - Updated AppContext dataclass: - Replaced `document_queue: Optional[asyncio.Queue]` with: - `document_send_stream: Optional[MemoryObjectSendStream]` - `document_receive_stream: Optional[MemoryObjectReceiveStream]` - Updated `app_lifespan_basic()`: - Replaced `asyncio.Queue(maxsize=...)` with `anyio.create_memory_object_stream(max_buffer_size=...)` - Pass `send_stream` to scanner_task - Pass `receive_stream.clone()` to each processor_task (enables multiple consumers) - Updated AppContext yield to include both streams - Updated `starlette_lifespan()`: - Same changes as app_lifespan_basic for streamable-http transport - Removed `import asyncio as asyncio_module` (no longer needed) - Updated app.state storage to use send_stream and receive_stream semantic.py: - Updated `nc_get_vector_sync_status()` tool: - Access `document_receive_stream` instead of `document_queue` from lifespan context - Use `stream_stats.current_buffer_used` instead of `queue.qsize()` for pending count - More reliable metrics (qsize() was not guaranteed accurate) ## Benefits 1. Library Consistency: Pure anyio throughout codebase (was mixing asyncio.Queue with anyio.Event and anyio.create_task_group) 2. Graceful Shutdown: `async with send_stream:` automatically closes stream on exit, signaling EndOfStream to all processors 3. Better Timeout Handling: `anyio.fail_after()` is more idiomatic than `asyncio.wait_for()` 4. Stream Cloning: Easy to add multiple consumers via `receive_stream.clone()` 5. Better Statistics: `.statistics()` provides accurate buffer metrics (qsize() was unreliable) 6. Type Safety: Separate send/receive types prevent accidental misuse 7. No task_done() tracking: Streams handle completion automatically ## Testing - ✅ All 69 unit tests passing - ✅ All 5 smoke tests passing - ✅ No regressions in functionality - ✅ Graceful shutdown behavior improved ## References - https://anyio.readthedocs.io/en/stable/why.html#queue-fix - https://anyio.readthedocs.io/en/stable/streams.html#memory-object-streams 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 06:43:44 +01:00
Chris Coutinho	4b026e9aa0	feat: implement ADR-009 - refactor semantic search to use generic semantic:read scope This implements ADR-009, which documents the decision to use a generic `semantic:read` OAuth scope instead of requiring all app-specific scopes for semantic search functionality. Changes: - Created new `nextcloud_mcp_server/models/semantic.py` with semantic search models - SemanticSearchResult (with new doc_type field for multi-app support) - SemanticSearchResponse - SamplingSearchResponse - VectorSyncStatusResponse - Created new `nextcloud_mcp_server/server/semantic.py` with semantic search tools - nc_semantic_search (renamed from nc_notes_semantic_search) - nc_semantic_search_answer (renamed from nc_notes_semantic_search_answer) - nc_get_vector_sync_status (renamed from nc_notes_get_vector_sync_status) - All tools now use @require_scopes("semantic:read") instead of "notes:read" - Updated `nextcloud_mcp_server/server/notes.py` - Removed semantic search tools (moved to semantic.py) - Removed semantic search model imports - Removed unused MCP imports (ModelHint, ModelPreferences, etc.) - Updated `nextcloud_mcp_server/models/notes.py` - Removed semantic search models (moved to semantic.py) - Updated `nextcloud_mcp_server/app.py` - Import configure_semantic_tools - Register semantic tools when VECTOR_SYNC_ENABLED=true - Updated `nextcloud_mcp_server/server/__init__.py` - Export configure_semantic_tools - Updated tests - tests/integration/test_sampling.py: Use new tool names - tests/unit/test_response_models.py: Import from semantic.py, add doc_type field Architecture: - Semantic search is now a cross-app feature, not tied to Notes - Uses dual-phase authorization: semantic:read scope + per-document verification - Supports future multi-app indexing (notes, calendar, deck, files, contacts) Test results: - All 69 unit tests passing - All 5 smoke tests passing 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 05:53:53 +01:00
Chris Coutinho	31799ffd9a	docs: remove VECTOR_SYNC_ENABLED_APPS env var, use per-user database settings Replace static VECTOR_SYNC_ENABLED_APPS environment variable with per-user database storage for which apps to index. This allows each user to control their own indexing preferences (e.g., enable notes and calendar but not deck or files). Rationale: - Nextcloud doesn't support granular OAuth scopes at the app level - Per-user settings provide flexibility for multi-user deployments - Users control app enablement via nc_enable_vector_sync MCP tool - Aligns with OAuth architecture where users manage their own settings Changes: - ADR-007: Remove VECTOR_SYNC_ENABLED_APPS from configuration section - ADR-007: Update scanner implementation to read from database - ADR-007: Add explanation of per-user app enablement mechanism - ADR-007: Clarify that nc_enable_vector_sync tool manages this setting 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 05:11:56 +01:00
Chris Coutinho	5cc598e1b1	docs: refactor semantic search from notes-specific to multi-app architecture Update ADRs to reflect that vector database and semantic search support multiple Nextcloud apps (notes, calendar, deck, files, contacts) rather than being notes-specific. Introduce semantic:read/write OAuth scopes to replace app-specific scope requirements for cross-app search. Changes: - ADR-007: Add plugin architecture (DocumentScanner, DocumentProcessor, DocumentVerifier) for multi-app vector sync - ADR-008: Rename tools from nc_notes_semantic_* to nc_semantic_*, update scope from notes:read to semantic:read - ADR-009: NEW - Document decision to use generic semantic:read scope with dual-phase authorization instead of requiring all app scopes - oauth-architecture.md: Add semantic:read/write scope documentation - README.md: Move semantic search to dedicated section separate from Notes This is a breaking change that correctly positions semantic search as a cross-app capability before broader adoption. Existing deployments will need to re-authenticate with the new semantic:read scope. Relates to user request to decouple vector database from notes-only model and establish proper OAuth scope boundaries for multi-app semantic search. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 04:47:20 +01:00
Chris Coutinho	a6c76c5cc1	chore: Add openid scope to nc_notes_get_vector_sync_status	2025-11-09 03:27:17 +01:00
Chris Coutinho	a854656d3c	fix: implement deletion grace period and vector sync status tool This commit addresses issues with vector database synchronization that were causing test failures: 1. Deletion Grace Period (scanner.py) - Fixed premature deletion of documents due to pagination cursor inconsistencies in Notes API - Implemented 2-scan verification with 1.5x scan interval grace period (15 seconds default) - Documents must be missing for 2 consecutive scans before deletion - Documents that reappear are removed from deletion tracking - Prevents false deletions during concurrent note creation/indexing 2. Vector Sync Status Tool (server/notes.py, models/notes.py) - Added nc_notes_get_vector_sync_status MCP tool - Returns indexed_count, pending_count, status, and enabled fields - Enables tests and clients to wait for vector sync completion - Uses lifespan context to access document queue and Qdrant client 3. Test Improvements (test_sampling.py, conftest.py) - Added temporary_note_factory fixture for creating multiple test notes - Updated all sampling tests to wait for vector sync completion - Adjusted score_threshold to 0.0 for SimpleEmbeddingProvider (feature hashing produces low-quality embeddings) - Fixed CallToolResult extraction (removed ["result"] key access) - Removed invalid @pytest.mark.asyncio markers (anyio mode) All integration tests now pass successfully. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 03:11:39 +01:00
Chris Coutinho	bb5d4f464f	feat: implement MCP sampling for semantic search RAG (ADR-008) Add nc_notes_semantic_search_answer tool that combines semantic search with MCP sampling to generate natural language answers from retrieved Nextcloud Notes. This enables Retrieval-Augmented Generation (RAG) patterns without requiring a server-side LLM. Key features: - Client-side LLM generation via ctx.session.create_message() - Graceful fallback when sampling unavailable - Proper source citations in generated answers - No results optimization (skips sampling when no docs found) - Comprehensive unit and integration tests Implementation details: - SamplingSearchResponse model with generated_answer and sources - Fixed prompt template with document context and citation instructions - Model preferences hint Claude Sonnet for balanced performance - Falls back to returning documents without answer on sampling failure Updates: - Add ADR-008 documenting sampling architecture decision - Add MCP sampling pattern guidance to CLAUDE.md - Update README.md and docs/notes.md (7 → 9 tools) - Add 4 unit tests and 6 integration tests 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 01:00:18 +01:00
Chris Coutinho	e32c8f4aec	feat: add optional vector database and semantic search to helm chart Add support for deploying Qdrant vector database and Ollama embedding service as optional helm chart dependencies. Enables semantic search capabilities for Nextcloud content with flexible deployment options. Chart Dependencies: - Add Qdrant v0.9.0 from qdrant/qdrant-helm (conditional) - Add Ollama v1.33.0 from otwld/ollama-helm (conditional) - Both dependencies only deploy when enabled Configuration (values.yaml): - vectorSync: Background sync settings (interval, workers, queue size) - qdrant: Subchart config with persistence, resources, clustering - ollama: Subchart config with model pull, persistence, resources - Support for external Ollama via ollama.url (no subchart deployment) - openai: Alternative embedding provider (OpenAI or compatible API) Environment Variables (deployment.yaml): - VECTOR_SYNC_* variables when vectorSync.enabled - QDRANT_URL, QDRANT_COLLECTION when qdrant.enabled - OLLAMA_BASE_URL, OLLAMA_EMBEDDING_MODEL when ollama enabled or URL set - OPENAI_API_KEY when openai.enabled Documentation: - README: New "Vector Search & Semantic Capabilities" section - README: Example 5 showing three deployment patterns - NOTES.txt: Conditional guidance when vector features enabled - Secret template for OpenAI API key management All features disabled by default for backward compatibility. Tested with helm template and helm lint. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 00:03:51 +01:00
Chris Coutinho	ee183e1c1c	feat: add vector sync processing status to /user/page endpoint Add real-time processing status display to the browser UI at /user/page showing indexed document count, pending queue size, and sync status. Implements the status display described in ADR-007 lines 280-298. Changes: - Store document_queue and related state in app.state for route access - Add _get_processing_status() helper to query Qdrant and check queue - Display status section in user_info_html() with indexed/pending counts - Show color-coded status badge (green "Idle" or orange "Syncing") - Only displays when VECTOR_SYNC_ENABLED=true Status appears in both BasicAuth and OAuth modes, positioned after session info but before logout buttons. Numbers are formatted with commas for readability. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 23:59:18 +01:00
Chris Coutinho	1a57f97d3a	refactor: update to Qdrant query_points API and fix Playwright Keycloak login - Replace deprecated qdrant_client.search() with query_points() API - Update semantic search implementation in notes.py - Update all integration tests to use query_points() - Fix Keycloak login in test_keycloak_dcr.py to use form.submit() instead of button click - Remove unnecessary popup handler code - Simplify consent screen logging	2025-11-08 22:41:14 +01:00
Chris Coutinho	e96c02e4d4	fix: remove unnecessary urllib3<2.0 constraint The urllib3<2.0 constraint was added unnecessarily during troubleshooting. urllib3 2.x works perfectly fine with qdrant-client. The import path for urllib3.util.Url and parse_url remains the same across 1.x and 2.x versions. Changes: - Remove urllib3<2.0 constraint from pyproject.toml - Upgrade to urllib3 2.5.0 (latest) - All integration tests pass with urllib3 2.x Verified: - from urllib3.util import Url, parse_url works in 2.5.0 - All 6 semantic search integration tests pass - qdrant-client 1.15.1 works correctly with urllib3 2.5.0 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 22:18:31 +01:00
Chris Coutinho	7b8c3f93a8	test: add integration tests for semantic search with in-process embeddings Adds comprehensive integration tests for vector database semantic search that work without external dependencies (Ollama), making them suitable for CI/CD. Changes: - Add SimpleEmbeddingProvider: in-process TF-IDF-like embeddings using feature hashing - Make Ollama optional: embedding service now falls back to SimpleEmbeddingProvider - Add 6 integration tests covering semantic search, filtering, and batch operations - Downgrade urllib3 to 1.26.x for qdrant-client compatibility - Update docker-compose.yml to comment out Ollama configuration (optional) The SimpleEmbeddingProvider generates deterministic, normalized embeddings suitable for testing semantic similarity without requiring external services. Tests validate that similar texts have higher cosine similarity and that semantic search correctly ranks results by relevance. Test coverage: - Deterministic embedding generation - Semantic similarity between texts - Full search flow with Qdrant (in-memory) - Category filtering - Empty result handling - Batch embedding generation All tests pass and can run in GitHub CI without Ollama infrastructure. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 22:13:33 +01:00
Chris Coutinho	fdd82f59e2	feat: implement semantic search tool and fix vector sync issues (ADR-007 Phase 3) Completes the ADR-007 implementation by adding user-facing semantic search functionality. Previous phases implemented scanner and processor for background indexing; this adds the query interface. Changes: - Add nc_notes_semantic_search MCP tool for natural language queries - Fix Qdrant point IDs to use UUIDs instead of strings (was causing 400 errors) - Reduce scan interval default from 1 hour to 5 minutes for faster updates - Add SemanticSearchResult and SemanticSearchNotesResponse models - Implement dual-phase authorization (Qdrant filter + Nextcloud API verification) The semantic search enables finding notes by meaning rather than exact keywords, using vector embeddings to understand query intent. Point ID fix resolves critical bug where all document indexing failed with "invalid point ID" errors. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 21:51:12 +01:00
Chris Coutinho	4dbb2eb468	fix: integrate vector sync tasks with Starlette lifespan for streamable-http Fixes background task startup for streamable-http transport by integrating vector sync initialization into the Starlette lifespan context manager. Starlette Lifespan Integration: - Moved background task startup from FastMCP lifespan to Starlette lifespan - FastMCP lifespan only triggers on MCP session establishment - Starlette lifespan runs on server startup (correct timing) - Fixed module scoping issues with local imports (anyio_module, asyncio_module) - Added conditional startup based on oauth_enabled flag Scanner Fixes: - Fixed NotesClient method: list_notes() → get_all_notes() - Properly handle AsyncIterator with list comprehension - Collects all notes before processing Verified Working: - Background tasks start successfully on server startup - Scanner fetches notes from Nextcloud API - Processor pool (3 workers) ready for document processing - Health endpoint reports Qdrant status - No startup errors Phase 3 Complete: - BasicAuth mode with vector sync fully functional - Background tasks integrate cleanly with streamable-http transport - Graceful shutdown with coordinated task cancellation Related: ADR-007 Background Vector Database Synchronization 🤖 Generated with Claude Code (https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 21:20:26 +01:00
Chris Coutinho	8f45e996e8	feat: implement vector sync scanner and processor (ADR-007 Phase 2) Implements background vector database synchronization using anyio TaskGroups for BasicAuth mode with single-user credentials. Scanner Implementation: - Periodic document discovery (hourly, configurable) - Timestamp-based change detection (Nextcloud vs Qdrant) - Wake event for immediate scanning on-demand - Supports both initial sync (all docs) and incremental sync (changes only) - Detects deleted documents and queues for removal Processor Implementation: - Concurrent document processing pool (3 workers default) - I/O-bound embedding generation via Ollama API - Retry logic with exponential backoff (3 retries) - Document chunking (512 words, 50-word overlap) - Handles both index and delete operations - Upserts vectors to Qdrant with rich metadata App Lifespan Integration: - Extended AppContext with background task state - Modified app_lifespan_basic() to start tasks via anyio TaskGroups - Graceful shutdown with coordinated task cancellation - Only activates when VECTOR_SYNC_ENABLED=true Embedding Service: - OllamaEmbeddingProvider with TLS support - Singleton pattern for shared client instances - Batch embedding support for efficiency - Auto-detects embedding dimension (768 for nomic-embed-text) Qdrant Client: - Async client wrapper with singleton pattern - Auto-creates collection on first use - COSINE distance metric for semantic similarity - Integrates with embedding service for dimension detection Health Check Enhancement: - Added Qdrant status check to /health/ready endpoint - Only checks when VECTOR_SYNC_ENABLED=true - 2-second timeout for health probe - Reports connection errors with details Configuration: - VECTOR_SYNC_ENABLED: Enable background sync - VECTOR_SYNC_SCAN_INTERVAL: Scanner frequency (3600s default) - VECTOR_SYNC_PROCESSOR_WORKERS: Concurrent processors (3 default) - QDRANT_URL, QDRANT_API_KEY, QDRANT_COLLECTION: Vector DB config - OLLAMA_BASE_URL, OLLAMA_EMBEDDING_MODEL: Embedding service config Dependencies Added: - qdrant-client>=1.7.0: Vector database client Docker Compose: - Added Qdrant service with health check - Exposed ports 6333 (REST) and 6334 (gRPC) - Configured MCP service with vector sync environment - Added qdrant-data volume for persistence Known Issue: - FastMCP lifespan not triggering for streamable-http transport - Background tasks will start once lifespan integration is complete - Lifespan triggers on MCP session establishment, not server startup Related: ADR-007 Background Vector Database Synchronization 🤖 Generated with Claude Code (https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 21:14:38 +01:00
Chris Coutinho	dc93da2ea0	docs: add ADR-007 for background vector database synchronization Add comprehensive ADR-007 documenting background vector database synchronization architecture using anyio TaskGroups for in-process concurrency. This supersedes ADR-003's conceptual background worker. Key decisions: - In-process architecture using anyio TaskGroups (not Celery) - Scanner task runs hourly, detects changes via timestamp comparison - In-memory asyncio.Queue for pending documents - Pool of 3 concurrent processor tasks for I/O-bound embedding workloads - Qdrant metadata as single source of truth for indexing state - Simple user controls: enable/disable with status visibility Benefits: - Single container deployment (was 3: mcp, celery-worker, celery-beat) - No distributed task queue infrastructure - Shared process state (no volume coordination) - Sufficient throughput for I/O-bound embedding APIs - Simpler debugging and deployment Update ADR-003 status to "Superseded by ADR-007" with reference link. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 20:32:49 +01:00
Chris Coutinho	31ff8a71bf	Merge pull request #270 from cbcoutinho/renovate/downloads.unstructured.io-unstructured-io-unstructured-api-latest chore(deps): update downloads.unstructured.io/unstructured-io/unstructured-api:latest docker digest to 54282d3	2025-11-08 11:24:14 +01:00
renovate-bot-cbcoutinho[bot]	bd012831cf	chore(deps): update downloads.unstructured.io/unstructured-io/unstructured-api:latest docker digest to 54282d3	2025-11-08 05:06:25 +00:00
github-actions[bot]	4ceaf45ffd	bump: version 0.26.0 → 0.26.1 nextcloud-mcp-server-0.26.1 v0.26.1	2025-11-08 03:59:28 +00:00
Chris Coutinho	21b878a2e7	Merge pull request #265 from cbcoutinho/renovate/mcp-1.x fix(deps): update dependency mcp to >=1.21,<1.22	2025-11-08 04:59:05 +01:00
github-actions[bot]	218f0bd366	bump: version 0.25.0 → 0.26.0 nextcloud-mcp-server-0.26.0 v0.26.0	2025-11-08 03:48:50 +00:00
Chris Coutinho	afee3e8bb4	Merge pull request #268 from cbcoutinho/fix/unified-oauth-callback-pkce fix: Consolidate OAuth callbacks and implement PKCE for all flows	2025-11-08 04:48:27 +01:00
Chris Coutinho	050a00d8c8	Merge pull request #269 from cbcoutinho/renovate/ghcr.io-astral-sh-uv-0.x chore(deps): update ghcr.io/astral-sh/uv docker tag to v0.9.8	2025-11-08 00:45:24 +01:00
renovate-bot-cbcoutinho[bot]	f59b6a6cfb	chore(deps): update ghcr.io/astral-sh/uv docker tag to v0.9.8	2025-11-07 23:09:16 +00:00
Chris Coutinho	a766f4be32	test: enhance elicitation callback logging and error handling Improve debugging capabilities for OAuth flow in elicitation callback: - Add detailed logging for consent screen handling - Capture screenshots when consent screen not detected or fails - Replace networkidle wait with explicit callback URL polling - Add 2-second grace period for server-side callback processing - Log page title and current URL for debugging This helps diagnose issues like expired OAuth clients or authorization failures during real elicitation testing. Test results: All 4 elicitation integration tests passing - test_check_logged_in_with_real_elicitation_callback ✓ - test_elicitation_callback_url_extraction ✓ - test_elicitation_stores_refresh_token ✓ - test_second_check_logged_in_does_not_elicit ✓ 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-07 23:49:58 +01:00
Chris Coutinho	ee053d559c	chore: Remove tests	2025-11-07 22:59:57 +01:00
Chris Coutinho	71326384da	feat: add real elicitation integration test with python-sdk MCP client This commit adds proper integration testing of the login elicitation flow (ADR-006) using python-sdk's MCP client with actual elicitation callback support, and fixes user_id extraction to support both JWT and opaque tokens. ## Changes ### 1. Enhanced create_mcp_client_session helper (tests/conftest.py) - Added `elicitation_callback` parameter to function signature - Pass callback to ClientSession constructor - Added necessary imports: RequestContext, ElicitRequestParams, ElicitResult, ErrorData from mcp package - Allows fixtures to provide custom elicitation handlers ### 2. New fixture: nc_mcp_oauth_client_with_elicitation (tests/conftest.py) - Creates MCP client with Playwright-based elicitation callback - Callback implementation: - Extracts OAuth URL from elicitation message using regex - Uses Playwright browser to complete OAuth flow automatically - Handles Nextcloud login form (username/password) - Handles consent screen if present - Waits for OAuth callback completion - Returns ElicitResult(action="accept") on success - Function-scoped to allow independent test state - Tracks elicitation invocations via session.elicitation_triggered ### 3. Fixed user_id extraction for opaque tokens (oauth_tools.py) - Created extract_user_id_from_token() helper to handle both JWT and opaque tokens by calling userinfo endpoint when needed - Fixed check_logged_in to use helper instead of broken ctx.authorization - Fixed revoke_nextcloud_access to use helper instead of ctx.context.get() - Both tools now properly extract user_id from access tokens ### 4. Enhanced integration tests (test_elicitation_integration.py) - Updated tests to revoke refresh tokens via MCP tool - All 4 tests now pass: - test_check_logged_in_with_real_elicitation_callback: Complete flow - test_elicitation_callback_url_extraction: URL extraction validation - test_elicitation_stores_refresh_token: Token persistence verification - test_second_check_logged_in_does_not_elicit: No redundant elicitations ### 5. Added diagnostic logging (oauth_routes.py) - Track user_id extraction from ID tokens during OAuth callbacks - Log refresh token storage with user_id and flow_type ## Test Results ✅ 4/4 tests pass The test suite successfully validates: - Elicitation callback is triggered when no refresh token exists - Playwright automation completes OAuth flow - Refresh token is stored after OAuth with correct user_id - Tool returns "yes" after successful login - Already-logged-in users don't get redundant elicitations ## Why This Matters Previous tests (test_login_elicitation.py) only validated response formats and acknowledged they couldn't test real elicitation protocol. This test exercises the REAL MCP elicitation protocol end-to-end: 1. MCP server calls ctx.elicit() 2. python-sdk ClientSession invokes custom callback 3. Callback completes OAuth via Playwright 4. Client returns acceptance to server 5. Tool proceeds with authenticated state This proves the python-sdk MCP client can handle elicitation in production environments with both JWT and opaque tokens. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-07 22:55:49 +01:00
Chris Coutinho	11cdab475f	feat: unify session architecture and enhance login status visibility This commit addresses the "Login not detected" issue after completing OAuth login via elicitation by unifying the session architecture and adding comprehensive visibility into background session status. ## Changes ### 1. Enhanced check_logged_in with comprehensive logging (oauth_tools.py) - Added detailed logging at each step of token lookup - Implemented fallback strategy: first search by provisioning_client_id, then fall back to user_id lookup - This allows detection of refresh tokens created via any flow (elicitation or browser login) - Log messages include flow_type, provisioned_at, and provisioning_client_id for debugging ### 2. Unified session architecture (browser_oauth_routes.py) - Browser login now stores provisioning_client_id=state when saving refresh token - This makes browser and elicitation flows consistent - both can be found by the same state parameter - Treats Flow 2 (elicitation) and browser login as the same "background session" ### 3. Enhanced /user/page with session status (userinfo_routes.py) - Added comprehensive background access section showing: - Background Access: Granted/Not Granted (with visual indicators) - Flow Type: browser/flow2/hybrid - Provisioned At: timestamp - Token Audience: nextcloud/mcp - Scopes: detailed scope list - Status displayed regardless of which flow created the session (browser login or elicitation) ### 4. Added revoke functionality (userinfo_routes.py, app.py) - New POST endpoint: /user/revoke - Allows users to revoke background access (delete refresh token) - Browser session cookie remains valid for UI access - Confirmation dialog before revocation - Success page with auto-redirect back to /user/page - Registered route in app.py browser_routes ## Testing All tests pass: - 6/6 login elicitation tests pass - 21/21 core OAuth tests pass - Comprehensive logging helps debug future issues ## Fixes Resolves: "Login not detected. Please ensure you completed the login at the provided URL before clicking OK." The issue occurred because elicitation and browser login created separate sessions. Now they are unified under the same architecture. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-07 21:50:55 +01:00
Chris Coutinho	281d28c7cd	test: Add comprehensive elicitation URL and refresh token validation Enhanced test suite to validate: 1. Elicitation URL format and Flow 2 endpoint routing 2. Server-side refresh token validation via check_provisioning_status API 3. Proper separation of concerns - tests use MCP server API, not direct storage access The refresh token validation test validates server responses: - is_provisioned=true: Server has valid refresh token - is_provisioned=false: No token or invalid token - Error response: Token validation failed This ensures the MCP server properly validates refresh tokens internally and reports status correctly through its public API. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-07 21:21:58 +01:00
Chris Coutinho	0c9a9ea24d	fix: Consolidate OAuth callbacks and implement PKCE for all flows This PR fixes multiple OAuth-related issues: ## Unified OAuth Callback - Consolidated `/oauth/callback-nextcloud` and `/oauth/login-callback` into single `/oauth/callback` endpoint - Flow type determined by session lookup via state parameter (no query params in redirect_uri) - Fixes redirect_uri validation issues with IdPs requiring exact match - Legacy endpoints kept as aliases for backwards compatibility ## PKCE Implementation - Implemented PKCE (RFC 7636) for Flow 2 (resource provisioning) - Generate code_verifier and code_challenge - Store code_verifier in session storage - Retrieve and use in token exchange - Fixed PKCE for browser login (integrated mode) - Previously only worked for external IdP (Keycloak) - Now works for both Nextcloud OIDC and external IdP ## Login Elicitation Fixes (ADR-006) - Fixed elicitation URL to route through MCP server endpoint - Changed from direct Nextcloud URL to `/oauth/authorize-nextcloud` - Ensures PKCE is properly handled by server - Fixed login detection after OAuth flow completes - Look up refresh token by state parameter instead of user_id - Works even when Flow 1 token not present - Added `get_refresh_token_by_provisioning_client_id()` method ## Session Authentication - Fixed `/user/page` redirect loop - Shared oauth_context with mounted browser_app - SessionAuthBackend can now validate sessions correctly ## Tests - Added comprehensive login elicitation test suite - Updated scope authorization test expectations - All 43 OAuth tests passing ## Files Changed - `app.py`: Shared oauth_context, unified callback route - `oauth_routes.py`: Unified callback, PKCE for Flow 2 - `browser_oauth_routes.py`: PKCE for integrated mode - `oauth_tools.py`: Fixed elicitation URL generation - `refresh_token_storage.py`: Added lookup by provisioning_client_id - `test_login_elicitation.py`: New test suite 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-07 21:08:55 +01:00
Chris Coutinho	dfa6d08ba7	Merge pull request #266 from cbcoutinho/renovate/quay.io-keycloak-keycloak-26.x chore(deps): update quay.io/keycloak/keycloak docker tag to v26.4.4	2025-11-07 12:24:57 +01:00
renovate-bot-cbcoutinho[bot]	c5395041d3	chore(deps): update quay.io/keycloak/keycloak docker tag to v26.4.4	2025-11-07 11:06:04 +00:00
renovate-bot-cbcoutinho[bot]	c1e135c4a2	fix(deps): update dependency mcp to >=1.21,<1.22	2025-11-07 05:06:10 +00:00

943 Commits