nextcloud-mcp-server

Author	SHA1	Message	Date
Chris Coutinho	c74695af16	Merge branch 'feature/notes'	2025-11-16 08:28:00 +01:00
Chris Coutinho	f36f92120c	build: bump submodule	2025-11-16 08:27:49 +01:00
Chris Coutinho	1faf572546	Merge branch 'feature/bm25' Resolves conflict in viz_routes.py by combining: - Named vector extraction from feature/bm25 - Performance timing from master	2025-11-16 08:18:39 +01:00
Chris Coutinho	944b6dcf5a	fix: Handle named vectors in visualization and semantic search - viz_routes.py: Extract "dense" vector from named vector dict - semantic.py: Specify using="dense" for BM25 hybrid collections - Fixes "X must be 2D array" error in hybrid search - Fixes "Dense vector is not found" error in semantic search 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 08:16:35 +01:00
Chris Coutinho	2aa82d849c	Merge branch 'feature/bm25'	2025-11-16 07:57:36 +01:00
Chris Coutinho	fc6a2f14e4	fix: Update vizApp to use bm25_hybrid algorithm and remove deprecated weights The visualization UI was still using the old 'hybrid' algorithm name and weight parameters that were replaced by the BM25 hybrid search refactor. This caused "Unknown algorithm: hybrid" errors when using the search & visualize feature. Changes: - Update default algorithm from 'hybrid' to 'bm25_hybrid' - Update default scoreThreshold from 0.7 to 0.0 to match backend - Remove deprecated semanticWeight, keywordWeight, fuzzyWeight parameters - Remove weight parameters from search request Fixes the visualization search functionality after BM25 hybrid refactor. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 07:54:20 +01:00
Chris Coutinho	d1fb7eb633	Merge branch 'rag-evaluation'	2025-11-16 07:46:17 +01:00
Chris Coutinho	96cee48258	build: Migrate image to debian-based	2025-11-16 07:32:01 +01:00
Chris Coutinho	16c22c953b	fix: Update viz routes to use BM25 hybrid search after refactor - Remove obsolete search algorithm imports (Fuzzy, Keyword, Hybrid) - Update UI to only show Semantic and BM25 Hybrid algorithms - Replace manual weight controls with RRF fusion info message - Update default algorithm from "hybrid" to "bm25_hybrid" - Remove weight parameters (semantic_weight, keyword_weight, fuzzy_weight) - Update score_threshold default from 0.7 to 0.0 for RRF scoring - Document ty type checker in CLAUDE.md Fixes unresolved-import type errors after BM25 refactor. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 07:23:11 +01:00
Chris Coutinho	529daf2b48	ci: temp disable sse in ci	2025-11-16 07:03:18 +01:00
Chris Coutinho	137d1d6c75	perf: fix vector viz search performance and visual encoding This commit addresses critical performance issues with vector visualization search (reducing time from 40s to ~2s) and improves result visualization through better visual encoding. ## Performance Fixes ### 1. Fix blocking sleep in retry decorator (base.py:51) - Changed `time.sleep(5)` to `await anyio.sleep(5)` in @retry_on_429 - Prevents entire event loop from freezing during rate limit retries - Impact: Reduced search time from 22s to 16s initially ### 2. Add concurrency limiting for verification (verification.py:77-93) - Added `anyio.Semaphore(20)` to limit concurrent HTTP requests - Prevents connection pool exhaustion (RequestError) from 90+ simultaneous requests - Fixes false filtering (was filtering 77/90 results incorrectly) - Note: Semaphore still in code but verification removed from viz endpoint ### 3. Remove unnecessary verification from viz endpoint (viz_routes.py:483-486) - Visualization only needs Qdrant metadata (title, excerpt), not full content - Verification only required for sampling (LLM needs full note content) - Impact: Reduced search time from 43.7s to ~2s (final fix) ### 4. Restore streaming scanner pattern (scanner.py) - Process notes one-at-a-time using async generator - Avoids loading all notes into memory ## Visualization Improvements ### 5. Result-relative score normalization (viz_routes.py:489-504) - Normalize scores within result set: best=1.0, worst=0.0 - Removes arbitrary RRF normalization (theoretical max didn't make sense) - Makes visual encoding meaningful regardless of algorithm scores ### 6. Power scaling for marker sizes (userinfo_routes.py:743) - Changed from linear `8 + (score * 12)` to power `6 + (score² * 14)` - Creates dramatic visual contrast: 0.0→6px, 0.5→9.5px, 1.0→20px - Combined with opacity (0.2-1.0) for clear visual hierarchy ### 7. Multi-channel visual encoding (userinfo_routes.py:740-745) - Size: Exponentially scaled with score² - Opacity: Linear 0.2-1.0 (keeps all points visible) - Color: Viridis gradient (blue→yellow) - Effect: Top results are large/bright/opaque, context results small/dim/transparent ## Result - Search time: 40s → ~2s (20x faster) - Visual contrast: Subtle → dramatic (clear result hierarchy) - No arbitrary cutoffs: All results visible, best naturally highlighted 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 07:01:35 +01:00
Chris Coutinho	b96657c935	ci: Add open-webui to docker-compose	2025-11-16 07:00:20 +01:00
Chris Coutinho	6fe5596c13	feat: Implement BM25 hybrid search with native Qdrant RRF fusion Replace custom keyword/fuzzy search algorithms with industry-standard BM25 sparse vectors, combined with dense semantic vectors using Qdrant's native Reciprocal Rank Fusion (RRF). This consolidates search architecture and improves relevance for both semantic and keyword queries. Key changes: - Add fastembed dependency for BM25 sparse vector generation - Update Qdrant collection schema to support named vectors (dense + sparse) - Create BM25SparseEmbeddingProvider using FastEmbed's Qdrant/bm25 model - Implement BM25HybridSearchAlgorithm with native Qdrant RRF prefetch - Update document processor to generate both dense and sparse embeddings - Simplify nc_semantic_search() tool to use BM25 hybrid only - Remove legacy keyword.py, fuzzy.py, and custom hybrid.py (736 lines) - Update ADR-014 with implementation notes and test results Benefits: - Consolidated architecture (single Qdrant database) - Native database-level RRF fusion (more efficient) - Industry-standard BM25 (replaces brittle custom keyword search) - Better relevance across semantic and keyword queries - Simplified codebase (-285 net lines) Tests: All 125 tests passing (118 unit, 7 integration) Implements ADR-014: Replace Custom Keyword Search with BM25 Hybrid Search 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 06:59:44 +01:00
Chris Coutinho	b174e7f8fb	ci: Add notes app for development	2025-11-16 06:57:28 +01:00
Chris Coutinho	f5bc3e3bc3	docs: init ADR	2025-11-16 06:24:25 +01:00
Chris Coutinho	c8d9cc24e0	refactor: migrate asyncio to anyio for consistent structured concurrency Replace asyncio primitives with anyio equivalents throughout the codebase to establish a single async pattern. This provides better structured concurrency with automatic cancellation on errors and aligns with the pytest anyio configuration. Changes: - hybrid.py: Replace asyncio.gather() with anyio task groups - token_broker.py: Replace asyncio.Lock() with anyio.Lock() - storage.py: Replace asyncio.run() with anyio.run() - app.py: Replace tg.start_soon() with await tg.start() for task status - processor.py: Add task_status parameter for structured startup - scanner.py: Add task_status parameter for structured startup - CLAUDE.md: Update async/await patterns guidance The change from start_soon() to await tg.start() enables proper task initialization signaling, ensuring background tasks are ready before proceeding. This follows anyio best practices for structured concurrency. All 118 unit tests pass with the new implementation. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 03:51:45 +01:00
Chris Coutinho	98d1c2de8e	perf: make note deletion concurrent in upload --force - Collect all notes to delete first, then delete concurrently - Use anyio task group with semaphore (20 concurrent deletions) - Add progress reporting and error tracking for deletions - Show count of notes found before deletion starts This significantly improves --force performance when refreshing large corpuses (e.g., 3,633 notes now delete in ~1 minute instead of ~5 minutes). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 00:55:27 +01:00
Chris Coutinho	30a4d84458	feat: add concurrent uploads and --force flag to upload command - Add --force flag to delete all existing notes in target category before upload - Implement concurrent uploads using anyio task groups (20 concurrent max) - Add semaphore to limit concurrent requests and avoid overwhelming server - Improve progress reporting with upload count and error tracking - Update README with --force flag documentation Performance improvement: Concurrent uploads significantly reduce upload time from ~10-15 minutes to ~2-3 minutes for 3,633 documents. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 00:41:00 +01:00
Chris Coutinho	fca8ab0cfd	Merge remote-tracking branch 'origin/master' into rag-evaluation	2025-11-16 00:32:59 +01:00
github-actions[bot]	7a7ed79d56	bump: version 0.35.0 → 0.36.0 nextcloud-mcp-server-0.36.0 v0.36.0	2025-11-15 23:32:55 +00:00
Chris Coutinho	7e7d861797	Merge pull request #302 from cbcoutinho/feature/viz feat: Vector visualization enhancements and search optimizations	2025-11-16 00:32:31 +01:00
Chris Coutinho	4fa2edf4c7	ci: Set default scan interval to 5min	2025-11-16 00:10:12 +01:00
Chris Coutinho	defa8db18e	fix: download qrels from BEIR ZIP instead of HuggingFace - HuggingFace BeIR/nfcorpus only has 'corpus' and 'queries' configs - Download qrels from original BEIR ZIP file (nfcorpus.zip) - Use synchronous httpx.Client for download (simpler than async) - Remove deprecated trust_remote_code parameter Tested with successful corpus download and qrels extraction. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 00:02:15 +01:00
Chris Coutinho	c9506da2d2	refactor: replace httpx client with NextcloudClient in upload command - Use NextcloudClient with BasicAuth instead of raw httpx - Replace direct HTTP POST with notes.create_note() method - Add close() method to LLMProvider Protocol for proper cleanup - Fix type annotations for dataset iteration This improves code reuse and consistency with the rest of the codebase. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 23:26:07 +01:00
Chris Coutinho	c272ddd82d	feat: implement RAG evaluation framework with CLI tooling - Add ADR-013 documenting RAG evaluation architecture - Implement two-part evaluation: Context Recall (retrieval) + Answer Correctness (generation) - Create Click CLI for ground truth generation and corpus upload - Add pytest fixtures and tests for retrieval/generation quality - Use BeIR/nfcorpus dataset with 5 selected test queries - Support Ollama and Anthropic LLM providers - Generate synthetic ground truth answers offline - Add comprehensive documentation in tests/rag_evaluation/README.md The framework separates one-time setup (generate/upload) from test execution, making tests much faster (~6-12 min vs ~15-25 min per run). Tests are manual only (not in CI) and require external LLM access. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 23:11:21 +01:00
Chris Coutinho	eaeb8eae28	feat: Normalize hybrid search RRF scores to 0-1 range Improve user comprehension by scaling RRF scores to match the intuitive 0-1 range used by other search algorithms. ## Problem RRF (Reciprocal Rank Fusion) scores had a drastically different scale than semantic/keyword/fuzzy scores: - Semantic similarity: 0.0 to 1.0 (typical: 0.5-0.9) - RRF scores: 0.0 to ~0.016 (typical: 0.005-0.015) This caused user confusion - a score of 0.0078 looked terrible but was actually excellent (near theoretical maximum). ## Solution Normalize RRF scores using the formula: `normalized_score = rrf_score * (rrf_k + 1) / total_weight` Where: - rrf_k = 60 (RRF constant) - total_weight = sum of algorithm weights (default: 1.0) Example transformation: - Before: 0.0078 (confusing) - After: 0.477 (intuitive) ## Changes nextcloud_mcp_server/search/hybrid.py: - Store total_weight as instance variable (line 63) - Calculate normalization factor in _reciprocal_rank_fusion() (line 209) - Apply normalization to all RRF scores (line 217) - Preserve raw RRF score in metadata for debugging (line 222) ## Impact User Experience: - Hybrid search scores now comparable with semantic/keyword/fuzzy - Score of 0.5 indicates good match across all algorithms - Consistent scale improves score threshold usability Backward Compatibility: - Raw RRF scores preserved in metadata["rrf_score_raw"] - Result ordering unchanged (normalization is linear transformation) - Breaking change: Existing score thresholds need adjustment Performance: - Negligible overhead (single multiplication per result) ## Testing Verified with nc_semantic_search and nc_semantic_search_answer: - Hybrid scores now 0.47-0.7 range (was 0.003-0.011) - Semantic scores unchanged (0.75) - Result ordering preserved 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 06:48:58 +01:00
Chris Coutinho	42376483ab	refactor: Optimize Nextcloud access verification with centralized filtering Move access verification from individual search algorithms to final output stage, eliminating redundant API calls and improving performance. ## Changes New: - `search/verification.py`: Centralized verification using anyio task groups - Deduplicates results by (doc_id, doc_type) before verification - Verifies all unique documents in parallel using structured concurrency - Filters out inaccessible documents in single pass Modified Search Algorithms: - `search/semantic.py`: Removed _deduplicate_and_verify() and _verify_document_access() - `search/keyword.py`: Removed _verify_access() and parallel verification - `search/fuzzy.py`: Removed _verify_access() and parallel verification - `search/hybrid.py`: Removed nextcloud_client parameter passing All algorithms now return unverified results from Qdrant payload. Modified Output Stages: - `server/semantic.py`: Added verify_search_results() call after search - `auth/viz_routes.py`: Added verify_search_results() call after search Both endpoints now verify access once at final stage with deduplication. ## Performance Impact Before: - Hybrid mode (limit=10): 30 API calls (10 per algorithm × 3 algorithms) - Single algorithm: 10-20 API calls (with verification buffer) After: - Hybrid mode (limit=10): 10 API calls (deduplicated verification) - Single algorithm: 10 API calls (deduplicated verification) Performance Gain: 3x reduction in API calls for hybrid search ## Architecture Benefits - Separation of concerns: Algorithms handle scoring, output stage handles security - Deduplication: Each document verified exactly once - Parallel execution: All verifications run concurrently via anyio task groups - Consistency: Same verification logic across MCP tools and viz endpoints 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 06:21:06 +01:00
Chris Coutinho	ed0825e661	feat: Enhance vector visualization UI and parallelize search verification Vector Visualization Improvements: - Add interactive vector viz tab with Alpine.js and Plotly.js to user info page - Refactor viz route CSS for better scoping and maintainability - Remove unused nextcloud_host variable Performance Optimizations: - Parallelize access verification in fuzzy and keyword search algorithms - Use asyncio.gather() to verify multiple documents concurrently - Add exception handling with return_exceptions=True for resilience Dependencies: - Update third_party/oidc submodule to include RFC 9728 resource_url support 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 05:39:07 +01:00
Chris Coutinho	e3153822f7	perf: Exclude vector-sync status polling from distributed tracing Skip tracing for /app/vector-sync/status to reduce noise from HTMX polling. Metrics collection continues for this endpoint. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 05:19:35 +01:00
Chris Coutinho	2b35dd729f	fix: Reorder tabs and fix viz pane session access - Move Webhooks tab to the right (User Info \| Vector Sync \| Vector Viz \| Webhooks) - Use request.user.display_name instead of session for viz routes - Fixes session middleware error when accessing via iframe	2025-11-15 02:41:42 +01:00
Chris Coutinho	eb32bbbc6b	feat: Add Vector Viz tab to app home page - Add Vector Viz button to tab navigation - Embed viz pane in iframe for seamless integration - Only shown when vector sync is enabled	2025-11-15 02:38:05 +01:00
Chris Coutinho	916af1c8f3	feat: Add vector visualization pane with multi-select document types - Add /app/vector-viz endpoint for interactive search testing - Implement server-side PCA dimensionality reduction (768-dim → 2D) - Support multi-select document type filter for cross-app search - Support all search algorithms: semantic, keyword, fuzzy, hybrid - Display 2D scatter plot of vector embeddings using Plotly - Show search results with scores and document types - Register viz routes in app.py	2025-11-15 02:32:10 +01:00
Chris Coutinho	9a62c8478f	feat: Implement custom PCA to remove sklearn dependency - Add custom PCA implementation using numpy eigendecomposition - Replace sklearn.decomposition.PCA with custom implementation - Maintains same API (fit, transform, fit_transform) - Supports explained_variance_ratio_ for variance analysis - Removes scikit-learn dependency from project - Add type hints and assertion for type safety	2025-11-15 02:02:57 +01:00
Chris Coutinho	2a078093ed	refactor!: Make all search algorithms query Qdrant payload, not Nextcloud BREAKING CHANGE: Search algorithms now require Qdrant to be populated. Vector sync must be enabled and documents indexed for search to work. - Keyword and fuzzy search now query Qdrant scroll API for title/excerpt - Remove inefficient Nextcloud API fetching pattern - Add optional Nextcloud verification for security - Deduplicate by (doc_id, doc_type) tuple, keeping chunk_index=0 - Align with document processor pattern that already stores text in Qdrant	2025-11-15 01:56:41 +01:00
github-actions[bot]	682923dcc8	bump: version 0.34.2 → 0.35.0 nextcloud-mcp-server-0.35.0 v0.35.0	2025-11-15 00:46:11 +00:00
Chris Coutinho	b1a756145e	Merge pull request #301 from cbcoutinho/feature/sse feat: Enable SSE transport for validation testing	2025-11-15 01:45:48 +01:00
Chris Coutinho	b5b03bfd78	feat: Add multi-document Protocol with cross-app search support Implements NextcloudClientProtocol for multi-document type search following user requirement that document types are not 1:1 with apps (e.g., Notes app specializes in markdown, while Files/WebDAV handles multiple file types). Key Changes: - NextcloudClientProtocol: Generic protocol with app-specific client properties - get_indexed_doc_types(): Query Qdrant for actually-indexed document types - Document dispatch: All algorithms check Qdrant before attempting access - Cross-type deduplication: Use (doc_id, doc_type) tuples in hybrid RRF Search Algorithm Updates: - Semantic: Added _verify_document_access() with dispatch to appropriate client - Deduplication by (doc_id, doc_type) tuple - Only "note" verification implemented, others return None with info log - Keyword: Added _fetch_documents() dispatch method - Queries Qdrant for available types before fetching - Supports cross-app search when doc_type=None - Fuzzy: Same pattern as keyword search - Hybrid: Already uses (doc_id, doc_type) for deduplication (no changes needed) Future-Proof Design: - File/calendar verification stubs in place - Clear logging when unsupported types found - Easy to extend when processor indexes new document types Currently Supported: - "note" documents fully implemented and tested - Other types gracefully handled (logged but skipped) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 01:19:29 +01:00
Chris Coutinho	f3bdb8b885	feat: Update nc_semantic_search tool with algorithm selection Implements ADR-012 by adding multi-algorithm support to the MCP tool. Key changes: - Added algorithm parameter: "semantic"\|"keyword"\|"fuzzy"\|"hybrid" (default: "hybrid") - Added weight parameters for hybrid mode configuration - Replaced direct Qdrant/embedding calls with search module abstractions - Updated docstring to describe all four algorithms - Simplified implementation: ~50 lines vs ~150 lines (67% reduction) - Better error handling for missing vector sync Algorithm selection: - semantic: Pure vector similarity (requires VECTOR_SYNC_ENABLED=true) - keyword: Token-based matching with weighted title/content scoring - fuzzy: Character overlap for typo tolerance - hybrid: RRF fusion with configurable weights (default: 0.5/0.3/0.2) Backward compatibility: - Tool name unchanged (nc_semantic_search) - New parameters have sensible defaults - Existing clients get hybrid search automatically (better than pure semantic) - search_method field in response reflects actual algorithm used Weight validation: - Performed in HybridSearchAlgorithm constructor - Must sum to ≤1.0 and all non-negative - At least one weight must be > 0 - Clear error messages on validation failure Next: Update viz pane to use same algorithms 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 00:25:55 +01:00
Chris Coutinho	11e620f2d1	feat: Implement unified search algorithm module Creates shared search module with four algorithms implementing ADR-012: - Semantic search (vector similarity via Qdrant) - Keyword search (token-based matching from ADR-001) - Fuzzy search (character overlap matching) - Hybrid search (RRF fusion from ADR-003) Architecture: - Base SearchAlgorithm interface for consistent API - SearchResult dataclass for unified result format - All algorithms async and independently testable - Proper logging and error handling throughout Semantic Search (search/semantic.py): - Extracted from server/semantic.py - Vector similarity using Qdrant query_points - Dual-phase authorization (vector filter + API verification) - Deduplication of document chunks - Configurable score threshold (default: 0.7) Keyword Search (search/keyword.py): - Implements ADR-001 token-based matching - Title matches weighted 3x higher than content - Case-insensitive token matching - Relevance scoring with normalization - Excerpt extraction with context Fuzzy Search (search/fuzzy.py): - Simple character overlap calculation - Configurable threshold (default: 70%) - Typo-tolerant matching - Fast and dependency-free Hybrid Search (search/hybrid.py): - Reciprocal Rank Fusion (RRF) from ADR-003 - Parallel execution of sub-algorithms - Configurable weights per algorithm - RRF constant k=60 (standard value) - Weight validation (must sum ≤1.0) All algorithms: - Share NextcloudClient for document access - Support user_id filtering (multi-tenant) - Support doc_type filtering (currently notes only) - Return consistent SearchResult objects - Properly formatted with ruff and type-checked Next steps: Update MCP tool to use these algorithms 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 00:10:19 +01:00
Chris Coutinho	56bd85c0f7	docs: Emphasize server-side processing in ADR-012 viz pane Updates ADR-012 to clarify that all search and filtering operations must happen server-side, not in the browser. Key changes: - Enhanced viz pane data flow showing server-side processing - Added performance benefits section (384x bandwidth reduction) - Detailed server-side filtering approach: * Query execution via search/algorithms.py * User ID filtering (multi-tenant security) * Document type filtering * PCA reduction (768-dim → 2D) on server * Only 2D coordinates + metadata sent to client - Updated Phase 3 implementation plan: * Remove ALL client-side search logic * Implement /app/vector-viz server endpoint * htmx form submission for queries * Performance optimizations (caching, streaming) This ensures: - Minimal bandwidth usage (only 2 floats per doc vs 768) - Client handles only visualization, not computation - Can visualize 10,000+ documents without client lag - Raw vectors never leave server (security) - Same search logic as MCP tool (consistency) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 00:02:54 +01:00
Chris Coutinho	5e67277049	docs: Add architecture diagrams and viz pane UI to ADR-012 Enhances ADR-012 with detailed architecture visualization and UI mockup for the vector visualization pane. Added sections: - Architecture diagram showing MCP tool and viz pane integration - Data flow diagrams for both MCP requests and viz pane interactions - Detailed UI mockup with ASCII art showing: * Search configuration controls * Algorithm selector with weight sliders * Interactive 2D scatter plot (Plotly.js) * Results panel with scores * Performance comparison table - Technology stack details (htmx, Alpine.js, Plotly.js, Tailwind CSS) The diagrams illustrate how the viz pane and MCP tool share the same search algorithm implementations from search/algorithms.py, ensuring consistency between user testing interface and programmatic API. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 00:00:40 +01:00
Chris Coutinho	66a7109130	docs: Add ADR-012 for unified multi-algorithm search Proposes unified search architecture with client-configurable algorithm selection and weighting. Addresses the need for flexible search options beyond pure semantic search. Key features: - Four algorithms: semantic, keyword, fuzzy, hybrid - Client-configurable weights for hybrid search - Shared implementation between viz pane and MCP tools - Reciprocal Rank Fusion (RRF) for result combination - Backward compatible with existing nc_semantic_search() Implements designs from: - ADR-003: Hybrid search with RRF (previously unimplemented) - ADR-001: Token-based keyword search (previously unimplemented) Supersedes ADR-011's placeholder for "ADR-013: Hybrid Search" 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-14 23:56:09 +01:00
Chris Coutinho	00e72d24a6	feat: Enable SSE transport for mcp service and update test fixtures Changes: - Remove streamable-http transport override from mcp service in docker-compose.yml - Service now uses CLI default SSE transport on /sse endpoint - Add create_mcp_client_session_sse() helper for SSE connections - Update nc_mcp_client fixture to use SSE transport - Fix unpacking for SSE client (yields 2 values vs 3 for streamable-http) Testing: - All 4 smoke tests pass with SSE transport - 32/34 affected tests pass (2 skipped for vector sync) - OAuth services remain on streamable-http (unchanged) Note: SSE transport is being deprecated in favor of streamable-http. This enables minimal validation testing before deprecation. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-14 19:20:30 +01:00
Chris Coutinho	dc78d92e5b	Merge pull request #299 from cbcoutinho/renovate/docker.io-library-mariadb-lts chore(deps): update docker.io/library/mariadb:lts docker digest to 6b848cb	2025-11-14 11:23:32 +01:00
renovate-bot-cbcoutinho[bot]	86891173b2	chore(deps): update docker.io/library/mariadb:lts docker digest to 6b848cb	2025-11-14 05:07:34 +00:00
Chris Coutinho	73b3d80026	Merge pull request #294 from cbcoutinho/feature/app_api docs: Add ADR-011 for hybrid OAuth + AppAPI deployment architecture	2025-11-13 23:43:25 +01:00
Chris Coutinho	26099d643d	docs: Update ADR-011 to rejected status with Context Agent validation After comprehensive research, the hybrid OAuth + AppAPI architecture is NOT being implemented due to fundamental architectural incompatibilities. Key updates: - Status: Proposed → Not Planned - Added validation from Nextcloud Context Agent project - Context Agent (official NC ExApp with MCP) faces IDENTICAL limitations - Proves constraints are architectural, not implementation-specific Context Agent findings: - ExApp with MCP server endpoint (~28 tools exposed) - Uses Task Processing API for confirmations (NOT MCP elicitation) - Works around AppAPI proxy limitations by changing protocol - MCP endpoint is secondary feature with documented constraints - Primary use: In-app Assistant integration, not external MCP clients Critical features impossible through AppAPI proxy: - ❌ MCP sampling (eliminates RAG/LLM features) - ❌ MCP elicitation (user prompts) - ❌ Real-time progress updates - ❌ Bidirectional streaming - Validated by Context Agent facing same limitations Decision rationale: - MCP requires multi-turn nested interactions - AppAPI provides stateless request/response proxy only - No implementation effort can bridge this fundamental gap - Would require complete AppAPI redesign (WebSocket, message routing) - Even official Nextcloud projects work around these limitations Alternative considered for future: - Register as Task Processing provider (different product) - Use Nextcloud Assistant UI (not external MCP clients) - Accept different capabilities (no sampling, custom flows) OAuth mode remains sole solution for external MCP client integration. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 23:30:14 +01:00
github-actions[bot]	56a5c63994	bump: version 0.34.1 → 0.34.2 nextcloud-mcp-server-0.34.2 v0.34.2	2025-11-13 21:11:36 +00:00
Chris Coutinho	92c8e1e41d	Merge pull request #290 from cbcoutinho/renovate/quay.io-keycloak-keycloak-26.x chore(deps): update quay.io/keycloak/keycloak docker tag to v26.4.5	2025-11-13 22:11:09 +01:00
github-actions[bot]	dd12c957f6	bump: version 0.34.0 → 0.34.1	2025-11-13 21:10:16 +00:00

1 2 3 4 5 ...

1068 Commits