nextcloud-mcp-server

Author	SHA1	Message	Date
github-actions[bot]	bf11f16e2f	bump: version 0.43.0 → 0.44.0	2025-11-19 22:43:03 +00:00
github-actions[bot]	441d94301e	bump: version 0.42.0 → 0.43.0	2025-11-18 12:56:15 +00:00
Chris Coutinho	eec923eff5	feat: Replace custom document chunker with LangChain MarkdownTextSplitter Migrates from custom word-based chunking to LangChain's MarkdownTextSplitter for better semantic search quality. This implements the chunking portion of ADR-011. Changes: - Replace custom regex word chunker with MarkdownTextSplitter - Optimized for Markdown content (headers, code blocks, lists) - Convert from word-based (512 words) to character-based (2048 chars) chunking - Maintain backward-compatible ChunkWithPosition interface - Update configuration defaults and validation - Update all unit tests (12/12 passing) Benefits: - Respects markdown structure boundaries - Never breaks code blocks or headers mid-chunk - Preserves semantic coherence within chunks - Expected 20-30% improvement in recall quality - Industry-standard approach (used by production RAG systems) Note: Full reindex required to apply new chunking to existing documents. Current vector database still contains old word-based chunks. Related: ADR-011 (Improving Semantic Search Quality) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-18 12:17:23 +01:00
github-actions[bot]	8367208a03	bump: version 0.41.0 → 0.42.0	2025-11-17 07:25:33 +00:00
github-actions[bot]	b1f7b1d30b	bump: version 0.40.0 → 0.41.0	2025-11-17 05:57:12 +00:00
Chris Coutinho	c3282534eb	feat: add vector viz template and chunk context endpoint Extracted vector visualization HTML template to separate file to resolve syntax conflicts between Jinja2, Alpine.js, and CSS. Added chunk context endpoint for fetching matched chunks with surrounding text. Changes: - Moved vector_viz.html to templates/ directory (separates Jinja2/Alpine.js/CSS) - Added /app/chunk-context endpoint for retrieving chunk text with context - Updated .dockerignore to include HTML files in Docker builds - Moved anthropic and boto3 to main dependencies (needed for production features) - Added jinja2 dependency for template rendering Fixes Jinja2 TemplateSyntaxError caused by CSS colons being parsed as Jinja2 syntax when template was inline in Python code. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-17 06:46:52 +01:00
github-actions[bot]	39131cefcc	bump: version 0.39.0 → 0.40.0	2025-11-16 11:09:40 +00:00
Chris Coutinho	1504df6fb5	Merge branch 'master' into feature/bedrock	2025-11-16 12:08:23 +01:00
github-actions[bot]	050e9a56b9	bump: version 0.38.0 → 0.39.0	2025-11-16 11:02:48 +00:00
Chris Coutinho	c28fc955ca	Merge origin/master into feature/bm25 Resolved conflicts: - viz_routes.py: Kept bm25's extract_dense_vector() function for robust vector handling - hybrid.py: Removed (bm25 uses native Qdrant RRF fusion instead) - uv.lock: Regenerated after accepting master's dependencies This merge brings in: - RAG evaluation framework (ADR-013) - Performance optimizations (double-fetch elimination) - Migration from asyncio to anyio - OpenTelemetry tracing improvements - Notes app enhancements 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 11:52:40 +01:00
Chris Coutinho	5b484c9226	feat: add unified provider architecture with Amazon Bedrock support Refactored LLM provider infrastructure to support sustainable additions of new providers with both embedding and text generation capabilities. ## Major Changes ### Unified Provider Architecture (ADR-015) - Created `nextcloud_mcp_server/providers/` with unified Provider ABC - Providers now support optional capabilities (embeddings and/or generation) - Auto-detection registry with priority: Bedrock → Ollama → Simple - Backward compatible - existing code continues to work ### New Providers - BedrockProvider: Full Amazon Bedrock integration - Embeddings: Titan Embed, Cohere Embed models - Generation: Claude, Llama, Titan Text, Mistral models - Model-specific request/response handling - AWS credential chain integration - OllamaProvider: Migrated with both capabilities support - AnthropicProvider: Moved from test code to production providers - SimpleProvider: Migrated in-memory fallback provider ### Breaking Changes None - full backward compatibility maintained: - `embedding.get_embedding_service()` still works - RAG evaluation tests updated to use unified providers - All existing tests pass (127 unit tests) ### Testing - Added 9 comprehensive Bedrock unit tests with mocked boto3 - All existing unit tests pass - Type checking (ty) and linting (ruff) pass - Verified backward compatibility ### Documentation - `docs/ADR-015-unified-provider-architecture.md`: Comprehensive ADR - `docs/bedrock-setup.md`: AWS setup guide with IAM permissions - `CLAUDE.md`: Updated with provider architecture section ### Dependencies - Added `boto3>=1.35.0` to dev dependencies (optional) ## Environment Variables ### Bedrock - `AWS_REGION`: AWS region (e.g., "us-east-1") - `BEDROCK_EMBEDDING_MODEL`: Model ID for embeddings - `BEDROCK_GENERATION_MODEL`: Model ID for generation - `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`: Optional credentials ### Ollama - `OLLAMA_BASE_URL`: API URL - `OLLAMA_EMBEDDING_MODEL`: Embedding model (default: "nomic-embed-text") - `OLLAMA_GENERATION_MODEL`: Generation model ## AWS Bedrock Permissions Required Minimal IAM policy: ```json { "Effect": "Allow", "Action": ["bedrock:InvokeModel"], "Resource": ["arn:aws:bedrock:::foundation-model/"] } ``` See `docs/bedrock-setup.md` for detailed setup instructions. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 11:36:58 +01:00
github-actions[bot]	b58b200452	bump: version 0.37.0 → 0.38.0	2025-11-16 10:18:37 +00:00
github-actions[bot]	10129354d9	bump: version 0.36.0 → 0.37.0	2025-11-16 10:18:00 +00:00
Chris Coutinho	8799450c7d	Merge pull request #306 from cbcoutinho/rag-evaluation feat: RAG evaluation framework with performance improvements	2025-11-16 11:17:41 +01:00
Chris Coutinho	2aa82d849c	Merge branch 'feature/bm25'	2025-11-16 07:57:36 +01:00
Chris Coutinho	d1fb7eb633	Merge branch 'rag-evaluation'	2025-11-16 07:46:17 +01:00
Chris Coutinho	6fe5596c13	feat: Implement BM25 hybrid search with native Qdrant RRF fusion Replace custom keyword/fuzzy search algorithms with industry-standard BM25 sparse vectors, combined with dense semantic vectors using Qdrant's native Reciprocal Rank Fusion (RRF). This consolidates search architecture and improves relevance for both semantic and keyword queries. Key changes: - Add fastembed dependency for BM25 sparse vector generation - Update Qdrant collection schema to support named vectors (dense + sparse) - Create BM25SparseEmbeddingProvider using FastEmbed's Qdrant/bm25 model - Implement BM25HybridSearchAlgorithm with native Qdrant RRF prefetch - Update document processor to generate both dense and sparse embeddings - Simplify nc_semantic_search() tool to use BM25 hybrid only - Remove legacy keyword.py, fuzzy.py, and custom hybrid.py (736 lines) - Update ADR-014 with implementation notes and test results Benefits: - Consolidated architecture (single Qdrant database) - Native database-level RRF fusion (more efficient) - Industry-standard BM25 (replaces brittle custom keyword search) - Better relevance across semantic and keyword queries - Simplified codebase (-285 net lines) Tests: All 125 tests passing (118 unit, 7 integration) Implements ADR-014: Replace Custom Keyword Search with BM25 Hybrid Search 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-16 06:59:44 +01:00
Chris Coutinho	fca8ab0cfd	Merge remote-tracking branch 'origin/master' into rag-evaluation	2025-11-16 00:32:59 +01:00
github-actions[bot]	7a7ed79d56	bump: version 0.35.0 → 0.36.0	2025-11-15 23:32:55 +00:00
Chris Coutinho	c272ddd82d	feat: implement RAG evaluation framework with CLI tooling - Add ADR-013 documenting RAG evaluation architecture - Implement two-part evaluation: Context Recall (retrieval) + Answer Correctness (generation) - Create Click CLI for ground truth generation and corpus upload - Add pytest fixtures and tests for retrieval/generation quality - Use BeIR/nfcorpus dataset with 5 selected test queries - Support Ollama and Anthropic LLM providers - Generate synthetic ground truth answers offline - Add comprehensive documentation in tests/rag_evaluation/README.md The framework separates one-time setup (generate/upload) from test execution, making tests much faster (~6-12 min vs ~15-25 min per run). Tests are manual only (not in CI) and require external LLM access. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-15 23:11:21 +01:00
github-actions[bot]	682923dcc8	bump: version 0.34.2 → 0.35.0	2025-11-15 00:46:11 +00:00
github-actions[bot]	56a5c63994	bump: version 0.34.1 → 0.34.2	2025-11-13 21:11:36 +00:00
github-actions[bot]	dd12c957f6	bump: version 0.34.0 → 0.34.1	2025-11-13 21:10:16 +00:00
github-actions[bot]	2f138e7539	bump: version 0.33.1 → 0.34.0	2025-11-13 16:15:29 +00:00
github-actions[bot]	bd76902932	bump: version 0.33.0 → 0.33.1	2025-11-13 12:10:42 +00:00
github-actions[bot]	15951c38fa	bump: version 0.32.1 → 0.33.0	2025-11-13 10:58:05 +00:00
github-actions[bot]	747d297008	bump: version 0.32.0 → 0.32.1	2025-11-12 02:16:57 +00:00
github-actions[bot]	49a9dd43c6	bump: version 0.31.1 → 0.32.0	2025-11-11 23:54:43 +00:00
github-actions[bot]	ce666934f2	bump: version 0.31.0 → 0.31.1	2025-11-10 22:21:48 +00:00
github-actions[bot]	f44bf3e8f2	bump: version 0.30.0 → 0.31.0	2025-11-10 07:02:49 +00:00
renovate-bot-cbcoutinho[bot]	fb1af697f7	chore(deps): lock file maintenance	2025-11-10 05:13:55 +00:00
github-actions[bot]	126b5a7626	bump: version 0.29.2 → 0.30.0	2025-11-10 02:50:11 +00:00
github-actions[bot]	a0576aa9a2	bump: version 0.29.1 → 0.29.2	2025-11-09 18:28:34 +00:00
github-actions[bot]	7772b1ac2e	bump: version 0.29.0 → 0.29.1	2025-11-09 08:54:26 +00:00
github-actions[bot]	af96378cb6	bump: version 0.28.0 → 0.29.0	2025-11-09 08:29:53 +00:00
github-actions[bot]	ae81f0334e	bump: version 0.27.3 → 0.28.0	2025-11-09 08:04:06 +00:00
Chris Coutinho	23f3a231a5	Merge pull request #273 from cbcoutinho/feature/observability-monitoring Feature/observability monitoring	2025-11-09 09:03:40 +01:00
Chris Coutinho	578de4d7d6	feat(observability): Add comprehensive monitoring with Prometheus and OpenTelemetry - Add Prometheus metrics for HTTP, MCP tools, Nextcloud API, OAuth, vector sync, and DB operations - Add OpenTelemetry distributed tracing with OTLP export - Add structured JSON logging with trace context correlation - Add ObservabilityMiddleware for automatic HTTP instrumentation - Add app_name attribute to all client classes for per-app metrics - Add configuration for metrics, tracing, and logging via environment variables - Add documentation in docs/observability.md - Fix graceful degradation when tracing is disabled (default state) - Fix uvicorn logging configuration to use observability formatters 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 08:54:04 +01:00
github-actions[bot]	8f0f989c6d	bump: version 0.27.2 → 0.27.3	2025-11-09 06:52:31 +00:00
github-actions[bot]	137dc80075	bump: version 0.27.1 → 0.27.2	2025-11-09 06:45:44 +00:00
github-actions[bot]	f51edff25d	bump: version 0.27.0 → 0.27.1	2025-11-09 06:22:00 +00:00
github-actions[bot]	538bbc375e	bump: version 0.26.1 → 0.27.0	2025-11-09 06:15:27 +00:00
Chris Coutinho	e96c02e4d4	fix: remove unnecessary urllib3<2.0 constraint The urllib3<2.0 constraint was added unnecessarily during troubleshooting. urllib3 2.x works perfectly fine with qdrant-client. The import path for urllib3.util.Url and parse_url remains the same across 1.x and 2.x versions. Changes: - Remove urllib3<2.0 constraint from pyproject.toml - Upgrade to urllib3 2.5.0 (latest) - All integration tests pass with urllib3 2.x Verified: - from urllib3.util import Url, parse_url works in 2.5.0 - All 6 semantic search integration tests pass - qdrant-client 1.15.1 works correctly with urllib3 2.5.0 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 22:18:31 +01:00
Chris Coutinho	7b8c3f93a8	test: add integration tests for semantic search with in-process embeddings Adds comprehensive integration tests for vector database semantic search that work without external dependencies (Ollama), making them suitable for CI/CD. Changes: - Add SimpleEmbeddingProvider: in-process TF-IDF-like embeddings using feature hashing - Make Ollama optional: embedding service now falls back to SimpleEmbeddingProvider - Add 6 integration tests covering semantic search, filtering, and batch operations - Downgrade urllib3 to 1.26.x for qdrant-client compatibility - Update docker-compose.yml to comment out Ollama configuration (optional) The SimpleEmbeddingProvider generates deterministic, normalized embeddings suitable for testing semantic similarity without requiring external services. Tests validate that similar texts have higher cosine similarity and that semantic search correctly ranks results by relevance. Test coverage: - Deterministic embedding generation - Semantic similarity between texts - Full search flow with Qdrant (in-memory) - Category filtering - Empty result handling - Batch embedding generation All tests pass and can run in GitHub CI without Ollama infrastructure. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 22:13:33 +01:00
Chris Coutinho	8f45e996e8	feat: implement vector sync scanner and processor (ADR-007 Phase 2) Implements background vector database synchronization using anyio TaskGroups for BasicAuth mode with single-user credentials. Scanner Implementation: - Periodic document discovery (hourly, configurable) - Timestamp-based change detection (Nextcloud vs Qdrant) - Wake event for immediate scanning on-demand - Supports both initial sync (all docs) and incremental sync (changes only) - Detects deleted documents and queues for removal Processor Implementation: - Concurrent document processing pool (3 workers default) - I/O-bound embedding generation via Ollama API - Retry logic with exponential backoff (3 retries) - Document chunking (512 words, 50-word overlap) - Handles both index and delete operations - Upserts vectors to Qdrant with rich metadata App Lifespan Integration: - Extended AppContext with background task state - Modified app_lifespan_basic() to start tasks via anyio TaskGroups - Graceful shutdown with coordinated task cancellation - Only activates when VECTOR_SYNC_ENABLED=true Embedding Service: - OllamaEmbeddingProvider with TLS support - Singleton pattern for shared client instances - Batch embedding support for efficiency - Auto-detects embedding dimension (768 for nomic-embed-text) Qdrant Client: - Async client wrapper with singleton pattern - Auto-creates collection on first use - COSINE distance metric for semantic similarity - Integrates with embedding service for dimension detection Health Check Enhancement: - Added Qdrant status check to /health/ready endpoint - Only checks when VECTOR_SYNC_ENABLED=true - 2-second timeout for health probe - Reports connection errors with details Configuration: - VECTOR_SYNC_ENABLED: Enable background sync - VECTOR_SYNC_SCAN_INTERVAL: Scanner frequency (3600s default) - VECTOR_SYNC_PROCESSOR_WORKERS: Concurrent processors (3 default) - QDRANT_URL, QDRANT_API_KEY, QDRANT_COLLECTION: Vector DB config - OLLAMA_BASE_URL, OLLAMA_EMBEDDING_MODEL: Embedding service config Dependencies Added: - qdrant-client>=1.7.0: Vector database client Docker Compose: - Added Qdrant service with health check - Exposed ports 6333 (REST) and 6334 (gRPC) - Configured MCP service with vector sync environment - Added qdrant-data volume for persistence Known Issue: - FastMCP lifespan not triggering for streamable-http transport - Background tasks will start once lifespan integration is complete - Lifespan triggers on MCP session establishment, not server startup Related: ADR-007 Background Vector Database Synchronization 🤖 Generated with Claude Code (https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-08 21:14:38 +01:00
github-actions[bot]	4ceaf45ffd	bump: version 0.26.0 → 0.26.1	2025-11-08 03:59:28 +00:00
Chris Coutinho	21b878a2e7	Merge pull request #265 from cbcoutinho/renovate/mcp-1.x fix(deps): update dependency mcp to >=1.21,<1.22	2025-11-08 04:59:05 +01:00
github-actions[bot]	218f0bd366	bump: version 0.25.0 → 0.26.0	2025-11-08 03:48:50 +00:00
renovate-bot-cbcoutinho[bot]	c1e135c4a2	fix(deps): update dependency mcp to >=1.21,<1.22	2025-11-07 05:06:10 +00:00
github-actions[bot]	77e491beea	bump: version 0.24.1 → 0.25.0	2025-11-05 23:02:25 +00:00

1 2 3 4

156 Commits