archon/python
claude[bot] bc97f5dbf5 feat: integrate Docling for advanced document processing in RAG pipeline
- Add Docling dependency to pyproject.toml for multi-format document support
- Create docling_processing.py with DocumentConverter and HybridChunker
- Implement intelligent semantic-aware chunking for better RAG performance
- Add enhanced document upload method with Docling integration
- Update knowledge API to use enhanced processing with legacy fallback
- Support PDF, DOCX, PPTX, XLSX, HTML formats with structure preservation
- Include enhanced metadata tracking for processing methods
- Maintain backward compatibility with existing document processors
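
The "enhanced processing with legacy fallback" and "metadata tracking for processing methods" items above could look roughly like the sketch below. This is not the code from this commit: `process_with_fallback`, the `Processor` type, the `ENHANCED_EXTENSIONS` set, and the metadata keys are all hypothetical names chosen for illustration, and the processors are passed in as plain callables so the example runs without Docling installed.

```python
import os
from typing import Any, Callable

# Hypothetical processor signature: raw file bytes and a filename in, text chunks out.
Processor = Callable[[bytes, str], list[str]]

# Formats named in the commit message; the set itself is an assumption.
ENHANCED_EXTENSIONS = {".pdf", ".docx", ".pptx", ".xlsx", ".html"}


def process_with_fallback(
    content: bytes,
    filename: str,
    enhanced: Processor,
    legacy: Processor,
) -> dict[str, Any]:
    """Try the enhanced processor first and fall back to the legacy one on failure."""
    ext = os.path.splitext(filename)[1].lower()
    metadata: dict[str, Any] = {"filename": filename}

    if ext in ENHANCED_EXTENSIONS:
        try:
            chunks = enhanced(content, filename)
            metadata["processing_method"] = "enhanced"
            return {"chunks": chunks, "metadata": metadata}
        except Exception as exc:
            # Record why the enhanced path was skipped so it can be inspected later.
            metadata["fallback_reason"] = str(exc)

    # Unsupported extension or enhanced failure: use the existing processor.
    chunks = legacy(content, filename)
    metadata["processing_method"] = "legacy"
    return {"chunks": chunks, "metadata": metadata}
```

Recording the method in the metadata makes it possible to tell afterwards which documents went through which path, for example to reprocess only the ones that fell back.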

Resolves #756

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-authored-by: Cole Medin <coleam00@users.noreply.github.com>
2025-10-04 13:46:19 +00:00