feat: Implement comprehensive OpenAI error handling for Issue #362

mirror of https://github.com/coleam00/Archon.git synced 2026-01-11 09:07:05 -05:00

Replace silent failures with clear, actionable error messages to eliminate
90-minute debugging sessions when OpenAI API quota is exhausted.

## Backend Enhancements
- Add error sanitization preventing sensitive data exposure (API keys, URLs, tokens)
- Add upfront API key validation before expensive operations (crawl, upload, refresh)
- Implement fail-fast pattern in RAG service (no more empty results for API failures)
- Add specific error handling for quota, rate limit, auth, and API errors
- Add EmbeddingAuthenticationError exception with masked key prefix support
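The fail-fast behavior and the masked key prefix described above might look roughly like the following. This is a minimal sketch, not the code from this commit: only the name `EmbeddingAuthenticationError` comes from the list above, while `mask_key_prefix`, `search`, and the `embed` callable are hypothetical names used for illustration.

```python
from typing import Callable, List


def mask_key_prefix(api_key: str, visible: int = 3) -> str:
    """Keep only the first `visible` characters of a key (hypothetical helper)."""
    if not api_key:
        return "[none]"
    return api_key[:visible] + "..."


class EmbeddingAuthenticationError(Exception):
    """Sketch of an auth error that never stores the full API key."""

    def __init__(self, message: str, api_key: str = "") -> None:
        super().__init__(message)
        # Only the masked prefix is kept, so logs cannot leak the secret.
        self.api_key_prefix = mask_key_prefix(api_key)


def search(query: str, embed: Callable[[str], List[float]]) -> List[str]:
    """Fail fast: an embedding failure propagates instead of becoming []."""
    vector = embed(query)  # may raise EmbeddingAuthenticationError
    return [f"match for vector of length {len(vector)}"]
```

A caller that previously received an empty result list would instead see the exception and could surface its message to the user.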

## Frontend Enhancements
- Create enhanced error utilities with OpenAI-specific parsing
- Build TanStack Query-compatible API wrapper preserving ETag caching
- Update knowledge service to use enhanced error handling
- Enhance TanStack Query hooks with user-friendly error messages

## Security Features
- Comprehensive regex sanitization (8 patterns) with ReDoS protection
- Input validation and circular reference detection
- Generic fallback messages for sensitive keywords
- Bounded quantifiers to prevent regex DoS attacks
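As an illustration of sanitization with bounded quantifiers, here is a small sketch. The three patterns, the placeholder strings, and the `sanitize_error_message` name are assumptions; the eight patterns used in the commit are not shown in this message.

```python
import re

# Hypothetical patterns. Every quantifier has an upper bound, so the amount
# of work per match attempt is capped even on adversarial input.
_PATTERNS = [
    (re.compile(r"sk-[A-Za-z0-9_-]{20,200}"), "[REDACTED_KEY]"),
    (re.compile(r"https?://[^\s\"']{1,2000}"), "[REDACTED_URL]"),
    (re.compile(r"Bearer\s{1,10}[A-Za-z0-9._-]{1,500}"), "Bearer [REDACTED_TOKEN]"),
]

MAX_LEN = 2000  # assumed cap on how much of the message is processed


def sanitize_error_message(message: object) -> str:
    """Return a message with key-, URL-, and token-like substrings redacted."""
    # Input validation: anything that is not a non-empty string gets a
    # generic fallback instead of being echoed back.
    if not isinstance(message, str) or not message:
        return "An error occurred."
    text = message[:MAX_LEN]
    for pattern, replacement in _PATTERNS:
        text = pattern.sub(replacement, text)
    return text
```

Bounding each quantifier, in addition to capping the input length, is one common way to keep regex cost predictable; the actual limits would need tuning to the error messages seen in practice.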

## User Experience
- Clear error messages: "OpenAI API quota exhausted"
- Actionable guidance: "Check your OpenAI billing dashboard and add credits"
- Immediate error visibility (no more silent failures)
- Appropriate error severity styling

## Architecture Compatibility
- Full TanStack Query integration maintained
- ETag caching and optimistic updates preserved
- No regressions detected (all existing tests pass)
- Compatible with existing knowledge base architecture

Resolves #362: Users no longer experience mysterious empty RAG results
that require extensive debugging to identify OpenAI quota issues.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

leex279

2025-09-12 19:22:36 +02:00

parent 94aed6b9fa

commit 98b798173e

26 changed files with 1375 additions and 143 deletions

									
python/src/agents/base_agent.py | 2 +-
@@ -216,7 +216,7 @@ class BaseAgent(ABC, Generic[DepsT, OutputT]):
             self.logger.info(f"Agent {self.name} completed successfully")
             # PydanticAI returns a RunResult with data attribute
             return result.data
-        except asyncio.TimeoutError:
+        except TimeoutError:
             self.logger.error(f"Agent {self.name} timed out after 120 seconds")
             raise Exception(f"Agent {self.name} operation timed out - taking too long to respond")
         except Exception as e:
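
Since Python 3.11, `asyncio.TimeoutError` is an alias of the built-in `TimeoutError`, so the change in this hunk is equivalent on those versions. The sketch below assumes Python 3.11 or newer and uses `asyncio.wait_for` to produce the timeout; whether `BaseAgent` uses `wait_for` is not visible in this hunk, and `slow` and `main` are illustrative names.

```python
import asyncio


async def slow() -> None:
    # Stand-in for an agent call that takes too long.
    await asyncio.sleep(1)


async def main() -> str:
    try:
        await asyncio.wait_for(slow(), timeout=0.01)
    except TimeoutError:  # on Python 3.11+ this is asyncio.TimeoutError too
        return "timed out"
    return "completed"


if __name__ == "__main__":
    print(asyncio.run(main()))
```

On Python 3.10 and earlier the two classes are distinct, so `except TimeoutError` would not catch a timeout raised by `asyncio.wait_for` there.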
