mirror of
https://github.com/coleam00/Archon.git
synced 2026-01-02 12:48:54 -05:00
265 lines
6.5 KiB
Plaintext
265 lines
6.5 KiB
Plaintext
---
|
|
title: Coding Best Practices
|
|
description: Practical patterns and anti-patterns from real-world Archon development
|
|
sidebar_position: 15
|
|
---
|
|
|
|
# Coding Best Practices
|
|
|
|
Essential patterns and pitfalls to avoid when developing Archon.
|
|
|
|
## Async/Await Patterns
|
|
|
|
### ❌ DON'T: Lambda Functions Returning Unawaited Coroutines
|
|
|
|
```python
|
|
# This will break - the coroutine is never awaited
|
|
progress_callback=lambda data: update_progress(progress_id, data)
|
|
```
|
|
|
|
### ✅ DO: Use asyncio.create_task for Async Callbacks
|
|
|
|
```python
|
|
# Properly schedule async function execution
|
|
progress_callback=lambda data: asyncio.create_task(update_progress(progress_id, data))
|
|
```
|
|
|
|
### ❌ DON'T: Call Sync Functions from Async Context
|
|
|
|
```python
|
|
# This causes "event loop already running" errors
|
|
async def process_data():
|
|
embeddings = create_embeddings_batch(texts) # Sync function in async context
|
|
```
|
|
|
|
### ✅ DO: Use Async Versions in Async Context
|
|
|
|
```python
|
|
async def process_data():
|
|
embeddings = await create_embeddings_batch_async(texts)
|
|
```
|
|
|
|
## Socket.IO Best Practices
|
|
|
|
### Room-Based Broadcasting
|
|
|
|
Always broadcast to specific rooms, not all clients:
|
|
|
|
```python
|
|
# ✅ Good - targeted broadcast
|
|
await sio.emit('crawl_progress', data, room=progress_id)
|
|
|
|
# ❌ Bad - broadcasts to everyone
|
|
await sio.emit('crawl_progress', data)
|
|
```
|
|
|
|
### Simple Event Handlers
|
|
|
|
Keep Socket.IO handlers simple and direct:
|
|
|
|
```python
|
|
# ✅ Good - simple, clear purpose
|
|
@sio.event
|
|
async def crawl_subscribe(sid, data):
|
|
progress_id = data.get('progress_id')
|
|
if progress_id:
|
|
await sio.enter_room(sid, progress_id)
|
|
await sio.emit('crawl_subscribe_ack', {'status': 'subscribed'}, to=sid)
|
|
```
|
|
|
|
## Service Layer Patterns
|
|
|
|
### Progress Callbacks
|
|
|
|
When services need progress callbacks, ensure proper async handling:
|
|
|
|
```python
|
|
# ✅ Good - service accepts optional async callback
|
|
async def process_with_progress(data, progress_callback=None):
|
|
if progress_callback:
|
|
await progress_callback({'status': 'processing', 'percentage': 50})
|
|
```
|
|
|
|
### Error Handling in Services
|
|
|
|
Always provide fallbacks for external service failures:
|
|
|
|
```python
|
|
# ✅ Good - graceful degradation
|
|
try:
|
|
embeddings = await create_embeddings_batch_async(texts)
|
|
except Exception as e:
|
|
logger.warning(f"Embedding creation failed: {e}")
|
|
# Return zero embeddings as fallback
|
|
return [[0.0] * 1536 for _ in texts]
|
|
```
|
|
|
|
## Common Pitfalls
|
|
|
|
### Event Loop Issues
|
|
|
|
**Problem**: "This event loop is already running"
|
|
|
|
**Solution**: Check if you're in an async context before using sync functions:
|
|
|
|
```python
|
|
try:
|
|
loop = asyncio.get_running_loop()
|
|
# Already in async context - use async version
|
|
result = await async_function()
|
|
except RuntimeError:
|
|
# No event loop - safe to use sync version
|
|
result = sync_function()
|
|
```
|
|
|
|
### Socket.IO Progress Updates
|
|
|
|
**Problem**: Progress updates not reaching the UI
|
|
|
|
**Solution**: Ensure the client is in the correct room and progressId is included:
|
|
|
|
```python
|
|
# Always include progressId in the data
|
|
data['progressId'] = progress_id
|
|
await sio.emit('crawl_progress', data, room=progress_id)
|
|
```
|
|
|
|
### Embedding Service Optimization
|
|
|
|
**Problem**: Synchronous embedding calls blocking async operations
|
|
|
|
**Solution**: Always use async versions and batch operations:
|
|
|
|
```python
|
|
# Process in batches with rate limiting
|
|
async def create_embeddings_for_documents(documents):
|
|
batch_size = 20 # OpenAI limit
|
|
for i in range(0, len(documents), batch_size):
|
|
batch = documents[i:i + batch_size]
|
|
embeddings = await create_embeddings_batch_async(batch)
|
|
await asyncio.sleep(0.5) # Rate limiting
|
|
```
|
|
|
|
## Testing Async Code
|
|
|
|
### Mock Async Functions Properly
|
|
|
|
```python
|
|
# ✅ Good - proper async mock
|
|
mock_update = AsyncMock()
|
|
with patch('module.update_progress', mock_update):
|
|
await function_under_test()
|
|
mock_update.assert_called_once()
|
|
```
|
|
|
|
### Test Socket.IO Events
|
|
|
|
```python
|
|
# ✅ Good - test room management
|
|
@patch('src.server.socketio_app.sio')
|
|
async def test_subscribe_joins_room(mock_sio):
|
|
await crawl_subscribe('sid-123', {'progress_id': 'progress-456'})
|
|
mock_sio.enter_room.assert_called_with('sid-123', 'progress-456')
|
|
```
|
|
|
|
## React/Frontend Best Practices
|
|
|
|
### Performance Anti-Patterns to Avoid
|
|
|
|
#### ❌ DON'T: Update State on Every Keystroke Without Debouncing
|
|
```typescript
|
|
// This causes excessive re-renders
|
|
<input
|
|
value={formData.title}
|
|
onChange={(e) => setFormData({ ...formData, title: e.target.value })}
|
|
/>
|
|
```
|
|
|
|
#### ✅ DO: Use Debounced Inputs or Local State
|
|
```typescript
|
|
// Use a debounced input component
|
|
<DebouncedInput
|
|
value={formData.title}
|
|
onChange={handleTitleChange}
|
|
delay={300}
|
|
/>
|
|
```
|
|
|
|
#### ❌ DON'T: Create New Functions in Render
|
|
```typescript
|
|
// Creates new function every render
|
|
<button onClick={() => handleAction(item.id)}>Click</button>
|
|
```
|
|
|
|
#### ✅ DO: Use useCallback for Stable References
|
|
```typescript
|
|
const handleClick = useCallback((id) => {
|
|
handleAction(id);
|
|
}, []);
|
|
|
|
<button onClick={() => handleClick(item.id)}>Click</button>
|
|
```
|
|
|
|
### Socket.IO Client-Side Patterns
|
|
|
|
#### Room Subscription Pattern
|
|
```typescript
|
|
useEffect(() => {
|
|
const ws = createWebSocketService();
|
|
|
|
const connect = async () => {
|
|
await ws.connect('/'); // Always default namespace
|
|
|
|
// Join room
|
|
ws.send({
|
|
type: 'join_project',
|
|
data: { project_id: projectId }
|
|
});
|
|
|
|
// Handle messages
|
|
ws.addMessageHandler('task_updated', handleTaskUpdate);
|
|
};
|
|
|
|
connect();
|
|
return () => ws.disconnect();
|
|
}, [projectId]);
|
|
```
|
|
|
|
### State Management Patterns
|
|
|
|
#### Batch Updates
|
|
```typescript
|
|
// ✅ Good - Single render
|
|
setState(prev => ({
|
|
...prev,
|
|
field1: value1,
|
|
field2: value2,
|
|
field3: value3
|
|
}));
|
|
|
|
// ❌ Bad - Three renders
|
|
setField1(value1);
|
|
setField2(value2);
|
|
setField3(value3);
|
|
```
|
|
|
|
#### Local State for Transient Values
|
|
```typescript
|
|
// Keep form inputs local until save
|
|
const [localTitle, setLocalTitle] = useState(title);
|
|
|
|
const handleSave = () => {
|
|
onSave({ title: localTitle });
|
|
};
|
|
```
|
|
|
|
## Key Takeaways
|
|
|
|
1. **Always await async functions** - Never let coroutines go unawaited
|
|
2. **Use rooms for Socket.IO** - Target specific audiences, not everyone
|
|
3. **Handle async boundaries** - Know when you're in an async context
|
|
4. **Fail gracefully** - Always have fallbacks for external services
|
|
5. **Test async code properly** - Use AsyncMock and proper async test patterns
|
|
6. **Optimize React renders** - Use memo, useCallback, and debouncing
|
|
7. **Batch state updates** - Minimize renders with single setState calls
|
|
8. **Use local state** - Keep transient values local to components |