Spaces:

Luigi
/

rts-commander

Sleeping

App Files Files Community

rts-commander / docs /AI_MODEL_FIX.md

Luigi

chore(structure): move docs into docs/ and tests into tests/

ccbaf39 about 2 months ago

preview code

raw

history blame contribute delete

6.91 kB

	# 🤖 AI Model Configuration for HF Spaces

	Date: 3 octobre 2025
	Issue Fixed: Permission denied when downloading AI model
	Status: ✅ RESOLVED

	---

	## 🐛 Problem Identified

	### Error Log
	```
	⚠️ AI Model not found. Attempting automatic download...
	📦 Downloading model (~350 MB)...
	From: https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct-GGUF/resolve/main/qwen2.5-0.5b-instruct-q4_0.gguf
	To: /home/luigi/rts/qwen2.5-0.5b-instruct-q4_0.gguf
	This may take a few minutes...
	❌ Auto-download failed: [Errno 13] Permission denied: '/home/luigi'
	Tactical analysis disabled.
	```

	### Root Cause
	1. Hardcoded path: `/home/luigi/rts/qwen2.5-0.5b-instruct-q4_0.gguf`
	2. No permission handling: Tried to write to user home directory
	3. HF Spaces incompatibility: Container runs as different user

	---

	## ✅ Fix Applied

	### Changes in `ai_analysis.py`

	#### 1. Smart Path Resolution (Lines 193-200)
	```python
	# Before:
	possible_paths = [
	Path("/home/luigi/rts/qwen2.5-0.5b-instruct-q4_0.gguf"), # ❌ Hardcoded
	Path("./qwen2.5-0.5b-instruct-q4_0.gguf"),
	Path("../qwen2.5-0.5b-instruct-q4_0.gguf"),
	]

	# After:
	possible_paths = [
	Path("./qwen2.5-0.5b-instruct-q4_0.gguf"), # Current directory
	Path("../qwen2.5-0.5b-instruct-q4_0.gguf"), # Parent directory
	Path(__file__).parent / "qwen2.5-0.5b-instruct-q4_0.gguf", # Same dir as script
	Path(__file__).parent.parent / "qwen2.5-0.5b-instruct-q4_0.gguf", # Root project
	]
	```

	#### 2. Permission-Safe Download (Lines 217-227)
	```python
	# Test write permission first
	try:
	default_path = Path("./qwen2.5-0.5b-instruct-q4_0.gguf").resolve()
	# Test write permission
	test_file = default_path.parent / ".write_test"
	test_file.touch()
	test_file.unlink()
	except (PermissionError, OSError):
	# Fallback to temp directory
	import tempfile
	default_path = Path(tempfile.gettempdir()) / "qwen2.5-0.5b-instruct-q4_0.gguf"
	```

	### Benefits
	- ✅ No more hardcoded paths
	- ✅ Tests write permissions before download
	- ✅ Falls back to `/tmp/` if needed
	- ✅ Works on HF Spaces containers
	- ✅ Works on local development
	- ✅ Graceful degradation (game works without AI)

	---

	## 🎮 Game Behavior

	### Without AI Model
	```
	INFO: Uvicorn running on http://0.0.0.0:7860
	⚠️ AI Model not found. Attempting automatic download...
	📦 Downloading model (~350 MB)...
	[Download progress or fallback message]
	```

	Game still works! Tactical analysis is optional.

	### With AI Model
	```
	INFO: Uvicorn running on http://0.0.0.0:7860
	✅ AI Model loaded: ./qwen2.5-0.5b-instruct-q4_0.gguf
	🧠 Tactical analysis available
	```

	Players can use AI analysis feature.

	---

	## 📦 Model Information

	### Qwen2.5-0.5B-Instruct-GGUF
	- Source: https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct-GGUF
	- Size: ~350 MB (q4_0 quantization)
	- Format: GGUF (llama.cpp compatible)
	- Purpose: Tactical battlefield analysis
	- Optional: Game works without it

	### Download Locations (Priority Order)
	1. `./qwen2.5-0.5b-instruct-q4_0.gguf` (current directory)
	2. `../qwen2.5-0.5b-instruct-q4_0.gguf` (parent directory)
	3. `/web/qwen2.5-0.5b-instruct-q4_0.gguf` (script directory)
	4. `/qwen2.5-0.5b-instruct-q4_0.gguf` (project root)
	5. `/tmp/qwen2.5-0.5b-instruct-q4_0.gguf` (fallback)

	---

	## 🚀 HF Spaces Deployment

	### Option 1: Include Model in Repo (Recommended for Demo)
	```bash
	cd /home/luigi/rts/web

	# Download model to web directory
	wget https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct-GGUF/resolve/main/qwen2.5-0.5b-instruct-q4_0.gguf

	# Add to git
	git add qwen2.5-0.5b-instruct-q4_0.gguf
	git commit -m "feat: Include AI model for tactical analysis"

	# Push to HF Spaces
	git push
	```

	Pros:
	- ✅ AI available immediately
	- ✅ No download delay on startup
	- ✅ Deterministic deployment

	Cons:
	- ❌ Larger repo size (~350 MB)
	- ❌ Slower git operations

	### Option 2: Download on Startup (Current Behavior)
	```bash
	# Model will be downloaded automatically on first run
	# Falls back to /tmp/ on HF Spaces
	```

	Pros:
	- ✅ Smaller repo size
	- ✅ Faster git operations

	Cons:
	- ❌ ~1 minute startup delay on first run
	- ❌ Uses ephemeral storage (lost on container restart)
	- ❌ Download may fail on HF free tier

	### Option 3: Disable AI (Minimal Deployment)
	```python
	# In app.py or environment variable
	AI_ENABLED = False
	```

	Pros:
	- ✅ Instant startup
	- ✅ Minimal resource usage
	- ✅ No download issues

	Cons:
	- ❌ No tactical analysis feature

	---

	## 🔧 Configuration

	### Environment Variables
	```bash
	# Optional: Override model path
	export AI_MODEL_PATH="/path/to/qwen2.5-0.5b-instruct-q4_0.gguf"

	# Optional: Disable AI entirely
	export AI_ENABLED="false"
	```

	### In `app.py`
	```python
	# Current implementation:
	ai_analyzer = AIAnalyzer() # Auto-detects model

	# With explicit path:
	ai_analyzer = AIAnalyzer(model_path="/custom/path/model.gguf")

	# Disable AI:
	ai_analyzer = None # Game will skip AI analysis
	```

	---

	## 🧪 Testing

	### Test Fix Locally
	```bash
	cd /home/luigi/rts/web

	# Remove model if exists
	rm -f qwen2.5-0.5b-instruct-q4_0.gguf

	# Start server
	python app.py

	# Should see:
	# ✅ No permission errors
	# ✅ Game starts normally
	# ℹ️ AI may try to download or use fallback path
	```

	### Test on HF Spaces
	```bash
	# Push changes
	git add ai_analysis.py
	git commit -m "fix: AI model path and permissions"
	git push

	# Check HF Spaces logs:
	# ✅ No "[Errno 13] Permission denied"
	# ✅ Game runs successfully
	```

	---

	## 📊 Impact

	### Before Fix
	- ❌ Permission denied error on startup
	- ❌ Hardcoded user paths
	- ❌ Would fail on HF Spaces
	- ⚠️ Confusing error messages

	### After Fix
	- ✅ No permission errors
	- ✅ Portable path resolution
	- ✅ Works on HF Spaces
	- ✅ Graceful degradation
	- ✅ Clear fallback behavior

	---

	## 🎯 Recommendations

	### For Demo/Production on HF Spaces
	Option 1: Include model in repo
	```bash
	cd /home/luigi/rts
	wget https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct-GGUF/resolve/main/qwen2.5-0.5b-instruct-q4_0.gguf
	git add qwen2.5-0.5b-instruct-q4_0.gguf
	git commit -m "feat: Include AI model"
	git push
	```

	### For Quick Testing
	Option 3: Disable AI temporarily
	```python
	# In app.py, comment out AI initialization:
	# ai_analyzer = AIAnalyzer()
	ai_analyzer = None
	```

	### For Development
	Current setup works! Model auto-downloads to current directory.

	---

	## ✅ Summary

	Issue: Permission denied when downloading AI model
	Fix: Smart path resolution + permission testing
	Status: ✅ RESOLVED
	Game: Works with or without AI model
	HF Spaces: Compatible

	Files Modified:
	- `web/ai_analysis.py` (Lines 193-227)

	Commits:
	```bash
	git add web/ai_analysis.py
	git commit -m "fix: AI model path resolution and permission handling"
	git push
	```

	🎉 Ready for deployment!