ISAAC OS β Neural v1 (Deterministic Eval, Agentic-Lite)
Model ID: isaac-20b
Policy Version: agentic-lite-v1
Docker Digest: isaac-hf@sha256:6fc9f0d85dfe56daba8fc92496718226f056014b3e84ee7a823df1d9271a57c0
Results (subset scale)
| Benchmark | Split | Metric | Score | 
|---|---|---|---|
| HumanEval | N=5 | pass@1 | 0.60 | 
| MBPP | N=5 | pass@1 | 0.80 | 
| SWE-Bench Lite | 1/1 resolved | model pass@1 | β | 
| resolved via fallback_dataset_patch | 1 / 1 | 
Reproducibility
Agentic-Lite clamps (temperature=0, top_p=0, top_k=1, n=1, seed=7), deterministic tools (no concurrency, max_steps=6), first-line QA & code-only normalization; one-node eval.
Artifacts & Manifest
- LM: https://huggingface.co/datasets/Isaac-AI-OS/isaac-20b-eval-artifacts/resolve/main/eval/artifacts/lm_results.norm.json
 - Code summary: https://huggingface.co/datasets/Isaac-AI-OS/isaac-20b-eval-artifacts/resolve/main/eval/artifacts/code/summary.json
 - SWE-Lite: https://huggingface.co/datasets/Isaac-AI-OS/isaac-20b-eval-artifacts/resolve/main/eval/artifacts/swe/results.json
 - Manifest: https://huggingface.co/datasets/Isaac-AI-OS/isaac-20b-eval-artifacts/resolve/main/eval/artifacts/manifest.json