TODO
- Initialize Git repository and perform initial commit.
- Install necessary dependencies: `express`, `node-llama-cpp`.
- Create the basic server structure in `index.js`.
- Implement the `/v1/chat/completions` endpoint.
- Load the Qwen3 model.
- Implement the inference logic using `node-llama-cpp`.
- Add error handling.
- Add logging.
- Write tests for the API endpoint.
- Update `CLAUDE.md` with detailed documentation.
- Update `ARCHITECTURE.md` with the project architecture.
- Push the initial project to the GitHub repository.
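
As a starting point for the endpoint, error-handling, and test items, here is a minimal, dependency-free sketch of the request validation and response shaping that a `/v1/chat/completions` handler might use. The helper names (`validateChatRequest`, `buildChatCompletion`), the `chatcmpl-` id format, and the restriction to string-only message content are assumptions made for illustration; the response layout follows the commonly used OpenAI-style chat completion shape, and the model call itself is left out.

```javascript
// Hypothetical helpers for index.js; names and id format are illustrative only.

// Validate the parsed JSON body of a POST /v1/chat/completions request.
// Returns an error message string, or null when the body looks acceptable.
// Assumption: message content is a plain string (no multi-part content).
function validateChatRequest(body) {
  if (body === null || typeof body !== "object") {
    return "request body must be a JSON object";
  }
  if (!Array.isArray(body.messages) || body.messages.length === 0) {
    return "`messages` must be a non-empty array";
  }
  for (const msg of body.messages) {
    const ok =
      msg !== null &&
      typeof msg === "object" &&
      typeof msg.role === "string" &&
      typeof msg.content === "string";
    if (!ok) {
      return "each message needs string `role` and `content` fields";
    }
  }
  return null;
}

// Wrap generated text in an OpenAI-style chat completion object.
// `now` is injectable (milliseconds since epoch) so tests are deterministic.
function buildChatCompletion(modelName, text, now = Date.now()) {
  return {
    id: `chatcmpl-${now}`, // assumed id scheme
    object: "chat.completion",
    created: Math.floor(now / 1000), // seconds since epoch
    model: modelName,
    choices: [
      {
        index: 0,
        message: { role: "assistant", content: text },
        finish_reason: "stop",
      },
    ],
  };
}
```

In an Express route handler, a non-null result from `validateChatRequest` could be returned with `res.status(400).json(...)`, and the text produced by the inference step could be passed through `buildChatCompletion` before calling `res.json(...)`. Keeping both helpers free of I/O makes them easy to cover in the planned API tests.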