Update README.md
Browse files
README.md
CHANGED
|
@@ -12,4 +12,53 @@ hf_oauth: true
|
|
| 12 |
hf_oauth_expiration_minutes: 480
|
| 13 |
---
|
| 14 |
|
| 15 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 12 |
hf_oauth_expiration_minutes: 480
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# 🧠 GAIA-Agent
|
| 16 |
+
|
| 17 |
+
A **LLM agent** designed to solve the final assignment of the [HuggingFace Agents course](https://huggingface.co/agents-course).
|
| 18 |
+
The assignment consists of 20 level 1 questions of the [GAIA benchmark](https://huggingface.co/datasets/andrewrreed/GAIA).
|
| 19 |
+
This agent is built with [LangGraph](https://www.langchain.com/langgraph), [LangChain](https://www.langchain.com/),
|
| 20 |
+
and is tracked using [LangFuse](https://www.langfuse.com/).
|
| 21 |
+
It uses real-time web search, Wikipedia lookups, file parsing, image analysis, audio transcription to provide accurate answers.
|
| 22 |
+
|
| 23 |
+
## 🛠️ Technical Details
|
| 24 |
+
|
| 25 |
+
| Component/ Tool | Tech Used |
|
| 26 |
+
|---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
| 27 |
+
| Agent framework | [LangGraph](https://www.langchain.com/langgraph) + [LangChain](https://www.langchain.com/) |
|
| 28 |
+
| LLM | [Meta LLaMA 4 Maverick 17B 128e](https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct) via [Groq](https://groq.com/) |
|
| 29 |
+
| Web search | [SerperAPI](https://serper.dev/), [requests](https://requests.readthedocs.io/en/latest/), [Playwright](https://playwright.dev/python/) for rendering JSON |
|
| 30 |
+
| HTML parsing | [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/), [markdownify](https://github.com/matthewwithanm/python-markdownify) for converting HTML to Markdown |
|
| 31 |
+
| Wikipedia | [Wikimedia API](https://api.wikimedia.org/wiki/Core_REST_API) |
|
| 32 |
+
| File parsing | [Unstructured](https://unstructured.io/) (for PDFs, PowerPoint, Word), [pandas](https://pandas.pydata.org/) (for CSV, TSV, Excel) |
|
| 33 |
+
| Audio transcription | [OpenAI Whisper base](https://github.com/openai/whisper) |
|
| 34 |
+
| Agent monitoring | [LangFuse](https://www.langfuse.com/) |
|
| 35 |
+
| UI / API | [Gradio](https://www.gradio.app/) based on the [Final Assignment Template](https://huggingface.co/spaces/agents-course/First_agent_template) |
|
| 36 |
+
|
| 37 |
+
|
| 38 |
+
## 🔐 Setup Instructions
|
| 39 |
+
|
| 40 |
+
1. **Clone the repository**:
|
| 41 |
+
```bash
|
| 42 |
+
git clone https://github.com/phucdev/GAIA-agent.git
|
| 43 |
+
cd GAIA-agent
|
| 44 |
+
```
|
| 45 |
+
2. **Install dependencies**:
|
| 46 |
+
```bash
|
| 47 |
+
pip install -r requirements.txt
|
| 48 |
+
```
|
| 49 |
+
3. **Create API keys and set-up environment variables**:
|
| 50 |
+
Use `env.example` to create a `.env` file in the root directory and replace the placeholders with your actual API keys:
|
| 51 |
+
```plaintext
|
| 52 |
+
GROQ_API_KEY=your_groq_api_key
|
| 53 |
+
SERPER_API_KEY=your_serper_api_key
|
| 54 |
+
... (other API keys as needed)
|
| 55 |
+
```
|
| 56 |
+
4. Optional: For rendering JS with Playwright, you may need some additional setup:
|
| 57 |
+
```bash
|
| 58 |
+
playwright install
|
| 59 |
+
sudo apt-get install libgtk-3-0
|
| 60 |
+
```
|
| 61 |
+
5. Then you can run the app with:
|
| 62 |
+
```bash
|
| 63 |
+
python app.py
|
| 64 |
+
```
|