---
title: Zebra Logic Bench
emoji: 🦓
colorFrom: blue
colorTo: yellow
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: true
fullWidth: true
hf_oauth: true
api: false
tags:
- leaderboard
datasets:
- allenai/ZebraLogicBench
- WildEval/ZebraLogic
models:
- Qwen/Qwen2-72B-Instruct
- Qwen/Qwen1.5-72B-Chat
- Qwen/Qwen1.5-7B-Chat
- meta-llama/Meta-Llama-3-8B-Instruct
- meta-llama/Meta-Llama-3-70B-Instruct
- meta-llama/Llama-2-13b-chat-hf
- meta-llama/Llama-2-70b-chat-hf
- meta-llama/Llama-2-7b-chat-hf
- mistralai/Mistral-7B-Instruct-v0.1
- mistralai/Mistral-7B-Instruct-v0.2
- mistralai/Mixtral-8x7B-Instruct-v0.1
- microsoft/Phi-3-medium-128k-instruct
- microsoft/Phi-3-mini-128k-instruct
- NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
- NousResearch/Hermes-2-Theta-Llama-3-8B
- 01-ai/Yi-1.5-34B-Chat
- 01-ai/Yi-1.5-9B-Chat
- 01-ai/Yi-1.5-6B-Chat
- google/gemma-7b-it
- google/gemma-2b-it
- allenai/tulu-2-dpo-70b
- HuggingFaceH4/zephyr-7b-beta
- Nexusflow/Starling-LM-7B-beta
- databricks/dbrx-instruct
- princeton-nlp/Llama-3-Instruct-8B-SimPO
- chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO
- chujiezheng/Starling-LM-7B-beta-ExPO
- ZhangShenao/SELM-Zephyr-7B-iter-3
- deepseek-ai/DeepSeek-V2-Chat
- m-a-p/neo_7b_instruct_v0.1
- 01-ai/Yi-34B-chat
- lmsys/vicuna-13b-v1.5
- HuggingFaceH4/zephyr-7b-gemma-v0.1
- deepseek-ai/DeepSeek-Coder-V2
- THUDM/glm-4-9b-chat
- chujiezheng/neo_7b_instruct_v0.1-ExPO
- ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Paper: arxiv.org/abs/2406.04770

Paper: arxiv.org/abs/2502.01100