README / README.md
Borchmann's picture
Update README.md
d84eb49 verified
metadata
title: README
emoji: πŸ“Š
colorFrom: green
colorTo: green
sdk: static
pinned: false
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/600b381d3cc3b87db94bc0ce/_qucT89ORB8kzC5dpf84H.png
short_description: The benchmark for LLMs in Data Science setup.
license: apache-2.0

The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, which requires domain knowledge in addition to a deep understanding of the underlying problem and data structure. The method can cheaply and efficiently assess the broad capabilities of LLMs in contrast to the existing methods.

See: https://arxiv.org/abs/2410.23331