FeatEng

non-profit

https://github.com/FeatEng/FeatEng/

FeatEng

Activity Feed

AI & ML interests

LLMs Evaluation

Recent Activity

Borchmann authored a paper about 2 months ago

AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI

Borchmann authored a paper about 2 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Borchmann submitted a paper about 2 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

View all activity

Organization Card

Community About org cards

The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, which requires domain knowledge in addition to a deep understanding of the underlying problem and data structure. The method can cheaply and efficiently assess the broad capabilities of LLMs in contrast to the existing methods.

See: https://arxiv.org/abs/2410.23331

models 0

None public yet

datasets 2

FeatEng/Benchmark

Viewer • Updated Nov 6, 2024 • 103 • 48

FeatEng/Data

Viewer • Updated Oct 20, 2024 • 4.59M • 929

AI & ML interests

Recent Activity

Team members 2

models 0

datasets 2 Sort: Recently updated

datasets 2