johnaffolter / aideml

AIDE: Autonomous AI for Data Science

Home Page:https://www.weco.ai/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

AIDE: Autonomous AI for Data Science

Welcome to the official repository for AIDE, an AI system that can automatically solve data science tasks at a human level, and with human input, it can perform even better. We believe giving developers and researchers direct access to AIDE locally, with local compute and choice to use their own LLM keys, is the most straightforward way to make it useful. That's why we'll open-source it, and the tentative timeline is it will arrive before the end of April. Currently, this repository serves as a gallery showcasing its solutions for 60+ Kaggle competitions we tested.

About AIDE

AIDE is an AI-powered data science assistant that can autonomously understand task requirements, design, and implement solutions. By leveraging large language models and innovative agent architectures, such as the Solution Space Tree Search algorithm, AIDE has achieved human-level performance on a wide range of data science tasks, outperforming over 50% of human data scientists on Kaggle competitions.

Gallery

Domain Task Top% Solution Link Competition Link
Urban Planning Forecast city bikeshare system usage 5% link link
Physics Predicting Critical Heat Flux 56% link link
Genomics Classify bacteria species from genomic data 0% link link
Agriculture Predict blueberry yield 58% link link
Healthcare Predict disease prognosis 0% link link
Economics Predict monthly microbusiness density in a given area 35% link link
Cryptography Decrypt shakespearean text 91% link link
Data Science Education Predict passenger survival on Titanic 78% link link
Software Engineering Predict defects in c programs given various attributes about the code 0% link link
Real Estate Predict the final price of homes 5% link link
Real Estate Predict house sale price 36% link link
Entertainment Analytics Predict movie worldwide box office revenue 62% link link
Entertainment Analytics Predict scoring probability in next 10 seconds of a rocket league match 21% link link
Environmental Science Predict air pollution levels 12% link link
Environmental Science Classify forest categories using cartographic variables 55% link link
Computer Vision Predict the probability of machine failure 32% link link
Computer Vision Identify handwritten digits 14% link link
Manufacturing Predict missing values in dataset 70% link link
Manufacturing Predict product failures 48% link link
Manufacturing Cluster control data into different control states 96% link link
Natural Language Processing Classify toxic online comments 78% link link
Natural Language Processing Predict passenger transport to an alternate dimension 59% link link
Natural Language Processing Classify sentence sentiment 42% link link
Natural Language Processing Predict whether a tweet is about a real disaster 48% link link
Business Analytics Predict total sales for each product and store in the next month 87% link link
Business Analytics Predict book sales for 2021 66% link link
Business Analytics Predict insurance claim amount 80% link link
Business Analytics Minimize penalty cost in scheduling families to santa's workshop 100% link link
Business Analytics Predict yearly sales for learning modules 26% link link
Business Analytics Binary classification of manufacturing machine state 60% link link
Business Analytics Forecast retail store sales 36% link link
Business Analytics Predict reservation cancellation 54% link link
Finance Predict the probability of an insurance claim 13% link link
Finance Predict loan loss 0% link link
Finance Predict a continuous target 42% link link
Finance Predict customer churn 24% link link
Finance Predict median house value 58% link link
Finance Predict closing price movements for nasdaq listed stocks 99% link link
Finance Predict taxi fare 100% link link
Finance Predict insurance claim probability 62% link link
Biotech Predict cat in dat 66% link link
Biotech Predict the biological response of molecules 62% link link
Biotech Predict medical conditions 92% link link
Biotech Predict wine quality 61% link link
Biotech Predict binary target without overfitting 98% link link
Biotech Predict concrete strength 86% link link
Biotech Predict crab age 46% link link
Biotech Predict enzyme characteristics 10% link link
Biotech Classify activity state from sensor data 51% link link
Biotech Predict horse health outcomes 86% link link
Biotech Predict the mohs hardness of a mineral 64% link link
Biotech Predict cirrhosis patient outcomes 51% link link
Biotech Predict obesity risk 62% link link
Biotech Classify presence of feature in data 66% link link
Biotech Predict patient's smoking status 40% link link

About

AIDE: Autonomous AI for Data Science

https://www.weco.ai/