binwensun / awesome-ai4code-papers

A collection of recent papers, benchmarks, and datasets in the AI4Code domain.

Recent Advances in AI4CODE.

A niche collection of AI4Code papers and other resources (datasets, tutorials, etc.). This list focuses mostly on papers that use pre-trained models and deep learning techniques for programming language processing. There are also other collections that cover a wider range of AI4Code papers, such as:

Academic conferences that usually publish AI4Code papers:

Software Engineering/Programming Languages

The emphasis is on combining program analysis and deep learning to solve novel software engineering/programming languages tasks. In most cases, strong empirical results are required. New datasets are usually curated.

Machine Learning/AI

The emphasis is on designing novel neural network architectures to process code. New datasets are usually curated.

Natural Language Processing

The emphasis is on applying NLP techniques to code, and evaluation consists primarily of running the models on known benchmark datasets; unique tasks are rarely introduced.

Evaluate CodeLLMs

CodeLLMs for Code Generation

Repo-Level CodeLLMs

Benchmarking CodeLLMs

Pretrained Models for Code

Dataset and Benchmark

Talk and Tutorial
