mrd / datasets

source{d} datasets ("big code") for source code analysis and machine learning on source code

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

datasets Build Status Build status

source{d} datasets for source code analysis and machine learning on source code.

This repository contains all the needed tools and scripts to reproduce the datasets.

List of available datasets:

Contributions

Contributions are very welcome, please see CONTRIBUTING.md and code of conduct.

License

The tools and scripts are licensed under Apache 2.0, see LICENSE.md.

About

source{d} datasets ("big code") for source code analysis and machine learning on source code

License:Other


Languages

Language:Jupyter Notebook 70.3%Language:Go 22.7%Language:CSS 5.1%Language:HTML 1.6%Language:Makefile 0.4%