PhilipMay / llm-data

LLM Training Data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LLM Data

This repository is mainly about cleaning, converting and checking LLM training datasets.

Datasets

New datasets cleaned and created by this project:

Licensing

Copyright (c) 2024 Philip May

Licensed under the MIT License (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License by reviewing the file LICENSE in the repository.

About

LLM Training Data

License:MIT License


Languages

Language:Jupyter Notebook 100.0%