linkuew / iu-babylm

A general setup for the BabyLM project, described here: https://babylm.github.io/index.html

General information

The general workflow:

  • Clone this repo
  • Clone the evaluation pipeline repo into a separate directory (https://github.com/babylm/evaluation-pipeline)
  • In a Python venv, install the packages needed to create the model
  • Train a model on the data
  • Save the model and its tokenizer
  • In a Python venv, run the evaluation pipeline on the data
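
The two venv steps above can be sketched with the standard library alone. This is a minimal sketch, not this repo's actual tooling: the helper names (`create_venv`, `venv_python`, `create_project_venvs`) and the directory names `train-env` and `eval-env` are hypothetical, and it assumes one environment for training and a separate one for evaluation so that the two sets of dependencies cannot conflict.

```python
import os
import venv
from pathlib import Path


def create_venv(env_dir: Path, with_pip: bool = False) -> Path:
    """Create a virtual environment at env_dir and return its path.

    with_pip=False keeps the sketch fast; pass True to bootstrap pip
    so that packages can be installed into the environment.
    """
    builder = venv.EnvBuilder(with_pip=with_pip, clear=True)
    builder.create(env_dir)
    return env_dir


def venv_python(env_dir: Path) -> Path:
    """Return the interpreter path inside a venv (Windows or POSIX layout)."""
    if os.name == "nt":
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def create_project_venvs(base_dir: Path) -> dict:
    """Create one venv for training and one for evaluation.

    The directory names are hypothetical, chosen for illustration only.
    """
    return {
        "train": create_venv(base_dir / "train-env"),
        "eval": create_venv(base_dir / "eval-env"),
    }
```

Calling `create_project_venvs(Path("."))` would create both environments in the current directory; `venv_python(...)` then gives the interpreter to use for installing packages and running scripts inside each one.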

About

License: MIT License


Languages

Language: Python 100.0%