JY-Ren's starred repositories
Labeling-system-for-RLHF
To train the base model of chatgpt, we initially implemented a labeling system based on the actual requirements of SFT and RLHF.
transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
bios_entity_classification
BIOS: An Algorithmically Generated Biomedical Knowledge Graph