MrBananaHuman's repositories
CounselGPT
한국어 심리 상담 데이터셋
KorGPT2Tutorial
Tutorial for pretraining Korean GPT-2 model
open-korean-instructions
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
Awesome-LLM-Tabular
Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data
NengoProject
Spiking Neural Network Model using Nengo
evolve-instruct
evolve llm training instruction, from english instruction to any language.
ko-flan
한국어 FLAN 데이터 구축과 모델 학습을 위한 프로젝트
language-model
한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)
LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
multilingual-transfer
Multi-lingual transfer experiments
odqa_baseline_code
Baseline code for Korean open domain question answering(ODQA)
textlesslib
Library for Textless Spoken Language Processing
tppys
Text processing by pyspark (just sample project)
unsloth
5X faster 60% less memory QLoRA finetuning
vision-transformer-tf
Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.