MrBananaHuman's repositories

CounselGPT

한국어 심리 상담 데이터셋

KorGPT2Tutorial

Tutorial for pretraining Korean GPT-2 model

open-korean-instructions

언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.

Language:PythonStargazers:19Issues:1Issues:0

Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

Stargazers:4Issues:0Issues:0

KoChatGPT

ChatGPT의 RLHF를 학습을 위한 3가지 step별 한국어 데이터셋

Language:Jupyter NotebookStargazers:2Issues:0Issues:0

NengoProject

Spiking Neural Network Model using Nengo

Language:Jupyter NotebookStargazers:2Issues:2Issues:0

ru-dalle

Generate images from texts. In Russian

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:1Issues:0

S2APLER

S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

evolve-instruct

evolve llm training instruction, from english instruction to any language.

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

ko-flan

한국어 FLAN 데이터 구축과 모델 학습을 위한 프로젝트

License:MITStargazers:0Issues:0Issues:0

language-model

한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate)

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

lassl

Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

multilingual-transfer

Multi-lingual transfer experiments

Language:PythonStargazers:0Issues:1Issues:0

odqa_baseline_code

Baseline code for Korean open domain question answering(ODQA)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

textlesslib

Library for Textless Spoken Language Processing

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

tppys

Text processing by pyspark (just sample project)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

unsloth

5X faster 60% less memory QLoRA finetuning

License:Apache-2.0Stargazers:0Issues:0Issues:0

vision-transformer-tf

Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.

Language:PythonStargazers:0Issues:1Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0