Alex Nguyen's repositories
autocrit
A repository for transformer critique learning and generation
baize
Baize is an open-source chatbot trained with ChatGPT self-chatting data, developed by researchers at UCSD and Sun Yat-sen University.
botbots
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering various contexts and tasks (task-oriented dialogue systems, abstract reasoning, brainstorming).
cformers
SoTA Transformers with C-backend for fast inference on your CPU.
ChatGLM-finetune-LoRA
Code for fintune ChatGLM-6b using low-rank adaptation (LoRA)
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
datasetGPT
A command-line interface to generate textual and conversational datasets with LLMs.
DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
ggml
Tensor library for machine learning
gpt4all
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
Instruct_Prompt_Dataset_Generator
This repo is a demo of how to create new instruction prompts using ChatGPT API and using ray as workers to distribute the work across up to 1000 workers.
kinda-llama
An open-source replication and extension of the Meta AI's LLAMA dataset
lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
llama.cpp
Port of Facebook's LLaMA model in C/C++
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation