donggyukimc's starred repositories
the-algorithm
Source code for Twitter's Recommendation Algorithm
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
comprehensive-rust
This is the Rust course used by the Android team at Google. It provides you with the material to quickly teach Rust.
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
incubator-opendal
Apache OpenDAL: access data freely.
helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).
DeepSeek-LLM
DeepSeek LLM: Let there be answers
llm-meetup
Liner LLM Meetup archive
vrdu
We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datasets that represent several challenges: rich schema including diverse data types, complex templates, and diversity of layouts within a single document type.
liner-pdf-chat-tutorial
LINER PDF Chat Tutorial with ChatGPT & Pinecone