zhangxy-2019's repositories
TransformerLens
A library for mechanistic interpretability of GPT-style language models
long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
promptbench
A unified evaluation framework for large language models
Multimodal-AND-Large-Language-Models
A paper list on multimodal models and large language models; a personal record of papers read from the daily arXiv.
ReAlign
Reformatted Alignment
wikiextractor
A tool for extracting plain text from Wikipedia dumps
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
LLM-Factuality-Survey
The repository for the survey paper "Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity"
OpenLLaMA2
A DeepSpeed+Ray-based LLaMA2 PT/RLHF/RS training framework
llama
Inference code for LLaMA models
research-course
"How to Do Great Research" Course for Ph.D. Students
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
CUHK-PhD-Thesis-Template
Latex template for CUHK PhD Thesis
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Megatron-LM
Ongoing research training transformer models at scale
lantern
Official Lantern downloads - a proxy/VPN for censorship circumvention (GFW) and fast, reliable, secure access to the open internet - lantern proxy vpn censorship-circumvention censorship gfw accelerator
IC-DST
Code base for In-Context Learning for Dialogue State Tracking
icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
superset
Apache Superset is a Data Visualization and Data Exploration Platform
Awesome_Few_Shot_Learning
Advances in few-shot learning, especially for NLP applications.
transfer-learning-conv-ai
🦄 State-of-the-Art Conversational AI with Transfer Learning
latex_paper_writing_tips
Tips for Writing a Research Paper using LaTeX
TextualExplInContext
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)