kleeeeea's repositories
Awesome-LLMs-Datasets
A summary of existing representative text datasets for LLMs.
CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
dtt-multi-branch
Code for Controlling Hallucinations at Word Level in Data-to-Text Generation (C. Rebuffel, M. Roberti, L. Soulier, G. Scoutheeten, R. Cancelliere, P. Gallinari)
GEMBA
GEMBA — GPT Estimation Metric Based Assessment
geval
Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"
langchain-tutorials
Overview and tutorial of the LangChain Library
LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
LLMDataHub
A quick guide to datasets, especially trending instruction fine-tuning datasets
MAmmoTH
This repo contains the code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning"
MetaCLIP
Everything about MetaCLIP: curation/training code, metadata, distribution and pre-trained models.
Muffin
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Pangu
Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
QA4RE
Source code of paper 'Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors' (ACL 2023 Findings)
Recformer
Codebase for KDD 2023 paper, Text Is All You Need: Learning Language Representations for Sequential Recommendation
ReChorus
“Chorus” of recommendation models: a light and flexible PyTorch framework for Top-K recommendation.
smartgpt
A program that provides LLMs with the ability to complete complex tasks using plugins.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2020
wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
Youtube-Code-Repository
Repository for most of the code from my YouTube channel