Salem Messoud's starred repositories
LLM-Finetuning
LLM Finetuning with peft
hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
consistencydecoder
Consistency Distilled Diff VAE
Building-with-Instruction-Tuned-LLMs-A-Step-by-Step-Guide
Resources relating to the DLAI event: https://www.youtube.com/watch?v=eTieetk2dSw
flash-attention
Fast and memory-efficient exact attention
alignment-handbook
Robust recipes to align language models with human and AI preferences
book-dataset
This dataset contains 207,572 books from the Amazon.com, Inc. marketplace.
Semantic-Retrieval-Models
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).
awesome-information-retrieval
A curated list of awesome information retrieval resources
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Instruction-Tuning-Papers
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
FindTheChatGPTer
ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利