zhql's repositories
alignment-handbook
Robust recipes for to align language models with human and AI preferences
chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
ControlNet
Let us control diffusion models!
ConvNeXt-V2
Code release for ConvNeXt V2 model
datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
detectron2
Detectron2 is FAIR's next-generation research platform for object detection and segmentation.
Dataset-Pruning
Dataset pruning for ImageNet and LAION-2B.
DeepSeek-LLM
DeepSeek LLM: Let there be answers
deit
Official DeiT repository
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
EVA
Exploring the Limits of Masked Visual Representation Learning at Scale (https://arxiv.org/abs/2211.07636)
how-to-train-tokenizer
怎么训练一个LLM分词器
jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
NLP_Assignment
NLP Assignment of NJU...
OLMo
Modeling, training, eval, and inference code for OLMo
open_clip
An open source implementation of CLIP.
prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Volcano
[NAACL 2024] Official github for "Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision"