qmpham's repositories
fairseq2
FAIR Sequence Modeling Toolkit 2
qmpham.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
llama-recipes
Examples and recipes for the Llama 2 model
metaseq
Repo for external large-scale work
llama.cpp
Port of Facebook's LLaMA model in C/C++
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Awesome-Multimodal-Large-Language-Models
Latest Papers and Datasets on Multimodal Large Language Models
prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
zeno-build
Set up common LLM applications with evaluation
LASER
Language-Agnostic SEntence Representations
Gymnasium
A standard API for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
chameleon-llm
Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
alpaca-lora
Instruct-tune LLaMA on consumer hardware
gpt4all
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as Flan-T5.
safari
Convolutions for Sequence Modeling
cdx_toolkit
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine
awesome-swedish-nlp
A curated list of resources for natural language processing (NLP) in Swedish
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
summarize-from-feedback
Code for "Learning to summarize from human feedback"
conan
Conan - The open-source C/C++ package manager
cramming
Cramming the training of a (BERT-type) language model into limited compute.