qmpham's repositories
alpaca-lora
Instruct-tune LLaMA on consumer hardware
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Awesome-Multimodal-Large-Language-Models
Latest Papers and Datasets on Multimodal Large Language Models
awesome-swedish-nlp
A curated list of resources for natural language processing (NLP) in Swedish
cdx_toolkit
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine
chameleon-llm
Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
conan
Conan - The open-source C/C++ package manager
cramming
Cramming the training of a (BERT-type) language model into limited compute.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
fairseq2
FAIR Sequence Modeling Toolkit 2
flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as Flan-T5.
gpt4all
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
Gymnasium
A standard API for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
LASER
Language-Agnostic SEntence Representations
llama-recipes
Examples and recipes for the Llama 2 model
llama.cpp
Port of Facebook's LLaMA model in C/C++
metaseq
Repo for external large-scale work
prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
qmpham.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
safari
Convolutions for Sequence Modeling
summarize-from-feedback
Code for "Learning to summarize from human feedback"
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
zeno-build
Set up common LLM applications with evaluation