Tat Trong Vu's repositories
alignment-handbook
Robust recipes to align language models with human and AI preferences
alpaca-lora
Instruct-tune LLaMA on consumer hardware
awesome-instruction-dataset
A collection of open-source datasets for training instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
FlagEmbedding
Dense Retrieval and Retrieval-augmented LLMs
infinity
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of sentence-transformer models and frameworks.
llama-cpp-python
Python bindings for llama.cpp
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
LLM_Helper_Scripts
Some simple scripts that I use day-to-day when working with LLMs and the Hugging Face Hub
LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
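The needle-in-a-haystack test is simple enough to sketch in plain Python: plant a "needle" fact at a chosen depth inside filler context, then check whether the model can retrieve it. The placement logic below is an illustrative assumption, not this repo's actual code, and the LLM call is left as a hypothetical stub:

```python
def build_haystack(filler: str, needle: str, context_len: int, depth_pct: float) -> str:
    """Build ~context_len characters of filler text with `needle` inserted
    depth_pct percent of the way through (0 = start, 100 = end)."""
    haystack = (filler * (context_len // len(filler) + 1))[:context_len]
    pos = int(len(haystack) * depth_pct / 100)
    return haystack[:pos] + needle + haystack[pos:]

def score(model_answer: str, expected: str) -> bool:
    """Pass/fail check: did the model reproduce the needle?"""
    return expected.lower() in model_answer.lower()

# Sweep context lengths and needle depths; a real run would call an LLM
# on each prompt and record a pass/fail grid over (ctx, depth).
needle = "The secret code is 7421."
for ctx in (1_000, 4_000):
    for depth in (0, 50, 100):
        prompt = build_haystack("Lorem ipsum dolor sit amet. ", needle, ctx, depth)
        # answer = call_llm(prompt + "\nWhat is the secret code?")  # hypothetical
```

The result is usually visualized as a heatmap of retrieval accuracy over context length and needle depth.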
lm-evaluation-harness
A framework for few-shot evaluation of language models.
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
LongLoRA
Code and documents of LongLoRA and LongAlpaca
mergekit
Tools for merging pretrained large language models.
openai-token-counter
Accurately count tokens for OpenAI requests, with support for all parameters such as name and functions.
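Chat token counting follows the scheme documented in the OpenAI cookbook for gpt-3.5-turbo/gpt-4: each message carries a fixed overhead, the `name` field costs extra, and the reply is primed with three tokens. A minimal sketch, with a toy whitespace "tokenizer" standing in for tiktoken (an assumption; the real library tokenizes properly and also handles function definitions):

```python
def count_chat_tokens(messages, encode, tokens_per_message=3, tokens_per_name=1):
    """Approximate OpenAI chat token counting (cookbook scheme for
    gpt-3.5-turbo/gpt-4): fixed overhead per message, extra cost for
    `name`, plus 3 tokens priming the assistant's reply."""
    total = 0
    for msg in messages:
        total += tokens_per_message
        for key, value in msg.items():
            total += len(encode(value))
            if key == "name":
                total += tokens_per_name
    return total + 3  # every reply is primed with <|start|>assistant<|message|>

# Toy whitespace tokenizer in place of tiktoken (assumption for illustration).
encode = lambda s: s.split()
n = count_chat_tokens([{"role": "user", "content": "hello there"}], encode)
```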
outlines-regex
Guided Text Generation
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
pytorch_mppi
Model Predictive Path Integral (MPPI) with approximate dynamics, implemented in PyTorch
pytransform3d
3D transformations for Python.
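pytransform3d provides conversions between rotation representations on numpy arrays; the core math behind its axis-angle-to-matrix conversion is Rodrigues' formula, shown here in plain Python (the function name and signature are my own for illustration):

```python
import math

def matrix_from_axis_angle(axis, angle):
    """Rotation matrix from a unit axis and an angle, via Rodrigues' formula:
    R = I + sin(a) K + (1 - cos(a)) K^2, where K is the cross-product
    matrix of the axis."""
    x, y, z = axis
    c, s, t = math.cos(angle), math.sin(angle), 1 - math.cos(angle)
    return [
        [t*x*x + c,   t*x*y - s*z, t*x*z + s*y],
        [t*x*y + s*z, t*y*y + c,   t*y*z - s*x],
        [t*x*z - s*y, t*y*z + s*x, t*z*z + c],
    ]

# 90 degrees about the z-axis maps the x-axis onto the y-axis.
R = matrix_from_axis_angle((0.0, 0.0, 1.0), math.pi / 2)
```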
ray-llm
RayLLM - LLMs on Ray
S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
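The key to serving many adapters is that LoRA keeps one shared base weight W and only swaps the small low-rank pair (A, B) per request: y = x(W + (alpha/r)·BA). A pure-Python sketch of that forward pass (list-of-lists matrices for illustration; real serving engines batch this on GPU):

```python
def matmul(X, Y):
    """Plain list-of-lists matrix multiply, for illustration only."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def lora_forward(x, W, A, B, alpha, r):
    """LoRA forward pass y = x @ (W + (alpha/r) * B @ A), computed as
    x @ W + (alpha/r) * ((x @ B) @ A) so the merged weight is never
    materialized -- the property that lets a server share W across
    thousands of adapters and swap only the tiny A (r x k) / B (d x r)."""
    scale = alpha / r
    base = matmul(x, W)
    low = matmul(matmul(x, B), A)
    return [[b + scale * l for b, l in zip(brow, lrow)]
            for brow, lrow in zip(base, low)]
```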
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
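Embeddings from sentence-transformers are typically compared with cosine similarity. A plain-Python version of that comparison (the vectors below are made up; in real use they would come from the library's `encode` method):

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors: the dot product
    divided by the product of their norms, in [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Made-up vectors standing in for model outputs (assumption for illustration).
emb_a = [0.1, 0.9, 0.2]
emb_b = [0.1, 0.8, 0.3]
sim = cosine_similarity(emb_a, emb_b)
```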
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
text-generation-inference
Large Language Model Text Generation Inference
VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
yarn
YaRN: Efficient Context Window Extension of Large Language Models
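Context-window extension methods work by rescaling RoPE's per-position rotation angles. The sketch below shows plain RoPE angles plus uniform linear position interpolation, the simple baseline; YaRN's contribution is to interpolate non-uniformly across frequency bands (plus an attention temperature adjustment), which the uniform version here does not capture:

```python
import math

def rope_angles(pos, dim, base=10000.0, scale=1.0):
    """Rotation angles RoPE assigns to a token at position `pos`:
    theta_i = pos / base**(2i/dim), one angle per pair of dimensions.
    Dividing positions by `scale` (linear position interpolation) squeezes
    a longer sequence into the position range seen during training."""
    return [(pos / scale) / base ** (2 * i / dim) for i in range(dim // 2)]

# With scale=4, position 8192 gets the angles position 2048 had at train time.
orig = rope_angles(2048, dim=8)
stretched = rope_angles(8192, dim=8, scale=4.0)
```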