pauldanielconway's repositories
gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
llama-cpp-python
Python bindings for llama.cpp
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
google-research
Google Research
trl
Train transformer language models with reinforcement learning.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
flash-attention
Fast and memory-efficient exact attention
triton
Development repository for the Triton language and compiler
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
transformers
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
scikit-learn
scikit-learn: machine learning in Python
accelerate
π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
gpt-2-simple
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
peft
π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Awesome-Code-LLM
π¨βπ» An awesome and curated list of best code-LLM for research.
tqdm
:zap: A Fast, Extensible Progress Bar for Python and CLI
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
datasets
π€ The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
trax
Trax β Deep Learning with Clear Code and Speed
ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.