pauldanielconway

pauldanielconway's repositories

gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Apache-2.0000

llama-cpp-python

Python bindings for llama.cpp

MIT000

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Apache-2.0000

lm-evaluation-harness

A framework for few-shot evaluation of language models.

MIT000

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

AGPL-3.0000

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Apache-2.0000

google-research

Google Research

Apache-2.0000

trl

Train transformer language models with reinforcement learning.

Apache-2.0000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Apache-2.0000

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

NOASSERTION000

flash-attention

Fast and memory-efficient exact attention

BSD-3-Clause000

triton

Development repository for the Triton language and compiler

MIT000

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Apache-2.0000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Apache-2.0000

scikit-learn

scikit-learn: machine learning in Python

BSD-3-Clause000

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Apache-2.0000

Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

000

gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Language:PythonNOASSERTION000

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Apache-2.0000

Awesome-Code-LLM

👨‍💻 An awesome and curated list of best code-LLM for research.

MIT000

tqdm

:zap: A Fast, Extensible Progress Bar for Python and CLI

NOASSERTION000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

MIT000

Tencent-Hunyuan-Large

NOASSERTION000

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

CC0-1.0000

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Apache-2.0000

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Apache-2.0000

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

NOASSERTION000

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

MIT000

trax

Trax — Deep Learning with Clear Code and Speed

Apache-2.0000

ERNIE

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

000