Pham Van Ngoan's starred repositories
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
NeuScraper
[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".
distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
youtubeuploader
Scripted uploads to Youtube
promptbase
All things prompt engineering
tensorrtllm_backend
The Triton TensorRT-LLM Backend
TaskWeaver
A code-first agent framework for seamlessly planning and executing data analytics tasks.
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
opentelemetry-specification
Specifications for OpenTelemetry