David Macêdo, PhD's starred repositories
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
amazon-dsstne
Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models
RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
dialoqbase
Create chatbots with ease
ForestDiffusion
Generating and Imputing Tabular Data via Diffusion and Flow XGBoost Models