andy's repositories
bert-solr-search
Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU
factuality-eval
Library for iPython notebooks for evaluating factuality.
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
aws-machine-learning-university-accelerated-cv
Machine Learning University: Accelerated Computer Vision Class
aws-machine-learning-university-accelerated-nlp
Machine Learning University: Accelerated Natural Language Processing Class
aws-machine-learning-university-accelerated-tab
Machine Learning University: Accelerated Tabular Data Class
aws-modern-application-workshop
A tutorial for developers that want to learn about how to build modern applications on top of AWS. You will build a sample website that leverages infrastructure as code, containers, serverless code functions, CI/CD, and more.
aws-serverless-workshops
Code and walkthrough labs to set up serverless applications for Wild Rydes workshops
bitsandbytes
8-bit CUDA functions for PyTorch
datasets-server
Lightweight web API for visualizing and exploring all types of datasets - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
gensim
Topic Modelling for Humans
happy-transformer
Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
langchain
⚡ Building applications with LLMs through composability ⚡
paperai
📄 🤖 Semantic search and workflows for medical/scientific papers
pd3f
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows