Saj's repositories
AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
AIDB
ai4db and db4ai work
Augmentation-Adapted-Retriever
[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In".
AutoClean
Python package for automated data preprocessing & cleaning.
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
DAIL-SQL
An efficient and effective few-shot NL2SQL method on GPT-4.
dq_label_noise
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
evaporate
This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"
faiss
A library for efficient similarity search and clustering of dense vectors.
FLARE
Forward-Looking Active REtrieval-augmented generation (FLARE)
frictionless-py
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
gorilla
Gorilla: An API store for LLMs
handson-ml3-works
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
llama-hub-cot
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
MIT_introtodeeplearning
Lab Materials for MIT 6.S191: Introduction to Deep Learning
ml-road
Machine Learning Resources, Practice and Research
pandera
A light-weight, flexible, and expressive statistical data testing library
PraisonAI
The PraisonAI application combines AutoGen, CrewAI, or similar frameworks into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration.
prompt-engineering
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
pyo3
Rust bindings for the Python interpreter
responsible-ai-toolbox-mitigations
Python library for implementing Responsible AI mitigations.
rust_torch
Rust bindings for the C++ api of PyTorch.
semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
TableCoT
The code and data used for the EACL 2023 paper "Large Language Models are few(1)-shot Table Reasoners"
trafficserver
Apache Traffic Server™ is a fast, scalable and extensible HTTP/1.1 and HTTP/2 compliant caching proxy server.
ydata-synthetic
Synthetic data generators for tabular and time-series data