Jay Alammar's starred repositories
pydocstyle
docstring style checker
TransformerLens
A library for mechanistic interpretability of GPT-style language models
pynndescent
A Python nearest neighbor descent for approximate nearest neighbors
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
python-arabic-reshaper
Reconstruct Arabic sentences to be used in applications that don't support Arabic
camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
tantivy-py
Python bindings for Tantivy
toy-ml-pipeline
Toy example of an applied ML pipeline for me to experiment with MLOps tools.
amazon-sagemaker-architecting-for-ml
Materials for a 2-day instructor led course on applying machine learning
tokenizations
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
sandbox-conversant-lib
Conversational AI tooling & personas built on Cohere's LLMs
genome-spy
A visualization grammar and GPU-accelerated toolkit for genomic data
sandbox-grounded-qa
A sandbox repo for grounded question answering with Cohere and Google Search
sandbox-toy-semantic-search
A demonstration of how a toy (but usable!) semantic search engine can be quickly built using Cohere's platform.
xai-benchmark
A Diagnostic Study of Explainability Techniques for Text Classification
rnn_agreement
Evaluating recurrent neural networks on predicting subject-verb agreement dependencies
sandbox-accelerating-chatbot-training
Leveraging Cohere's models to enable zero-shot routing
efficient-columnwise-correlation
Efficient ways to compute Pearson's correlation between columns of two matrices in various scientific computing languages