Jordan T Bates's starred repositories
ColossalAI
Making large AI models cheaper, faster and more accessible
recommenders
Best Practices on Recommendation Systems
vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
gnn-model-explainer
gnn explainer
tensordict
TensorDict is a pytorch dedicated tensor container.
robust_loss_pytorch
A pytorch port of google-research/google-research/robust_loss/
LargeBatchCTR
Large batch training of CTR models based on DeepCTR with CowClip.
dviz-course
Data visualization course material
fasttrackml
Experiment tracking server focused on speed and scalability
CachedEmbedding
A memory efficient DLRM training solution using ColossalAI
wukong-recommendation
Implements the paper "Wukong: Towards a Scaling Law for Large-Scale Recommendation" from Meta.
CEDS-Data-Warehouse-Parquet
The Common Education Data Standards (CEDS) Data Warehouse Parquet (DW Parquet) standard is designed for data engineering and data science needs in the cloud. The DW Parquet Models mirror the SQL-based CEDS Data Warehouse. Parquet files are designed for rapid and distributed reporting across multiple technology stacks, data processing and BI tools, and are cloud vendor agnostic. This standard is ideal for stakeholders implementing reporting structures in a data lake environment.
SeqTestBlog
Simulation files for Schultzberg & Ankargren blogpost 2023