Amit Hasan's repositories
Accelerating-RecSys-Training
Accelerating Recommender model training by leveraging popular choices -- VLDB 2022
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Bi-GCN
Implementation of "Binary Graph Convolutional Network" in PyTorch Geometric
CSE-5825-Project-Diffusion-Probabilistic-Analysis
This repository is for the CSE 5825 Fall 2023 final project titled "Diffusion Probabilistic Analysis for Inductive vs. Transductive Graph Datasets"
CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
CSRankings-conference-list
These are the conferences used by CSRankings to rank universities
cutlass
CUDA Templates for Linear Algebra Subroutines
deepsparse
Sparsity-aware deep learning inference runtime for CPUs
DGL-clustering
An example for DGL cluster/subgraph manipulation
GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
ICCAD-Accel-GNN
Official Implementation of "Accel-GNN: High-Performance GPU Accelerator Design for Graph Neural Networks"
llama
User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.
LLaMA-Pruning
Structural Pruning for LLaMA
MergePath-SpMM
Merge-path based Parallel Sparse Matrix-Matrix Algorithm for Irregular Sparse Matrices
NasRec
NASRec: Weight Sharing Neural Architecture Search for Recommender Systems
nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
OSDI21_AE
Artifact for OSDI'21 GNNAdvisor
pyllama
LLaMA: Open and Efficient Foundation Language Models
Pytorch-DDP-Example
A minimal example of PyTorch DDP-based single-node multi-GPU training on the MNIST dataset, with different gradient compression methods
qa-lora
Official PyTorch implementation of QA-LoRA
RecSystemsPapers
A Curated List of Must-read Papers on Recommender Systems.
smoothquant
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
the-algorithm-ml_twitter_rec
Source code for Twitter's Recommendation Algorithm
VoxFormer
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
wanda
A simple and effective LLM pruning approach.
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
yolov8
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite