Christin David Bose (christindbose)



Company: Purdue University

Location: West Lafayette

Twitter: @ChristinBose



Organizations
ECE468

Christin David Bose's repositories

Language: Cuda | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0
Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0
Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0

AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CodeGen

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

dlrm_syn

An implementation of a deep learning recommendation model (DLRM)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

ECE60827_simulation_project_part1_old

Part 1 of HW simulation project

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0
Language: C++ | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0
Language: C++ | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

ECE60827_simulation_project_part4-bonus

Repo for the HW simulation project part 4 (bonus)

Language: C++ | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Language: Cuda | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

FLsystem-paper

Federated Learning Systems

Stargazers: 0 | Issues: 0 | Issues: 0

ggml

Tensor library for machine learning

Language: C | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

HierarchicalKV

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of Merlin-KV is to store key-value feature-embeddings on high-bandwidth memory (HBM) of GPUs and in host memory. It can also be used as generic key-value storage.

Language: Cuda | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

llama

Inference code for LLaMA models

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

LLM-Pruner

LLM-Pruner: On the Structural Pruning of Large Language Models

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LLM4HWDesign_Starting_Toolkit

LLM4HWDesign Starting Toolkit

Stargazers: 0 | Issues: 0 | Issues: 0

mgpu-gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

ml-engineering

Machine Learning Engineering Guides and Tools

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization"

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Starlark | Stargazers: 0 | Issues: 0 | Issues: 0

oss-arch-gym

Open-source version of the ArchGym project.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-direct_dgl

PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)

Stargazers: 0 | Issues: 0 | Issues: 0

reproduce_isca23_cpu_DLRM_inference

Sharing the codebase and steps for artifact evaluation for ISCA 2023 paper

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

superblock

A block-oriented training approach for inference-time optimization.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0