mit10000

mit10000

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

mit10000's repositories

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

ai_and_memory_wall

AI and Memory Wall blog post

License:MITStargazers:0Issues:0Issues:0

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:0Issues:0Issues:0

awesome-real-time-AI

This is a list of awesome edgeAI inference related papers.

Stargazers:0Issues:0Issues:0

bert4torch

An elegent pytorch implement of transformers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

calm

C(UDA) accelerated language model inference

License:MITStargazers:0Issues:0Issues:0

Computer-Science-Textbooks

Collect some CS textbooks for learning.

Stargazers:0Issues:0Issues:0

cuda_learning

learning how CUDA works

Stargazers:0Issues:0Issues:0

DeepLearningSystem

Deep Learning System core principles introduction.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

gemmini

Berkeley's Spatial Array Generator

License:NOASSERTIONStargazers:0Issues:0Issues:0

gpu-benches

collection of benchmarks to measure basic GPU capabilities

License:GPL-3.0Stargazers:0Issues:0Issues:0

llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

License:MITStargazers:0Issues:0Issues:0

llm_profiler

llm theoretical performance analysis tools and support params, flops, memory and latency analysis.

Stargazers:0Issues:0Issues:0

llmperf

LLMPerf is a library for validating and benchmarking LLMs

License:Apache-2.0Stargazers:0Issues:0Issues:0

mixbench

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

License:GPL-2.0Stargazers:0Issues:0Issues:0

model_analyzer

Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Server models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

License:MITStargazers:0Issues:0Issues:0

PatchTST

An offical implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730

License:Apache-2.0Stargazers:0Issues:0Issues:0

pdfs

Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)

Stargazers:0Issues:0Issues:0

pytorch-benchmark

Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption

License:Apache-2.0Stargazers:0Issues:0Issues:0

scale-sim-v2

Repository to host and maintain scale-sim-v2 code

Stargazers:0Issues:0Issues:0

sparsegpt

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

License:Apache-2.0Stargazers:0Issues:0Issues:0

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Stargazers:0Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

License:MITStargazers:0Issues:0Issues:0

wanda

A simple and effective LLM pruning approach.

License:MITStargazers:0Issues:0Issues:0

zigzag

HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators

License:BSD-3-ClauseStargazers:0Issues:0Issues:0