Akash K.'s repositories
exo
Exocompilation for productive programming of hardware accelerators
MIMDRAM
Source code for the architectural simulator used for modeling the PUD system proposed in our HPCA 2024 paper `MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing''. Paper is at: https://arxiv.org/pdf/2402.19080.pdf
llm-code-watermark
LLM Program Watermarking
omniperf
Advanced Profiling and Analytics for AMD Hardware
MultiPIM
MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator
PIMSimulator
Processing-In-Memory (PIM) Simulator
omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
DNN_NeuroSim_V2.1
Benchmark framework of compute-in-memory based accelerators for deep neural network (on-chip training chip focused)
minimalloc
A lightweight memory allocator for hardware-accelerated machine learning
torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
tensorflow
An Open Source Machine Learning Framework for Everyone
xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
LLaMA-Adapter
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
DNN_NeuroSim_V1.4
Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)
llama
Inference code for LLaMA models
codellama
Inference code for CodeLlama models
DNN_NeuroSim_V1.3
Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)
PIMLibrary
PIM Runtime Library and Tools
LaVIT
LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
MODel_opt
Memory Optimizations for Deep Learning (ICML 2023)
HD-Clustering
Port HD-Clustering to use Hetero-C++
PIMFlow_accel-sim-framework
This is the top-level repository for the Accel-Sim framework.
rake
compiling DSLs to high-level hardware instructions