Vikram Sharma's repositories
nn_dataflow
Modified version of the "Explore the energy-efficient dataflow scheduling for neural networks. "
carml
CarML (Cognitive ARtifacts for Machine Learning) is an open source distributed platform to easily deploy and benchmark machine learning (ML) and deep learning (DL) frameworks and models across hardware infrastructures, within a common interface. CarML allows ML/DL developers to publish and evaluate their models, users to experiment with published models, and system architects to profile end-to-end workflows to inform system designs.
ensembleMLBP
EnsembleMLBP is a ECE511 course project created by Vikram, Umur and Andrew. Here we explore ensemble+ML based branch predictor. Initially we analyse tournamentBP and provide it loopholes. In second half we discuss RNN-LSTM and reinforcement learning schemes for BP. More details of the project will be added soon. The project is in early stage of development.
gem5_experiments
Trying out different experiments in gem5
llm.c
LLM training in simple, raw C/CUDA
MemXCT-GPU-NonMPI-1
Non-MPI version for single GPU
pytorch-cifar
This is a forked version from kuangliu/pytorch-cifar with integration of torchgpipe. It is currently non-functional with more than one GPU.
rai
The RAI client allows one to interact with a cluster of machine to submit and evaluate code. RAI is a scalable job submission system designed for diverse workloads. RAI’s design addresses challenges of scalability, configurability, security, and cost in delivering a flexible programming environments.
SimpleGPUHashTable
A simple GPU hash table implemented in CUDA using lock free techniques
timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.