Yunjie's repositories
low-bit-quant-admm
ADMM for layer-wise low-bit weight quantization of CNNs
LQ-net-pytorch
LQ-net implementation in PyTorch
Accelerate1rnn
Uses Eigen and SIMD to accelerate a simple 1-layer NN.
awesome-point-cloud-analysis
A list of papers and datasets about point cloud analysis (processing)
blockdrop
BlockDrop: Dynamic Inference Paths in Residual Networks
DeepCompression-caffe-master
log quantization
faiss
A library for efficient similarity search and clustering of dense vectors.
FedML
A research-oriented federated learning library. Supports distributed computing, mobile/IoT on-device training, and standalone simulation. Best Paper Award at NeurIPS 2020 Federated Learning workshop. Join our Slack community: https://join.slack.com/t/fedml/shared_invite/zt-havwx1ee-a1xfOUrATNfc9DFqU~r34w
fpu
Synthesizable IEEE 754 floating-point library in Verilog
grappolo
OpenMP implementation of Graph Community Detection, with a number of parallel heuristics/approximate computing techniques
hiddenlayer
Neural network graphs and training metrics for PyTorch, TensorFlow, and Keras.
hugo-academic
📝 The website builder for Hugo. Build and deploy a beautiful website in minutes!
incubator-tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
llama
Inference code for LLaMA models
llama-recipes
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candid inference solutions such as HF TGI and vLLM for local or cloud deployment. Demo apps to showcase Llama2 for WhatsApp & Messenger
once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
pyjhzwh.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
Spoon-Knife
This repo is for demonstration purposes only.
TASO
The Tensor Algebra SuperOptimizer for Deep Learning
temporal-triangle-counting
Code for the paper "Faster and Generalized Temporal Triangle Counting, via Degeneracy Ordering" (KDD 2021)
TestFile
A test repository for trying out Git only
tflite-micro
TensorFlow Lite for Microcontrollers
Tiger-Compiler
A tiny compiler for Tiger
torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
zju-icicles
Zhejiang University course guide sharing project