Beast code in Giters

jaemyungkim's repositories

awesome-fpga-list

A collection of some awesome public FPGA projects.

AI-Chip

A list of ICs and IPs for AI, Machine Learning and Deep Learning.

Language: PHP

awesome-yolo-object-detection

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.

BJUT_Tutorials

Things for new students to learn in the Lab for AI Chips and Systems of BJTU.

CFU-Playground

Want a faster ML processor? Do it yourself! A framework for playing with custom opcodes to accelerate TensorFlow Lite for Microcontrollers (TFLM). Online tutorial: https://google.github.io/CFU-Playground/

License: Apache-2.0

CNN-Accelerator-VLSI

Convolutional accelerator kernel targeting ASIC and FPGA.

License: Apache-2.0

data-gradients

Computer Vision dataset analysis

License: Apache-2.0

dnn-engine

AXI-Stream universal DNN engine with a novel dataflow, enabling 70.7 Gops/mm² on TSMC 65 nm GP for 8-bit VGG16

Language: Python, License: Apache-2.0

DNN_HLS_Accelerator

This repository contains source code for the CNN layers of AlexNet, written with Xilinx Vivado HLS.

EfficientPyTorch

A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.

Language: Python

finn

Dataflow compiler for QNN inference on FPGAs

Language: Python, License: BSD-3-Clause

finn-hlslib

Vivado HLS library for FINN

Language: C++, License: BSD-3-Clause

HandyFigure

HandyFigure provides the source files (usually PPT files) for paper figures.

License: MIT

HLS-Tiny-Tutorials

Language: C++, License: NOASSERTION

LilNetX

Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification

License: MIT

micronet

micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), covering high-bit (>2b) methods (DoReFa; Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference) and low-bit (≤2b) ternary and binary methods (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, regular, and group convolutional channel pruning; (3) group convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT, fp32/fp16/int8 (PTQ calibration), op-adapt (upsample), dynamic_shape.

Language: Python, License: MIT

nanodet

⚡ Super fast and lightweight anchor-free object detection model. 🔥 Only 1.8 MB and runs at 97 FPS on a cellphone 🔥

Language: Python, License: Apache-2.0

Neural-Networks-on-Silicon

This was originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning and computer architecture.

nni

An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.

License: MIT