FudanEMWLab's repositories
nv_isa_solver
Nvidia Instruction Set Specification Generator
mlir-cgra
An MLIR dialect to enable the efficient acceleration of ML models on CGRAs.
hidet
An open-source, efficient deep learning framework/compiler, written in Python.
morpher
An Open-Source Tool for CGRA Accelerators
stonne
STONNE: A Simulation Tool for Neural Networks Engines
Learn-LLVM-12
Learn LLVM 12, published by Packt
onnx-tool
Shape inference and MACs (FLOPs) counting for ONNX models.
MyTinySTL
Implement a tiny STL in C++11
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
exo
Exocompilation for productive programming of hardware accelerators
ColossalAI
Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
alpa
Auto parallelization for large-scale neural networks
CompilerGym
Reinforcement learning environments for compiler and program optimization tasks
YHs_Sample
Yinghan's Code Sample
CppTemplateTutorial
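A Chinese-language tutorial on C++ templates. Unlike the well-known book C++ Templates, this series teaches C++ templates as a Turing-complete language, aiming to give readers a thorough grasp of metaprogramming. (Work in progress)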
LLVM_for_cpu0
A tutorial for learning LLVM, in which I implement a backend that compiles to machine code for cpu0, a simple RISC CPU.
cuda-samples
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
PINN-1
Simple PyTorch Implementation of Physics Informed Neural Network (PINN)
DeepLearningExamples
Deep Learning Examples
PyChip-py-hcl
A Hardware Construct Language
PyTorch-YOLOv3-ModelArts
Deploy the PyTorch version of YOLOv3 on the Huawei Cloud ModelArts platform, covering training, online prediction, and competition submission.
binding-test
Test language bindings with simple examples.