FudanEMWLab's repositories
alpa
Auto parallelization for large-scale neural networks
AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
binding-test
Test language bindings with simple examples.
ColossalAI
Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
CompilerGym
Reinforcement learning environments for compiler and program optimization tasks
CppTemplateTutorial
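A Chinese-language tutorial on C++ templates. Unlike the well-known book C++ Templates, this series teaches C++ templates as a Turing-complete language, aiming to help readers gain a thorough command of metaprogramming. (Work in progress)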
cuda-samples
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
DeepLearningExamples
Deep Learning Examples
exo
Exocompilation for productive programming of hardware accelerators
hidet
An open-source, efficient deep learning framework/compiler, written in Python.
Learn-LLVM-12
Learn LLVM 12, published by Packt
LLVM_for_cpu0
A tutorial for learning LLVM that implements a backend to compile machine code for cpu0, a simple RISC CPU.
mlir-cgra
An MLIR dialect to enable the efficient acceleration of ML models on CGRAs.
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
morpher
An Open-Source Tool for CGRA Accelerators
MyTinySTL
A tiny STL implemented in C++11
onnx-tool
Shape inference and MACs (FLOPs) counting for ONNX models.
PINN-1
Simple PyTorch Implementation of Physics Informed Neural Network (PINN)
PyChip-py-hcl
A Hardware Construct Language
PyTorch-YOLOv3-ModelArts
Deploy the PyTorch version of YOLOv3 on the Huawei Cloud ModelArts platform, covering training, online prediction, and competition submission.
stonne
STONNE: A Simulation Tool for Neural Networks Engines
YHs_Sample
Yinghan's Code Sample