UofT-EcoSystem

UofT-EcoSystem

Geek Repo

Github PK Tool:Github PK Tool

UofT-EcoSystem's repositories

CSCD70

CSCD70 Compiler Optimization

Language:C++Stargazers:228Issues:8Issues:0

Minuet

[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs

Language:CudaLicense:Apache-2.0Stargazers:57Issues:2Issues:1

DietCode

DietCode Code Release

Language:CudaStargazers:56Issues:10Issues:0

rlscope

RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads

Language:PythonLicense:Apache-2.0Stargazers:37Issues:23Issues:1

hfta

Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion

Language:PythonLicense:MITStargazers:32Issues:6Issues:19
Language:PythonLicense:Apache-2.0Stargazers:28Issues:6Issues:3

BPPSA-open

The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".

Language:PythonLicense:MITStargazers:13Issues:4Issues:0

Tempo

Memory footprint reduction for transformer models

Language:PythonStargazers:12Issues:3Issues:0

Grape-MICRO56-Artifact

This repository contains the source code for Grape.

Language:PythonStargazers:5Issues:4Issues:0

MXNet-GPU_Memory_Profiler

Benchmarking using MXNet GPU Memory Profiler

MoIL

MoIL: Enabling Efficient Incremental Training on Edge Devices

License:Apache-2.0Stargazers:2Issues:3Issues:0

skyline

🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:4Issues:0

incubator-mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Language:C++License:Apache-2.0Stargazers:0Issues:4Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

cache-trace

A collection of Twitter's anonymized production cache traces.

Language:ShellLicense:CC-BY-4.0Stargazers:0Issues:2Issues:0
Language:ShellStargazers:0Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:1

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

rlscope_agents

Fork of https://github.com/tensorflow/agents with RL-Scope annotations added.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0

rlscope_mlperf_training

Fork of https://github.com/mlperf/training with RL-Scope annotations added.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0

rlscope_ReAgent

Fork of https://github.com/facebookresearch/ReAgent with RL-Scope annotations added.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:3Issues:0

rlscope_rl-baselines-zoo

Fork of https://github.com/araffin/rl-baselines-zoo with RL-Scope annotations added.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

rlscope_stable-baselines

Fork of https://github.com/hill-a/stable-baselines with RL-Scope annotations added.

Language:PythonLicense:MITStargazers:0Issues:3Issues:0
Stargazers:0Issues:0Issues:0

TensorComprehensions

A domain specific language to express machine learning workloads.

Language:C++License:Apache-2.0Stargazers:0Issues:3Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:0Issues:3Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0