heshandevaka

heshandevaka

Geek Repo

Github PK Tool:Github PK Tool

heshandevaka's repositories

Trade-Off-MOL

Experiments on trade-off among optimization, generalization and conflict aversion in multi-objective learning (MOL), and introducing MoDo.

Language:PythonLicense:MITStargazers:9Issues:0Issues:0

MoCo

This repository contains the codebase used to generate the main results of "Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach, which has been accepted to ICLR 2023."

Language:PythonStargazers:3Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

FedML

A Research-oriented Federated Learning Library. Supporting distributed computing, mobile/IoT on-device training, and standalone simulation. Best Paper Award at NeurIPS 2020 Federated Learning workshop. Join our Slack Community:(https://join.slack.com/t/fedml/shared_invite/zt-havwx1ee-a1xfOUrATNfc9DFqU~r34w)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ML-From-Scratch

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

MoCo-plus

Code for implementing MoCo+ (ICASSP 2024)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pareto-hypernetworks

Official implementation of Learning The Pareto Front With HyperNetworks [ICLR 2021]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PCGrad

Code for "Gradient Surgery for Multi-Task Learning"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Stargazers:0Issues:0Issues:0

RL-Adventure-2

PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay

Stargazers:0Issues:0Issues:0
Language:HTMLLicense:GPL-3.0Stargazers:0Issues:0Issues:0