sungjinl

sungjinl

Geek Repo

Github PK Tool:Github PK Tool

sungjinl's repositories

agile

Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.

Language:PythonStargazers:0Issues:0Issues:0

brain_agent

Brain Agent for Large-Scale and Multi-Task Agent Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0

captum

Model interpretability and understanding for PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

COBS

OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.

Stargazers:0Issues:0Issues:0

CommaQA

Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents

License:Apache-2.0Stargazers:0Issues:0Issues:0

d3rlpy

An offline deep reinforcement learning library

License:MITStargazers:0Issues:0Issues:0

DeepDPM

"DeepDPM: Deep Clustering With An Unknown Number of Clusters" [CVPR 2022]

License:MITStargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dialogue-meaning-representation

Data and code for the paper "Dialogue Meaning Representation for Task-Oriented Dialogue Systems".

License:NOASSERTIONStargazers:0Issues:0Issues:0

dialogue-reinforce

Training chatbot models with reinforcement learning in ParlAI.

License:MITStargazers:0Issues:0Issues:0

dowhy

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.

License:MITStargazers:0Issues:0Issues:0

ds2

Code for DS2 paper

Stargazers:0Issues:0Issues:0

industrialbenchmark

Industrial Benchmark

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

language

Shared repository for open-sourced projects from the Google AI Language team.

License:Apache-2.0Stargazers:0Issues:0Issues:0

learning-scaffold

This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

Megatron-LM

Ongoing research training transformer models at scale

License:NOASSERTIONStargazers:0Issues:0Issues:0

MOML

Source Code of "Multi-Objective Meta Learning" [NeurIPS 2021]

Stargazers:0Issues:0Issues:0

naturalcc

NaturalCC: An Open-Source Toolkit for Code Intelligence

License:MITStargazers:0Issues:0Issues:0

neural_chat

Code to support training, evaluating and interacting neural network dialog models, and training them with reinforcement learning. Code to deploy a web server which hosts the models live online is available at: https://github.com/asmadotgh/neural_chat_web

License:MITStargazers:0Issues:0Issues:0

NLIWOD

Collection of tools, utilities, datasets and approaches towards realising natural language interfaces for the Web of Data.

License:AGPL-3.0Stargazers:0Issues:0Issues:0

OpenCSR

Code Repo for "Differentiable Open-Ended Commonsense Reasoning" (NAACL 2021)

License:MITStargazers:0Issues:0Issues:0

ReAgent

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

t-few

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

License:MITStargazers:0Issues:0Issues:0

task_oriented_dialogue_as_dataflow_synthesis

Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020).

License:MITStargazers:0Issues:0Issues:0

Tk-Instruct

Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.

License:MITStargazers:0Issues:0Issues:0

trans-encoder

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

License:Apache-2.0Stargazers:0Issues:0Issues:0

unas

Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral

License:NOASSERTIONStargazers:0Issues:0Issues:0