Dongsub Shim (DongsubShim)

Company: University of Toronto

Location: Toronto

Dongsub Shim's starred repositories

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4380 | Issues: 0

Humback

🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.

Language: Python | License: Apache-2.0 | Stargazers: 127 | Issues: 0

airoboros

Customizable implementation of the self-instruct paper.

Language: Python | License: Apache-2.0 | Stargazers: 992 | Issues: 0

WeightWatcher

The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

Language: Python | License: Apache-2.0 | Stargazers: 1426 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13065 | Issues: 0

pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Language: Python | License: Apache-2.0 | Stargazers: 580 | Issues: 0

causal-text-papers

Curated research at the intersection of causal inference and natural language processing.

Stargazers: 770 | Issues: 0

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 | Stargazers: 7330 | Issues: 0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stargazers: 15548 | Issues: 0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language: Python | License: Apache-2.0 | Stargazers: 4493 | Issues: 0

LLMZoo

⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡

Language: Python | License: Apache-2.0 | Stargazers: 2909 | Issues: 0

LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 1028 | Issues: 0

BFP

PyTorch implementation of "Preserving Linear Separability in Continual Learning by Backward Feature Projection" (CVPR 2023)

Language: Python | Stargazers: 18 | Issues: 0

awesome-totally-open-chatgpt

A list of totally open alternatives to ChatGPT

License: CC0-1.0 | Stargazers: 4474 | Issues: 0

Task-Oriented-Dialogue-Research-Progress-Survey

A survey of datasets and methods for task-oriented dialogue, including recent datasets and SOTA leaderboards.

Stargazers: 1241 | Issues: 0

toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Language: Python | License: MIT | Stargazers: 1936 | Issues: 0

causallib

A Python package for modular causal inference analysis and model evaluations

Language: Python | License: Apache-2.0 | Stargazers: 703 | Issues: 0

EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 3710 | Issues: 0

dowhy

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.

Language: Python | License: MIT | Stargazers: 6965 | Issues: 0

COMET

A Neural Framework for MT Evaluation

Language: Python | License: Apache-2.0 | Stargazers: 469 | Issues: 0

prism

MT Evaluation in Many Languages via Zero-Shot Paraphrasing

Language: Python | License: NOASSERTION | Stargazers: 100 | Issues: 0

MT-Evaluation

Machine Translation (MT) Evaluation Scripts

Language: Python | Stargazers: 16 | Issues: 0

Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

Stargazers: 557 | Issues: 0

MT-Reading-List

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

Language: TeX | License: BSD-3-Clause | Stargazers: 2420 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 1441 | Issues: 0

promptsource

Toolkit for creating, sharing and using natural language prompts.

Language: Python | License: Apache-2.0 | Stargazers: 2628 | Issues: 0

Diff-SCM

Code for Diff-SCM paper

Language: Python | License: Apache-2.0 | Stargazers: 86 | Issues: 0

CLS-ER

The official PyTorch code for the ICLR'22 paper "Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System"

Language: Python | License: MIT | Stargazers: 44 | Issues: 0

Language: Python | Stargazers: 60 | Issues: 0

Language: Python | Stargazers: 23 | Issues: 0