Tom Young (tomyoung903)

tomyoung903

Geek Repo

Company:NTU

Location:50 Nanyang Ave, 639798

Home Page:https://tomyoung903.github.io/

Github PK Tool:Github PK Tool

Tom Young's starred repositories

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54856Issues:518Issues:947

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38441Issues:384Issues:1627

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:29410Issues:278Issues:1212

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21053Issues:179Issues:424

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:7659Issues:143Issues:46

GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Language:PythonLicense:Apache-2.0Stargazers:7651Issues:98Issues:198

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:6043Issues:35Issues:980

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3455Issues:24Issues:439

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1923Issues:19Issues:78

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

veScale

A PyTorch Native LLM Training Framework

Language:PythonLicense:Apache-2.0Stargazers:521Issues:34Issues:7

instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Language:PythonLicense:Apache-2.0Stargazers:494Issues:13Issues:28

ConvLab

DSTC8 Track 1 Task 1 End-to-End Multi-Domain Dialog Challenge Result:

Language:PythonLicense:MITStargazers:400Issues:24Issues:58

InfoBatch

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Dataset_Quantization

[ICCV2023] Dataset Quantization

ccm

This project is a tensorflow implement of our work, CCM (Commonsense Conversational Model).

Language:PythonLicense:Apache-2.0Stargazers:219Issues:16Issues:12

GLMP

PyTorch code for ICLR 2019 paper: Global-to-local Memory Pointer Networks for Task-Oriented Dialogue https://arxiv.org/pdf/1901.04713

thesis_template_ntu

Thesis Latex Template for Nanyang Technological University (NTU)

Language:TeXLicense:MITStargazers:140Issues:3Issues:7

SpeeD

SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

Language:PythonLicense:Apache-2.0Stargazers:134Issues:8Issues:5

DREAM

Efficient Dataset Distillation by Representative Matching

MultiWOZ_Evaluation

Unified MultiWOZ evaluation scripts for the context-to-response task.

Language:PythonLicense:MITStargazers:54Issues:5Issues:6

mixmatch

Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models

FusedChat

FusedChat is a dialogue dataset. It contains dialogue sessions fusing task-oriented dialogues and open-domain dialogues.

Language:PythonLicense:MITStargazers:28Issues:2Issues:6

experiments

My exploration on new technologies.

Language:PythonLicense:MITStargazers:8Issues:1Issues:0

MLM_inconsistencies

Inconsistencies in Masked Language Models

Language:PythonStargazers:6Issues:1Issues:0