Sheng Shen (sIncerass)

sIncerass

Geek Repo

Company:University of California, Berkeley

Location:Berkeley, CA

Home Page:https://sincerass.github.io/

Twitter:@shengs1123

Github PK Tool:Github PK Tool

Sheng Shen's repositories

powernorm

[ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845

Language:PythonLicense:GPL-3.0Stargazers:119Issues:8Issues:15

MVLPT

code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720

Language:PythonLicense:MITStargazers:51Issues:2Issues:6

prag_generation

[NAACL 2019] code for "Pragmatically Informative Text Generation" https://arxiv.org/abs/1904.01301

ELSA

[WWW 2019] code for "Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification" https://arxiv.org/abs/1806.02557

Language:PythonLicense:MITStargazers:30Issues:4Issues:3

one_layer_lottery_ticket

[EMNLP 2021] code for "Whatā€™s Hidden in a One-layer Randomly Weighted Transformer?"

Language:PythonLicense:MITStargazers:8Issues:2Issues:1
Language:PythonStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:1Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

few-shot-learning

Few-shot Learning of GPT-3

Language:PythonStargazers:0Issues:1Issues:0

google-drive-downloader

Minimal class to download shared files from Google Drive.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

java-nlp-toolkit

My personal Java NLP toolkit that serves as an interface to various existing NLP libraries.

Language:JavaStargazers:0Issues:1Issues:0

Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

nums

A library that translates Python and NumPy to optimized distributed systems code.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

PreSumm

code for EMNLP 2019 paper Text Summarization with Pretrained Encoders

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

promptsource

Toolkit for collecting and applying templates of prompting instances

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:C++Stargazers:0Issues:1Issues:0

sincerass.github.io

Sheng (Arnold) Shen's homepage

Language:SCSSStargazers:0Issues:1Issues:0

sockeye

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:1Issues:0

transformers

šŸ¤— Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0