Beast code in Giters

steve's repositories

SpeechGPT

SpeechGPT Series: Speech Large Language Models

Apache-2.0000

IS2024_stream_decoder_only_asr

000

SCM4LLMs

Self-Controlled Memory System for LLMs

MIT000

fairseq-apollo

FairSeq repo with Apollo optimizer

MIT000

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

NOASSERTION000

fast-transformers

Pytorch library for fast transformer implementations

000

pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

MIT000

adahessian

ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning

MIT000

tart

Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.

NOASSERTION000

flash-attention

Fast and memory-efficient exact attention

BSD-3-Clause000

gdfm_nips22

code of Generalized Delayed Feedback Model with Post-Click Information in Recommender Systems, NeurIPS 2022

NOASSERTION000

NM-sparsity

000

bolt

10x faster matrix and vector operations

MPL-2.0000

dpro

Analysis for the traces from byteprofile

000

CLSR

The official implementation of "Disentangling Long and Short-Term Interests for Recommendation" (WWW '22)

MIT000

frnet

The Source Code of FRNet

000

byteps

A high performance and generic framework for distributed DNN training

NOASSERTION000

p4app-switchML

Switch ML Application

Apache-2.0000

DEFUSE

code of our WWW 2022 paper Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction

000

ps-lite

A lightweight parameter server interface

Apache-2.0000

DensePhrases

ACL'2021: Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too

Apache-2.0000

robust-aggregate-lfs

Source code of our ACL 2022 paper 'Learning to robustly aggregate labeling functions for semi-supervised data programming'

000

t-few

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

MIT000

ASTRA

Self-training with Weak Supervision (NAACL 2021)

MIT000

CHMM-ALT

Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"

Apache-2.0000

wrench

WRENCH: Weak supeRvision bENCHmark

Apache-2.0000

automatic-rule-induction

MIT000

KnowledgeablePromptTuning

kpt code

000

CALM-Dialogue

000

WWW-22-DIHN

[WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation

000