syncdoth

Sehyun Choi's starred repositories

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language: Python | License: MIT | 41100

Knowledge-Constrained-Decoding

Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection"

Language: Python | 2600

yet-another-retnet

A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (https://arxiv.org/pdf/2307.08621.pdf)

Language: Python | License: MIT | 10100

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Language: Python | 913000

llm-reasoners

A library for advanced large language model reasoning

Language: Python | License: Apache-2.0 | 104200

RetNet

Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.

Language: Jupyter Notebook | License: MIT | 22400

deep-thinking

A centralized place for deep thinking code and experiments

Language: Python | License: MIT | 6900

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language: Python | License: MIT | 446000

Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is already an excellent writing assistant, and the intention behind Flacuna was to enhance Vicuna's problem-solving capabilities. To achieve this, we curated a dedicated instruction dataset called Flan-mini.

Language: Python | 10900

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | 193300

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | 441000

CAR

Code for the EMNLP2023 Findings paper: CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering (https://aclanthology.org/2023.findings-emnlp.902.pdf).

Language: Python | 700

ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Language: Python | License: NOASSERTION | 276900

KID

Knowledge Infused Decoding

Language: Python | License: MIT | 7100

korean-safety-benchmarks

Official datasets and PyTorch implementation repository of SQuARe and KoSBi (ACL 2023)

Language: Python | License: MIT | 23200

MFMA

Factual consistency checking model for abstractive summaries (NAACL-22 Findings)

Language: Python | 2800

diff-svc

Singing Voice Conversion via diffusion model

Language: Jupyter Notebook | License: AGPL-3.0 | 261000

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | 3410000

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | 1261700

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 3248200

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Language: Jupyter Notebook | License: NOASSERTION | 86200

fed

Code for SIGdial 2020 paper: Unsupervised Evaluation of Interactive Dialog with DialoGPT (https://arxiv.org/abs/2006.12719)

Language: Python | 2700

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | License: MIT | 978300

Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language: Python | License: MIT | 19200

KnowledGPT

Language: Python | 7200

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | 1139700

COLA

Official code repository for the main conference paper in ACL2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal Inference Perspective

Language: Python | License: MIT | 2400

OpenAlpaca

OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA

Language: Python | License: Apache-2.0 | 30200

awesome-mlops

A curated list of references for MLOps

1234500