Sehyun Choi (syncdoth)

syncdoth

Geek Repo

Location:South Korea | Hong Kong

Home Page:syncdoth.github.io

Twitter:@schoiaj

Github PK Tool:Github PK Tool

Sehyun Choi's starred repositories

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language:PythonLicense:MITStargazers:411Issues:0Issues:0

Knowledge-Constrained-Decoding

Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection"

Language:PythonStargazers:26Issues:0Issues:0

yet-another-retnet

A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (https://arxiv.org/pdf/2307.08621.pdf)

Language:PythonLicense:MITStargazers:101Issues:0Issues:0

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:PythonStargazers:9130Issues:0Issues:0

llm-reasoners

A library for advanced large language model reasoning

Language:PythonLicense:Apache-2.0Stargazers:1042Issues:0Issues:0

RetNet

Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.

Language:Jupyter NotebookLicense:MITStargazers:224Issues:0Issues:0

deep-thinking

A centralized place for deep thinking code and experiments

Language:PythonLicense:MITStargazers:69Issues:0Issues:0

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonLicense:MITStargazers:4460Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:72Issues:0Issues:0

flacuna

Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is already an excellent writing assistant, and the intention behind Flacuna was to enhance Vicuna's problem-solving capabilities. To achieve this, we curated a dedicated instruction dataset called Flan-mini.

Language:PythonStargazers:109Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1933Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4410Issues:0Issues:0

CAR

Code for the EMNLP2023 Findings paper: CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering (https://aclanthology.org/2023.findings-emnlp.902.pdf).

Language:PythonStargazers:7Issues:0Issues:0

ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Language:PythonLicense:NOASSERTIONStargazers:2769Issues:0Issues:0

KID

Knowledge Infused Decoding

Language:PythonLicense:MITStargazers:71Issues:0Issues:0

korean-safety-benchmarks

Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)

Language:PythonLicense:MITStargazers:232Issues:0Issues:0

MFMA

Factual consistency checking model for abstractive summaries (NAACL-22 Findings)

Language:PythonStargazers:28Issues:0Issues:0

diff-svc

Singing Voice Conversion via diffusion model

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:2610Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:34100Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12617Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32482Issues:0Issues:0

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:862Issues:0Issues:0

fed

Code for SIGdial 2020 paper: Unsupervised Evaluation of Interactive Dialog with DialoGPT (https://arxiv.org/abs/2006.12719)

Language:PythonStargazers:27Issues:0Issues:0

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9783Issues:0Issues:0

Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language:PythonLicense:MITStargazers:192Issues:0Issues:0
Language:PythonStargazers:72Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11397Issues:0Issues:0

COLA

Official code repository for the main conference paper in ACL2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal Inference Perspective

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

OpenAlpaca

OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA

Language:PythonLicense:Apache-2.0Stargazers:302Issues:0Issues:0

awesome-mlops

A curated list of references for MLOps

Stargazers:12345Issues:0Issues:0