Yejin Cho (scarletcho)

scarletcho

Geek Repo

Company:University of Texas at Austin

Location:Austin, TX

Home Page:http://yejin-cho.wordpress.com/

Github PK Tool:Github PK Tool

Yejin Cho's starred repositories

openai-cookbook

Examples and guides for using the OpenAI API

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8591Issues:97Issues:1255

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language:PythonLicense:MITStargazers:5914Issues:54Issues:1654

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language:PythonLicense:Apache-2.0Stargazers:4407Issues:26Issues:83

llm

Access large language models from the command-line

Language:PythonLicense:Apache-2.0Stargazers:3668Issues:34Issues:394

contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Language:PythonLicense:MITStargazers:1189Issues:17Issues:108

KICE_slayer_AI_Korean

수능 국어 1등급에 도전하는 AI

awesome-korean-llm

Awesome list of Korean Large Language Models.

gutenberg-poetry-corpus

A corpus of poetry from Project Gutenberg

Language:Jupyter NotebookStargazers:184Issues:4Issues:3

propbank-release

The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts

propbank-frames

Lexicon of frame files used by Propbank annotation. A searchable, readable version of the latest release is here: http://propbank.github.io/v3.4.0/frames/

Language:PythonLicense:CC-BY-SA-4.0Stargazers:96Issues:14Issues:14

riveter-nlp

Package to extract connotation frames

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:77Issues:7Issues:2

SWOWEN-2018

English Small World of Words SWOWEN-2018

backpacks-flash-attn

The original Backpack Language Model implementation, a fork of FlashAttention

Language:PythonLicense:BSD-3-ClauseStargazers:62Issues:2Issues:4

FairytaleQAData

A dataset of over 10000 question and answer pairs written for storybooks.

Language:PythonLicense:Apache-2.0Stargazers:28Issues:1Issues:0

knowledge_distillation

Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).

nanoBackpackLM

The simplest repository for training medium-sized BackpackLM for cs224n

Language:Jupyter NotebookLicense:MITStargazers:21Issues:2Issues:1

SPAML

Semantic Priming Across Many Languages (PSA Proposal)

Language:HTMLLicense:MITStargazers:11Issues:4Issues:1

WAX

The respository describing a novel datasets for word association explanations

Language:PythonStargazers:10Issues:3Issues:0

candle

Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)

Language:PythonLicense:CC-BY-4.0Stargazers:9Issues:2Issues:1
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6Issues:5Issues:1
Language:PythonStargazers:5Issues:1Issues:0

features_in_context

Predict psycholoinguistic feature norms for words in context.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4Issues:5Issues:1

TopicKG

repository for NeurIPS2022

semantic-norms

Code for: Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction

Language:Jupyter NotebookLicense:MITStargazers:3Issues:1Issues:0

essentialism_in_llms

Materials for the paper "You are what you're for: Essentialist categorization in large language models" by Siying Zhang, Jingyuan She, Tobias Gerstenberg and David Rose.

Language:Jupyter NotebookStargazers:2Issues:0Issues:0

StorySettings

This repository contains the dataset described in the forthcoming "Story Settings: A Dataset" in 5th Workshop on Narrative Understanding at ACL 2023