Tae Yoon Kim (Ted)'s repositories
DHS-LLM-Workshop
DHS 2023 LLM Workshop by Sourab Mangrulkar
transformer_framework
framework for plug and play of various transformers (vision and nlp) with FSDP
Research
novel deep learning research works with PaddlePaddle
Korean-CommonGen
[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
LEAP_NLI_v2.0
Dataset for Korean Legal Inference
transformers
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
NaverCafeFreePass
Naver Cafe Free Pass Web Browser Extension
NAMPreprogressing
National Assembly Minutes Preprogressing
micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
ARDM
Alternate Recurrent Dialog Model
LSH
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
TedWebCrawler
'ν λ'μμ λ°μ΄ν° λΆμμ νμν μλ£λ€μ ν¬λ‘€λ§ ν μ μλ νμ΄μ¬ νλ‘μ νΈμ λλ€. μμ΄, νκ΅μ΄ μ§μ κ°λ₯ν©λλ€.
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
python-sdk
:snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud
dstc8-schema-guided-dialogue
Schema-Guided Dialogue State Tracking
cognitive-research-technologies-docs
Documentation related to Microsoft Cognitive Research Technologies
Taskmaster
Please see the readme file as well as our 2019 EMNLP paper linked here -->
sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
mag
A lightweight python library that helps to keep track of numerical experiments
PRML
PRML algorithms implemented in Python
polyai-models
Neural Models for Conversational AI
KoBERT
Korean BERT pre-trained cased (KoBERT)
SUMBT
SUMBT: Slot-Utterance Matching for Universal and Scalable Belief Tracking (ACL 2019)