송영숙 (songys)

songys

Geek Repo

Company:Sionic AI Inc.

Home Page:https://sites.google.com/view/youngsooksong

Github PK Tool:Github PK Tool


Organizations
KLUE-benchmark
korean-named-entity
ModuASR

송영숙's repositories

AwesomeKorean_Data

한국어 데이터 세트 링크

Chatbot_data

Chatbot_data_for_Korean

entity

날짜, 장소, 사람, 기관, 시간

single_turn_dialogue

사전에서 대화 예문만 추출한 데이터

2020LangconOnOff

자연어 처리 데이터에게 길을 묻다.

Language:RubyStargazers:9Issues:1Issues:0

Awesome_GhatGPT_News

유용한 ChatGPT 블로그 글들 모음

ToxicCiD

Toxic Comment in Dictionary

License:CC0-1.0Stargazers:5Issues:1Issues:0

ConceptSpeechMood

단어 집합과 화행을 이용한 gpt-3.5-turbo 모델 생성 결과 품질 통제(Quality Control) 데이터 세트

Language:Jupyter NotebookStargazers:4Issues:2Issues:0
Language:RubyStargazers:2Issues:3Issues:0

Awosome_KOITblog

한국어 기반의 기술 블로그

Stargazers:2Issues:0Issues:0

parsing_json

모두의 말뭉치 파싱 코드 예시

Language:Jupyter NotebookStargazers:2Issues:1Issues:0
Language:HTMLStargazers:1Issues:1Issues:0

CodeMixed-Text-Generator

This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.

License:MITStargazers:1Issues:0Issues:0
Stargazers:1Issues:0Issues:0
License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

DAKSA-Domain_Adaptation_in_Korean_Speech_Act

Cross-Domain Speech Act Adaptation and Analysis

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Ko-ATOMIC

Korean Commonsense Knowledge Graph

License:MITStargazers:0Issues:0Issues:0

KoChatGPT

ChatGPT의 RLHF를 학습을 위한 3가지 step별 한국어 데이터셋

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Korean-CommonGen

[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

koSpeechAct

#Generate natural language sentences that reflect speech acts

Stargazers:0Issues:1Issues:0

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

License:MITStargazers:0Issues:0Issues:0
License:CC0-1.0Stargazers:0Issues:1Issues:0

NeurIPS-2022-Submission-3358

This is the code for the Submission 3358 at NeurIPS 2022.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

project-dialogism-novel-corpus

The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:SCSSLicense:NOASSERTIONStargazers:0Issues:1Issues:0

UnethicalQuestionsKor

ethicalVsUnethicalQuestionsKor로 데이터 증강 필요

Stargazers:0Issues:0Issues:0

XSum

Topic-Aware Convolutional Neural Networks for Extreme Summarization

License:MITStargazers:0Issues:0Issues:0