yuye2133

followers

following

stars

yuye2133's repositories

Abstractive-Text-Summarization

Contrastive Attention Mechanism for Abstractive Text Summarization

Language:PythonNOASSERTION000

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Language:PythonGPL-3.0000

bazel

a fast, scalable, multi-language and extensible build system

Language:JavaApache-2.0000

bert

TensorFlow code and pre-trained models for BERT

Language:PythonApache-2.0000

blog_others

Language:Jupyter Notebook000

Chinese-NewWordRecognition

专业领域词库构建/中文新词发现/专业词库发现

000

chinese-poetry

最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。

Language:PythonMIT000

ChineseNER

中文命名实体识别，实体抽取，tensorflow，pytorch，BiLSTM+CRF

Language:Python000

chip2018

chip2018

Language:Python000

CHIP2018-1

CHIP2018问句匹配大赛 Rank6解决方案

Language:Python000

chip2018_task2_question_pairs_matching

CHIP2018评测任务2，平安医疗科技智能患者健康咨询问句匹配大赛baseline，BiLSTM+特征工程计算相似性，10折交叉验证平均投票做bagging，F1值0.83左右，rank16。

Language:Python000

Closer

2nd place solution to CIKM AnalytiCup 2018, determining the short-text semantic similarity.

Language:Python000

conlleval

conlleval in Python (script for chunking/NER evaluation)

000

CONLP

一个自然语言处理初学者可以参考的库，包含分词，词性标注，命名实体识别，依存句法分析大多模型和算法都是自己实现。a natural language processing library for beginners

Language:Java000

DSSM-Lookalike

000

EGPapers

事件知识图谱构建相关的论文, 包含事件抽取、事件关系识别等任务

000

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

NOASSERTION000

Fuck-XueXiQiangGuo

学习强国懒人刷分工具自动学习

000

gpt-3

GPT-3: Language Models are Few-Shot Learners

000

helloworld

first project in github

Language:Python000

maccms10

苹果cms-v10,maccms-v10,开源CMS,内容管理系统,视频分享程序,分集剧情程序,网址导航程序,新闻程序,漫画程序,图片程序

Apache-2.0000

nl2sql_baseline

Language:PythonBSD-3-Clause000

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

MIT000

nlp_corpus

本人项目进行中搜集的数据集，包含原始数据和经过处理后的数据，项目持续更新。

000

NLPGNN

1. Use BERT, ALBERT and GPT2 as tensorflow2.0's layer. 2. Implement GCN, GAN, GIN and GraphSAGE based on message passing.

MIT000

Pre-modern_Chinese_corpus_dataset

一个近代汉语语料库数据集 This is a pre-modern Chinese ( From Song dynasty in 10th century AD to Republic of China in the early 20th Century ) language corpus.These language resources are all txt format,arranged by Dynasty（Song,Yuan,Ming,Early-Qing,Late-Qing and Republic of China）.The relevant authors' information and types of literature also have been labelled.

000

pycorrector

pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.

Language:PythonApache-2.0000

tensorflow_poems

中文古诗自动作诗机器人，屌炸天，基于tensorflow1.10 api，正在积极维护升级中，快star，保持更新！

Language:Python000

TextMatch

基于Pytorch的，中文语义相似度匹配模型（ABCNN、Albert、Bert、BIMPM、DecomposableAttention、DistilBert、ESIM、RE2、Roberta、SiaGRU、XlNet）

000

WikiSQL

A large annotated semantic parsing corpus for developing natural language interfaces.

Language:HTMLBSD-3-Clause000