Zhaox's starred repositories
awesome-books
📚 非常棒的程序员学习书籍大全。(📚 Great programmer learning Book Encyclopedia.)
ant-design
An enterprise-class UI design language and React UI library
BERT-CCPoem
BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry
guwen-models
GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Chinese related models and resources on the Internet.
Guwen-UNILM
本仓库是基于bert4keras实现的古文-现代文翻译模型。具体使用了基于掩码自注意力机制的UNILM(Li al., 2019)预训练模型作为翻译系统的backbone。我们首先使用了普通的中文(现代文)BERT、Roberta权重作为UNILM的初始权重以训练UNILM模型(具体在文中分别为B-UNILM以及R-UNILM)。为了更好的使UNILM模型适应古文的特性,我们尝试使用了在古文预训练模型Guwen-BERT,作为UNILM的初始权重,并且获得了最优的效果。
Chinese-ancient-poetry-text-mining
古诗词爬虫和文本挖掘,含13个朝代的3万多条诗人数据、85万多条诗词数据,包括主题聚类、相关诗词推荐、藏头诗生成、诗词翻译等算法实现
Classical-Modern
非常全的文言文(古文)-现代文平行语料
TextStyleTransfer
Style Transfer in Text
TA-seq2seq
复现论文:《Topic Aware Neural Response Generation》
Neural_Topic_Models
Implementation of topic models based on neural network approaches.
chinese-chatbot-corpus
中文公开聊天语料库
efaqa-corpus-zh
❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库
text_data_enhancement_with_LaserTagger
Modify Chinese text, modified on LaserTagger Model. 文本复述,基于lasertagger做中文文本数据增强。
nisl-fabric
Hyperledger Fabric is an enterprise-grade permissioned distributed ledger framework for developing solutions and applications. Its modular and versatile design satisfies a broad range of industry use cases. It offers a unique approach to consensus that enables performance at scale while preserving privacy.