Darral's repositories
sentence-similarity
对四种句子/文本相似度计算方法进行实验与比较
text_rnn_attention
嵌入Word2vec词向量的RNN+ATTENTION中文文本分类
text_bert_cnn
在bert模型的pre_training基础上进行text_cnn文本分类
find-Chinese-medical-words
发现新词 无监督词库生成 医学词库生成 发现未登录词
LLM-RAG-QA
LLM+RAG for QA
fast_adversarial_for_text_classification
基于TextCNN,测试三种对抗训练模型(FGSM,PGD,FREE)在text classification上的表现
modeling-data-imbalance-with-different-losses
compare the performance of cross entropy, focal loss, and dice loss in solving the problem of data imbalance
generate_question
generate question
sentencepiece-text-classification
use sentencepiece instead of word segmentation for text classification
sentence_representation
some tricks for sentence representation
crawl_examples
关于crawl的一些例子
ACNet
Attribute-driven-Capsule-Network-for-Entity-Relation-Prediction
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
the-most-complete-dictionary-ever
The most complete Chinese dictionaries ever. 史上最全的中文分类词库,包含地理信息、电子游戏、工程应用、农林牧渔、人文科学、社会科学、生活百科、医学医药、艺术设计、娱乐休闲、运动休闲、自然科学等12大类的超级字典。