anbo724's repositories
Awesome-Medical-Healthcare-Dataset-For-LLM
A curated list of popular Datasets, Models and Papers for LLMs in Medical/Healthcare
awesome-public-datasets
A topic-centric list of HQ open datasets.
awesome-tibetan-nlp
😎 Curated list of Tibetan NLP projects
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
chatgpt-comparison-detection
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
chinese-poetry
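The most comprehensive database of classical Chinese poetry: nearly 14,000 poets from the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci (lyric poems) by 1,564 ci poets of the Song period.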
Chinese-Word-Vectors
100+ pre-trained Chinese Word Vectors
chinese-xinhua
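Chinese Xinhua Dictionary database. Includes xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters. Provides a Xinhua Dictionary API.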
cino
CINO: Pre-trained Language Models for Chinese Minority Languages
datasets
🤗 Fast, efficient, open-access datasets and evaluation metrics in PyTorch, TensorFlow, NumPy and Pandas
Freebase-to-Wikipedia
This repository contains a Freebase dump parser that extracts links to Wikipedia.
keras
Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano and TensorFlow.
knowledge_representation_pytorch
Several knowledge graph representation algorithms implemented with pytorch
MedicalGPT-zh
MedicalGPT-zh: a Chinese medical dialogue language model based on ChatGLM, fine-tuned on a high-quality instruction dataset
nihao
Relatively fine-grained Chinese word segmentation - Nihao
PaddleNLP
Easy-to-use and Fast NLP library with awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications.
pybo
🦜 NLP for Tibetan, in Python.
stemming_dictionary
Hunspell Stemming Dictionary
tensorflow-workshop
Slides and code from our TensorFlow Workshop.
textdistance
Compute distance between sequences. 30+ algorithms, pure python implementation, common interface.
TSTD
Tibetan Sentiment Tweets Dataset
uni-wx-charts
Charts component for WeChat Mini Programs