Yang Xu's starred repositories
chinese-dictionary
中文汉语拼音辞典,汉字拼音字典,词典,成语词典,常用字、多音字字典数据库
surprisal-across-languages
Code to calculate surprisal values from multilingual XGLM models.
minbert-default-final-project
CS 224N Winter 2023 Default Final Project: Multitask BERT
statsmodels
Statsmodels: statistical modeling and econometrics in Python
nanoGPT-LoRA
The simplest, fastest repository for training/finetuning medium-sized GPTs with LoRA support.
neural-ngram
Neural ngram language model in PyTorch.
fast-detect-gpt
Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".
a-PyTorch-Tutorial-to-Sequence-Labeling
Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling
DeepLearningForNLPInPytorch
An IPython Notebook tutorial on deep learning for natural language processing, including structure prediction.
cc-visualize
既适合程序员,也适合中文电子文字整编人员(in beta)。汉字繁、简、异、兼、笔、变等关联关系可视化。非寻常汉字字符、同形字符攻击、不可打印字符等检视工具。结合OpenCC、Unicode等数据 | Chinese characters relations or vatiants (simplified, traditional etc) visualization. Potential Unihan/UCD homograph/punycode attack/phishing, non-printable invisible characters inspector
vert-cjk-web
(in alpha) 网页竖排。右起縱書。像古代一样。Make webs vertical lined layout, like traditional CJK writing method in east asian culture circle.(招日韩蒙越翻译)
cs224n-win2223
Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2022/2023
GitHub-Chinese-Top-Charts
:cn: GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。
pytorch_ner
LSTM based model for Named Entity Recognition Task using pytorch and GloVe embeddings
Awesome-Chinese-NLP
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
NSFC-application-template-latex
国家自然科学基金申请书正文(面上项目)LaTeX 模板(非官方)
Dan-Jurafsky--Chris-Manning--NLP
My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
nlp-cky-PCFG
This repository contains an implementation of the CKY parsing for English. (NLP)
Classical-Modern
非常全的文言文(古文)-现代文平行语料
alg_design
Java implementations of algorithms and structures from "Algorithm Design" by Kleinberg and Tardos.
chinese-xinhua
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
code-switching-papers
A curated list of research papers and resources on code-switching