JonyKai

JonyKai's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 129677 | Issues: 1120 | Issues: 15284

bert

TensorFlow code and pre-trained models for BERT

Language: Python | License: Apache-2.0 | Stargazers: 37537 | Issues: 996 | Issues: 1142

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Language: Python | License: Apache-2.0 | Stargazers: 17959 | Issues: 185 | Issues: 730

BullshitGenerator

I needed to generate some text to test whether my GUI rendering code works, so I made this.

Language: JavaScript | License: NOASSERTION | Stargazers: 15703 | Issues: 270 | Issues: 155

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | Stargazers: 13971 | Issues: 698 | Issues: 1638

MOSS

An open-source tool-augmented conversational language model from Fudan University

Language: Python | License: Apache-2.0 | Stargazers: 11741 | Issues: 123 | Issues: 352

Chinese-Word-Vectors

100+ Chinese Word Vectors (more than a hundred kinds of pre-trained Chinese word vectors)

Language: Python | License: Apache-2.0 | Stargazers: 11705 | Issues: 287 | Issues: 166

chinese-xinhua

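📙 Chinese Xinhua Dictionary database, covering xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.

Language: Python | License: MIT | Stargazers: 10780 | Issues: 312 | Issues: 57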

doccano

Open source annotation tool for machine learning practitioners.

Language: Python | License: MIT | Stargazers: 9258 | Issues: 131 | Issues: 1513

wukong-robot

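🤖 wukong-robot is a simple, flexible, and elegant Chinese voice-dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and may also be the first open-source smart speaker project to support brain-computer interaction.

Language: Python | License: MIT | Stargazers: 6073 | Issues: 173 | Issues: 289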

ToolGood.Words

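A high-performance component for detecting and filtering sensitive words (illegal words and profanity), with Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese-character-to-pinyin conversion, fuzzy search, and other features.

Language: JavaScript | License: Apache-2.0 | Stargazers: 4628 | Issues: 104 | Issues: 99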

TNN

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts. TNN has been deployed in multiple apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to collaborate with us and make TNN a better framework.

Language: C++ | License: NOASSERTION | Stargazers: 4342 | Issues: 92 | Issues: 950

kenlm

KenLM: Faster and Smaller Language Model Queries

Language: C++ | License: NOASSERTION | Stargazers: 2458 | Issues: 69 | Issues: 366

tensorflow_template_application

TensorFlow template application for deep learning

Language: Python | License: Apache-2.0 | Stargazers: 1867 | Issues: 186 | Issues: 40

myhtml

Fast C/C++ HTML 5 Parser. Using threads.

Language: C | License: LGPL-2.1 | Stargazers: 1645 | Issues: 89 | Issues: 135

CPM-1-Generate

Chinese Pre-Trained Language Models (CPM-LM) Version-I

Language: Python | License: MIT | Stargazers: 1590 | Issues: 38 | Issues: 76

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

Language: C++ | License: NOASSERTION | Stargazers: 1455 | Issues: 41 | Issues: 118

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language: JavaScript | License: MIT | Stargazers: 1220 | Issues: 10 | Issues: 97

chaizi

Chinese character decomposition dictionary

corpus

Corpora for natural language processing and knowledge graphs, organized by task. PRs welcome.

Language: Python | Stargazers: 706 | Issues: 20 | Issues: 0

OpenAttack

An Open-Source Package for Textual Adversarial Attack.

Language: Python | License: MIT | Stargazers: 662 | Issues: 18 | Issues: 78

classifier-multi-label

Multi-label text classification, multi-label classification, text classification, multi-label, classifier, BERT, seq2seq, attention, multi-label-classification

sccl

PyTorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021

Language: Python | License: MIT-0 | Stargazers: 287 | Issues: 6 | Issues: 29

hanzi_char_featurizer

A Chinese character feature extractor (featurizer), which extracts the features of Chinese characters (pronunciation features, glyph features) for use as deep learning features

Language: Python | License: Apache-2.0 | Stargazers: 282 | Issues: 7 | Issues: 6

NLPDataAugmentation

Chinese NLP Data Augmentation, BERT Contextual Augmentation

NER

Implementation of BiLSTM-CRF in TF2.0 for Named Entity Recognition (NER)

Language: Jupyter Notebook | License: MIT | Stargazers: 17 | Issues: 2 | Issues: 1
License: GPL-3.0 | Stargazers: 11 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 8 | Issues: 1 | Issues: 0

ChatGPT-Customized

One-click deployment of a well-designed ChatGPT web UI on Vercel. Get your own ChatGPT web service with one click.

Language: TypeScript | License: NOASSERTION | Stargazers: 2 | Issues: 0 | Issues: 0

variant-Chinese-words

A display system built on node-webkit to show Chinese word detection

Language: JavaScript | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0