Sijun He's repositories
DataMiningMOOC
Stanford CS341: Projects for Mining Massive Dataset, MOOC
twitter-ner
Named Entity Recognition on Twitter data
blog
Public repo for HF blog posts
document_classification
Simple command-line scripts for document classification
ERNIE-Bot-SDK
The ERNIE Bot Python library provides convenient access to the ERNIE Bot API.
hub-docs
Frontend components, documentation and information hosted on the Hugging Face website.
langchain
⚡ Building applications with LLMs through composability ⚡
MiduCTC-competition
文本智能校对大赛(Chinese Text Correction)的baseline
models
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
nanoMoE
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
PaddleSlim
PaddleSlim is an open-source library for deep model compression and architecture search.
resume
My Chinese and English Resumes in LaTeX with Font Awesome 5
sijunhe.github.io
💎 A text first theme for Jekyll.
tensorflow
An Open Source Machine Learning Framework for Everyone
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.