Balzac's repositories
Bert-Chinese-Text-Classification-Pytorch
使用Bert,ERNIE,进行中文文本分类
Bert-Multi-Label-Text-Classification
This repo contains a PyTorch implementation of a pretrained BERT model for multi-label text classification.
BERT-whitening-pytorch
Pytorch version of BERT-whitening
bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
Chinese-ELECTRA
Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)
FLASHQuad_pytorch
FLASHQuad_pytorch
interview-questions
Interview Questions for Google, Amazon, Apple, etc. 根据超过 2000 篇真实面经整理的腾讯,阿里,字节跳动,Shopee,美团,滴滴高频面试题
lama-cleaner
Image inpainting tool powered by SOTA AI Model
MarkovJunior
Probabilistic PL based on pattern matching and constraint propagation, 148 examples
million_game_server
《百万在线:大型游戏服务端开发》是C++与Lua方向游戏服务端开发的入门书籍,内容涵盖Skynet引擎、C++底层开发、服务端架构设计等多个方面,全景展现网络游戏核心技术。
MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
PaddleNLP
Easy-to-use and powerful NLP library with Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including Neural Search, Question Answering, Information Extraction and Sentiment Analysis end-to-end system.
reverse-interview-zh
技术面试最后反问面试官的话
roformer-sim
SimBERT升级版(SimBERTv2)!
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
some_ml_code
一些深度学习的代码
transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF etc.
Warp
Warp is a blazingly-fast modern Rust based GPU-accelerated terminal built to make you and your team more productive.
XAgent
An Autonomous LLM Agent for Complex Task Solving