Kai Zhang's repositories
Web-Mining
text tokenization, part of speech, named entity recognition, vector space model, word embedding, text classification/clustering, sentiment mining, topic modeling, and application of deep learning in text analytics.
graph-adversarial-learning-literature
A curated list of adversarial attacks and defenses papers on graph-structured data.
AI-Product-Index
A curated index to track AI-powered products.
biomedical
Tools for curating biomedical training data for large-scale language modeling
DataCompression
Data compression of English text using the compressed tries data structure.
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
FedML
FedML - The federated and distributed machine learning library enabling machine learning anywhere at any scale. It's backed by FedML, Inc (https://FedML.ai). Supporting large-scale geo-distributed training, cross-device federated learning on smartphones/IoTs, cross-silo federated learning on data silos, and research simulation. Best Paper Award at NeurIPS 2020 Federated Learning workshop. FedML’s core technology is backed by years of cutting-edge research represented in 50+ publications in ML/FL Algorithms, Security/Privacy, Systems, and Applications, as well as 10 years of industrial experience in Distributed Systems, Cloud Computing, and Mobile/IoT Systems.
ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文总结+润色+审稿+审稿回复
fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Large-Language-Model-Notebooks-Course
Practical course about Large Language Models.
Medical-Question-Understanding
Repository for 3 papers on Summarization and Entailment for Medical User-Generated Questions.
mimic-code
MIMIC Code Repository: Code shared by the research community for the MIMIC-III database
oasis-scripts
Example download scripts for the OASIS3 project
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
poetry
最全的汉语现代诗歌语料库整理,2K+诗人,42K+诗歌,8M+字,包括五四至今的所有流派。持续扩充...
pytorch-frame
Tabular Deep Learning Library for PyTorch
rayeren.github.io
My personal homepage
speechless
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
UnifiedSKG
[EMNLP 2022] A Unified Framework and Analysis for Structured Knowledge Grounding with Text-to-Text Language Models
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch