taokz

Kai Zhang's repositories

BiomedGPT

BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks

Language:PythonApache-2.0299 23 24

FedR

Official codes for paper "Efficient Federated Learning on Knowledge Graphs via Privacy-preserving Relation Embedding Aggregation"

Language:Python22 2 3

FeDepth

Implementation of [Memory-adaptive Depth-wise Heterogenous Federated Learning]

Language:Python4 10

FuzzyFL

Implemented federated learning for binary classification (tabular data) with PyTorch. The data fuzzification technique and local differential privacy mechanism are applied to protect data privacy.

Language:PythonMIT4 10

text tokenization, part of speech, named entity recognition, vector space model, word embedding, text classification/clustering, sentiment mining, topic modeling, and application of deep learning in text analytics.

Language:Jupyter Notebook300

graph-adversarial-learning-literature

A curated list of adversarial attacks and defenses papers on graph-structured data.

100

AI-Product-Index

A curated index to track AI-powered products.

MIT000

BioGPT

Language:PythonMIT000

biomedical

Tools for curating biomedical training data for large-scale language modeling

Language:Python000

ChatDoctor

Language:PythonApache-2.0000

coco-caption

Language:Jupyter NotebookNOASSERTION000

DataCompression

Data compression of English text using the compressed tries data structure.

Language:C++010

DeepLearning-500-questions

Language:JavaScriptGPL-3.0000

FedML

FedML - The federated and distributed machine learning library enabling machine learning anywhere at any scale. It's backed by FedML, Inc (https://FedML.ai). Supporting large-scale geo-distributed training, cross-device federated learning on smartphones/IoTs, cross-silo federated learning on data silos, and research simulation. Best Paper Award at NeurIPS 2020 Federated Learning workshop. FedML’s core technology is backed by years of cutting-edge research represented in 50+ publications in ML/FL Algorithms, Security/Privacy, Systems, and Applications, as well as 10 years of industrial experience in Distributed Systems, Cloud Computing, and Mobile/IoT Systems.

Language:PythonApache-2.0000

ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文总结+润色+审稿+审稿回复

Language:PythonNOASSERTION000

fucking-algorithm

刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.

000

Large-Language-Model-Notebooks-Course

Practical course about Large Language Models.

MIT000

Medical-Question-Understanding

Repository for 3 papers on Summarization and Entailment for Medical User-Generated Questions.

Language:PythonMIT000

mimic-code

MIMIC Code Repository: Code shared by the research community for the MIMIC-III database

Language:Jupyter NotebookMIT000

oasis-scripts

Example download scripts for the OASIS3 project

Language:Shell000

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonMIT000

poetry

最全的汉语现代诗歌语料库整理，2K+诗人，42K+诗歌，8M+字，包括五四至今的所有流派。持续扩充...

Language:PythonMIT000

pytorch-frame

Tabular Deep Learning Library for PyTorch

Language:PythonMIT000

rayeren.github.io

My personal homepage

Language:SCSSMIT000

speechless

LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.

Apache-2.0000

taokz

010

taokz.github.io

Language:SCSSMIT010

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION000

UnifiedSKG

[EMNLP 2022] A Unified Framework and Analysis for Structured Knowledge Grounding with Text-to-Text Language Models

Language:PythonApache-2.0000

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT000