windHat's repositories
Agriculture_KnowledgeGraph
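Agricultural Knowledge Graph (AgriKG): information retrieval, named entity recognition, relation extraction, intelligent question answering, and decision support for the agricultural domain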
awesome_Chinese_medical_NLP
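A curated collection of public resources for Chinese medical NLP: terminologies, corpora, word embeddings, pre-trained models, knowledge graphs, named entity recognition, QA, information extraction, models, papers, etc.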
biomedical
Tools for curating biomedical training data for large-scale language modeling
CBLUE
Chinese medical information processing benchmark CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
chip2021
Solution of CHIP2021 Task1
CLUECorpus2020
Large-scale Pre-training Corpus for Chinese (100G)
competition-baseline
Knowledge, code, and approaches for data science competitions
GazeTracking
👀 Eye tracking library that is easy to integrate into your projects
GLUE-baselines
[DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations
HiC-Pro
HiC-Pro: An optimized and flexible pipeline for Hi-C data processing
imcs21-cblue
This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi
mimic-website
Website for the MIMIC Critical Care Database (currently version MIMIC-III)
OpenPrompt
An Open-Source Framework for Prompt-Learning.
RenalTumor
Single-cell multi-omics analysis reveals regulatory programs in clear cell renal cell carcinoma
RL4LMs
A modular RL library to fine-tune language models to human preferences
sicelore
Single Cell Long Read is a suite of tools dedicated to Cell barcode / UMI assignment and analysis of highly multiplexed single cell Nanopore long read sequencing data.
SimSiam
A PyTorch implementation of the paper 'Exploring Simple Siamese Representation Learning'
stimATAC_analyses_code
All code associated with the manuscript detailing scATAC and scRNA-seq following stimulation of PBMCs
trl
Train transformer language models with reinforcement learning.
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo