Cheng Dahai (DaHaiHuha)

Company: Microsoft

Location: Suzhou, China

Cheng Dahai's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python, License: Apache-2.0, Stargazers: 132935, Issues: 1117, Issues: 15858

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python, License: MIT, Stargazers: 26422, Issues: 273, Issues: 731

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language: Python, License: NOASSERTION, Stargazers: 22336, Issues: 635, Issues: 266

DeepLearning

Introductory deep learning tutorials and outstanding articles (Deep Learning Tutorial)

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 14139, Issues: 299, Issues: 13

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 13115, Issues: 325, Issues: 321

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language: Python, License: NOASSERTION, Stargazers: 12397, Issues: 220, Issues: 607

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++, License: Apache-2.0, Stargazers: 10137, Issues: 127, Issues: 748

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Language: Python, License: MIT, Stargazers: 7451, Issues: 160, Issues: 251

LogicStack-LeetCode

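Source code for the "Grinding Through LeetCode" article series from the WeChat official account 「宫水三叶的刷题日记」 (Gong Shui San Ye's problem-solving diary)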

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language: Python, License: MIT, Stargazers: 3164, Issues: 64, Issues: 320

Awesome-Bioinformatics

A curated list of awesome Bioinformatics libraries and software.

DMTK

Microsoft Distributed Machine Learning Toolkit

deepmd-kit

A deep learning package for many-body potential energy representation and molecular dynamics

Language: C++, License: LGPL-3.0, Stargazers: 1461, Issues: 47, Issues: 780

ERNIE

Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"

Language: Python, License: MIT, Stargazers: 1410, Issues: 28, Issues: 87

wdl

Workflow Description Language - Specification and Implementations

decision-forests

A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.

Language: Python, License: Apache-2.0, Stargazers: 658, Issues: 24, Issues: 165

tape

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.

Language: Python, License: BSD-3-Clause, Stargazers: 652, Issues: 22, Issues: 107

UniRep

UniRep model, usage, and examples.

rgn

Recurrent Geometric Networks for end-to-end differentiable learning of protein structure

Language: Python, License: MIT, Stargazers: 326, Issues: 40, Issues: 28

protein-sequence-embedding-iclr2019

Source code for "Learning protein sequence embeddings using information from structure" - ICLR 2019

Language: Python, License: NOASSERTION, Stargazers: 253, Issues: 11, Issues: 25

ML-Model-CI

MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging the gap between current ML training and serving systems.

Language: Python, License: Apache-2.0, Stargazers: 188, Issues: 18, Issues: 117

openprotein

A PyTorch framework for prediction of tertiary protein structure

Language: Python, License: MIT, Stargazers: 178, Issues: 10, Issues: 23

tape-neurips2019

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (DEPRECATED)

Language: Python, License: MIT, Stargazers: 118, Issues: 9, Issues: 17

protein-transformer

Predicting protein structure through sequence modeling

Language: Jupyter Notebook, License: BSD-3-Clause, Stargazers: 106, Issues: 5, Issues: 4

ProFOLD

A protein 3D structure prediction application

Language: Python, License: MIT, Stargazers: 65, Issues: 3, Issues: 6

PLUS

Official Pytorch implementation of PLUS (Protein sequence representations Learned Using Structural information), IEEE Access 2021

Language: Python, License: MIT, Stargazers: 41, Issues: 7, Issues: 3

pytorch-rgn

Recurrent Geometric Network in Pytorch

Language: Jupyter Notebook, Stargazers: 29, Issues: 1, Issues: 2

parallel_mAP_evaluation

This repo parallelizes mAP_evaluation using python's multiprocessing module.

Language: Python, License: NOASSERTION, Stargazers: 18, Issues: 3, Issues: 2

rna-seq

Bioinformatics pipelines

Language: WDL, Stargazers: 7, Issues: 4, Issues: 0