Sol Wu's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 130578 | Issues: 1118 | Issues: 15479

DeepLearning-500-questions

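Deep Learning 500 Questions: a question-and-answer treatment of frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, written to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly invited to point out any errors. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be prosecuted. Tan 2018.06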

Language: JavaScript | License: GPL-3.0 | Stargazers: 54033 | Issues: 2258 | Issues: 188

big-list-of-naughty-strings

The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.

Language: Python | License: MIT | Stargazers: 46079 | Issues: 850 | Issues: 99

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Python | License: MIT | Stargazers: 19660 | Issues: 254 | Issues: 72

tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Language: Python | License: Apache-2.0 | Stargazers: 15244 | Issues: 466 | Issues: 1247

TabNine

AI Code Completions

Language: Shell | License: MIT | Stargazers: 10521 | Issues: 144 | Issues: 577

text_classification

All kinds of text classification models, and more, with deep learning

Language: Python | License: MIT | Stargazers: 7821 | Issues: 299 | Issues: 124

gcn

Implementation of Graph Convolutional Networks in TensorFlow

Language: Python | License: MIT | Stargazers: 7061 | Issues: 157 | Issues: 194

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language: Python | License: NOASSERTION | Stargazers: 5464 | Issues: 115 | Issues: 654

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language: Python | License: MIT | Stargazers: 4200 | Issues: 68 | Issues: 558

ChineseNLPCorpus

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

MUSE

A library for Multilingual Unsupervised or Supervised word Embeddings

Language: Python | License: NOASSERTION | Stargazers: 3176 | Issues: 99 | Issues: 171

List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

char-rnn-tensorflow

Multi-layer Recurrent Neural Networks (LSTM, RNN) for character-level language models in Python using TensorFlow

Language: Python | License: MIT | Stargazers: 2639 | Issues: 139 | Issues: 74

SparrowRecSys

A Deep Learning Recommender System

Language: Python | License: Apache-2.0 | Stargazers: 2377 | Issues: 57 | Issues: 33

fast-bert

Super easy library for BERT-based NLP models

Language: Python | License: Apache-2.0 | Stargazers: 1854 | Issues: 42 | Issues: 252

emoji-regex

A regular expression to match all Emoji-only symbols as per the Unicode Standard.

Language: JavaScript | License: MIT | Stargazers: 1729 | Issues: 22 | Issues: 78

CogDL

CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)

Language: Python | License: MIT | Stargazers: 1710 | Issues: 42 | Issues: 117

bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Language: Python | License: MIT | Stargazers: 1173 | Issues: 28 | Issues: 63

FakeNewsNet

This is a dataset for fake news detection research

cnn-text-classification-pytorch

CNNs for Sentence Classification in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1019 | Issues: 15 | Issues: 19

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Language: Jupyter Notebook | License: MIT | Stargazers: 776 | Issues: 34 | Issues: 12

DeepGBM

SIGKDD'2019: DeepGBM: A Deep Learning Framework Distilled by GBDT for Online Prediction Tasks

hyperopt

Distributed Asynchronous Hyperparameter Optimization in Python

Language: Python | License: NOASSERTION | Stargazers: 511 | Issues: 32 | Issues: 0

keras-quora-question-pairs

A Keras model that addresses the Quora Question Pairs dyadic prediction task.

Language: Jupyter Notebook | License: MIT | Stargazers: 368 | Issues: 18 | Issues: 9

indic-trans

The project aims to add a state-of-the-art transliteration module for cross-transliteration among all Indian languages, including English.

Language: Python | License: AGPL-3.0 | Stargazers: 257 | Issues: 13 | Issues: 43

RNN-TrajModel

The source code of the IJCAI 2017 paper "Modeling Trajectory with Recurrent Neural Networks"

Efficient-SSL

Implementation of the IGCN and GLP models from our paper "Label Efficient Semi-Supervised Learning via Graph Filtering."

Language: Python | License: MIT | Stargazers: 74 | Issues: 5 | Issues: 4

Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch

Reproducing "Character-Level Language Modeling with Deeper Self-Attention" in PyTorch

Language: Python | License: MIT | Stargazers: 59 | Issues: 6 | Issues: 3