davidie's repositories
Ad-papers
Papers on Computational Advertising
alphaFM
Multi-thread implementation of Factorization Machines with FTRL for binary-class classification problem.
AnyQ
FAQ-based Question Answering System
bert
TensorFlow code and pre-trained models for BERT
bert_serving
export bert model for serving
elasticsearch-definitive-guide
The Definitive Guide to Elasticsearch
EMNLP2018_NLI
Repository for NLI models (EMNLP 2018)
faiss
consider this when you need semantic search, instead of term search
fastFM
FM的实现中用的更广泛的python工具包
jpmml-lightgbm
lgb的java部署版本
lantern
🔴蓝灯最新版本下载 https://github.com/getlantern/download 🔴 Lantern Latest Download https://github.com/getlantern/download 🔴
libffm
相比于python-fm包,libFM的支持相对完善一点,准确率也更高一些。但是libFM缺少了模型保存和加载的支持,也没有early-stopping的机制,mcmc的学习更简便一些,但是mcmc得到的模型不能保存,als/sgd虽然能够保存模型,但学习效果依赖于调参,准确率比不上mcmc,总体来讲,libFM的支持也不太好用。最后选用libffm,libffm支持完备,而且准确率比libFM还要更高。
lightfm
用的相对广泛的一个python-fm包。API写的最详细,输入形式比较友好,但是输出的没有归一化的0-1之间。最后还是放弃了python-fm,改用libfm原版。
LightGBM
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.
nlp-competitions-list-review
复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
pyFM
试用了,训练速度慢,效果一般
ranklib
A library of learning to rank algorithms
redis-py-cluster
Python cluster client for the official redis cluster. Redis 3.0+.
rnn-nlu
意图识别&槽填充联合模型 - A TensorFlow implementation of Recurrent Neural Networks for Sequence Classification and Sequence Labeling
roberta_zh
RoBERTa中文预训练模型: RoBERTa for Chinese
serving
A flexible, high-performance serving system for machine learning models
simple-effective-text-matching
RE2算法 - Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
tensorflow
An Open Source Machine Learning Framework for Everyone
tensorflow-DSMM
Tensorflow implementations of various Deep Semantic Matching Models
tensorflow_template_application
TensorFlow template application for deep learning
Text-Pairs-Relation-Classification
About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based on Neural Network.
wide_deep
模型的实现细节直接用的Estimator定制的,主要工作在于特征配置