AINLP's repositories

MeCab-Chinese

Chinese morphological analysis with Word Segment and POS Tagging data for MeCab

allennlp

A natural language processing toolkit using state-of-the-art deep learning models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AutoPhrase

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

bible-corpus

A multilingual parallel corpus created from translations of the Bible.

License:CC0-1.0Stargazers:0Issues:0Issues:0

brook

Brook is a cross-platform(Linux/MacOS/Windows/Android/iOS) proxy software

Language:GoLicense:GPL-3.0Stargazers:0Issues:0Issues:0

brpc

Most common RPC framework used throughout Baidu, with 600,000+ instances and 500+ kinds of services, called "baidu-rpc" inside Baidu.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

chaizi

漢語拆字字典

License:NOASSERTIONStargazers:0Issues:0Issues:0

deep-siamese-text-similarity

Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character embeddings

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

deepnlp

Deep Learning NLP Pipeline implemented on Tensorflow

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

discourse

A platform for community discussion. Free, open, simple.

Language:RubyLicense:GPL-2.0Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit

Language:LuaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

fairseq-py

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

fastText_multilingual

Multilingual word vectors in 78 languages

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

got-book-6

RNN trained on the first five GOT books

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

incubator-airflow

Apache Airflow (Incubating)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

interactive-coding-challenges

Huge update! Interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

NanGeMT

NanGe - A Rule-based Chinese-English Machine Translation System

Language:C++Stargazers:0Issues:0Issues:0

ngx_http_google_filter_module

Nginx Module for Google Mirror

Language:CLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialog datasets.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

pipesocks

A pipe-like SOCKS5 tunnel system.

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

sent2vec

General purpose unsupervised sentence representations

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SIF

sentence embedding by Smooth Inverse Frequency weighting scheme

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

skip-thoughts

Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"

Language:PythonStargazers:0Issues:0Issues:0

Synonyms

这是一个可以标准化用户搜索关键词,并且返回近义的候选搜索关键词的程序。

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Text-Summarization-with-Amazon-Reviews

A seq2seq model that can generate summaries from fine food reviews on Amazon.

Language:HTMLStargazers:0Issues:0Issues:0

weibo_terminater

Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anythings. The Terminator

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

word2vec_pipeline

Pipeline to turn input text into a w2v embedding.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

zmirror

The next-gen reverse proxy for full site mirroring

Language:PythonLicense:MITStargazers:0Issues:0Issues:0