GuohuaLi's repositories

chinese-word2vec

word2vec/glove/swivel binary file on chinese corpus

crf-shimo

A optimized version of crf++0.58 with a little faster and add some postprocess for chinese segment.

Language:ShellLicense:NOASSERTIONStargazers:2Issues:1Issues:0

online-coding-for-interview

在线协同编辑,用于远程面试写代码

CNN_sentence

CNNs for sentence classification

Language:PythonStargazers:1Issues:1Issues:0

cx-extractor

Automatically exported from code.google.com/p/cx-extractor

ik-analyzer

Automatically exported from code.google.com/p/ik-analyzer

Language:JavaStargazers:1Issues:0Issues:136

newspaper

News, full-text, and article metadata extraction in Python 3

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

AlphaHoldem

A Deep Reinforcment Learning Aproach to Texas Holdem

Language:RoffStargazers:0Issues:0Issues:0

brown-cluster

C++ implementation of the Brown word clustering algorithm.

Language:C++Stargazers:0Issues:1Issues:0

crfchunking-with-wordrepresentations

Train a CRF for syntactic chunking (CoNLL2000), and use word representations

Language:PythonStargazers:0Issues:1Issues:0

dict_build

自动构建中文词库:build dict from large chinese text using unsupervised method,algorithm:http://www.matrix67.com/blog/archives/5044

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Stargazers:0Issues:1Issues:0

flexse

mirror repository of flexse in google code. Author: scenbuffalo@gmail.com

Language:C++Stargazers:0Issues:0Issues:0

gensim

Topic Modelling for Humans

Language:PythonLicense:LGPL-3.0Stargazers:0Issues:0Issues:0

GloVe

GloVe model for distributed word representation

Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

guava

Google Core Libraries for Java 6+

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

HanLP

汉语言处理包 中文分词 词性标注 命名实体识别 依存句法分析 关键词提取 自动摘要 短语提取 拼音 简繁转换

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

jsoncpp

A C++ library for interacting with JSON.

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Go, Javascript and more

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

resource_list

resource list about QA/NLP/NLG/DL/ML/Dialogue/IE/AI

Stargazers:0Issues:2Issues:0

sentence2vec

Tools for mapping a sentence with arbitrary length to vector space

Language:PythonStargazers:0Issues:1Issues:0
Language:LuaStargazers:0Issues:1Issues:0

tidy-html5

The granddaddy of HTML tools, with support for modern standards

Language:CStargazers:0Issues:1Issues:0

tiny-cnn

header only, dependency-free deep learning framework in C++11

Language:C++Stargazers:0Issues:1Issues:0

TopNews

头条新闻

Language:JavaStargazers:0Issues:1Issues:0

utf8.h

📚 single header utf8 string functions for C and C++

Language:CLicense:UnlicenseStargazers:0Issues:0Issues:0

word2vec

Multiple version of word2vec. https://code.google.com/p/word2vec/

Language:PythonStargazers:0Issues:2Issues:0

word2vec-doc2vec

An extension of word2vec to efficiently represent new text as vectors. New text can be query, sentence and paragraph.

Language:CStargazers:0Issues:1Issues:0

xgboost

Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0