asitang's repositories

Language:PythonLicense:Apache-2.0Stargazers:7Issues:4Issues:0

sklearn-hierarchical-classification

Hierarchical classification module based on scikit-learn's interfaces

Language:PythonLicense:Apache-2.0Stargazers:6Issues:2Issues:0
Language:PythonStargazers:1Issues:0Issues:0

getting-started-with-git-and-github

Explaining Git and GitHub.

Stargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0

markdown-cheatsheet

Markdown Cheatsheet for Github Readme.md

License:MITStargazers:0Issues:0Issues:0

nutch

Mirror of Apache Nutch

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

parser-indexer

Metadata Parser and Solr Indexer. For Python equivalent, checkout https://github.com/USCDataScience/parser-indexer-py

Language:JavaStargazers:0Issues:0Issues:0

pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pycel

A library for compiling excel spreadsheets to python code & visualizing them as a graph

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

pytesseract

A Python wrapper for Google Tesseract

Language:PythonStargazers:0Issues:0Issues:0

shangridocs

Document exploration tool

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

soft_cosine

Exploration of Soft Cosine measure in document similarity computation tasks

Stargazers:0Issues:0Issues:0

tika-similarity

Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

word2vec

Automatically exported from code.google.com/p/word2vec

Stargazers:0Issues:0Issues:0