zshwuhan's repositories
ai-ml-bridgecut
A network bridge cutting algorithm (BridgeCut) that creates clusters based solely on network structure.
ai-ml-clustering
Implementation of multiple clustering algorithms (K-means, Bisecting K-means, Agglomerative Hierarchial Clustering with Intra-Cluster Similarity (IST), Centroid Similarity (CST), and UPGMA) for performance comparisons on different data sets.
ai-ml-improved-bridgecut
An improved version of the network BridgeCut algorithm using rank tie breaking, depth bridging coefficient, and re-ranking.
ai-ml-lerad
Learning rules for Anomaly Detection.
ai-ml-ripperk
A high performance rule induction algorithm (RIPPERk).
anonymization
This project is about applying k-anonymity principle to tables of relational data. Later on, the centralized algorithms will be modified so as to be executed in a distributed manner.
BCFWstruct
Block-Coordinate Frank-Wolfe Optimization for Structural SVMs
boostingPL
BoostingPL - Scalable and Parallel Boosting with MapReduce
CMTreeMiner_Ordered
an algorithm for mining closed and maximal frequent rooted ordered trees
CMTreeMiner_Unordered
an algorithm for mining closed and maximal frequent rooted unordered trees
cs224u-project
event chains and schemas
Data-Mining
this c++ code can do sequence mining
DocumentClustering
Document clustering using a Suffix Tree Clustering algorithm.
FastDCS
FastDCS is a distributed computing system.
FreeTreeMiner
an a priori algorithm for mining frequent free trees
HybridTreeMiner_Free
an algorithm for mining frequent free trees
HybridTreeMiner_Rooted
an algorithm for mining frequent rooted unordered trees
iPM3F
Source code and data for the paper "Nonparametric Max-Margin Matrix Factorization for Collaborative Prediction" accepted to NIPS'2012 and "Fast Max-Margin Matrix Factorization with Data Augmentation" accepted to ICML'2013
kernels
Kernels for Graph Similarity and Node Similarity
MLC-PCC
Probabilistic Classifier Chains (PCC) algorithm solving the problem of Multi-Label Classification.
pegasos_multiclass
multiclass pegasos support vector machine implementation
PyMTL
PyMTL (Python library for Multi-task learning) is a Python module implementing a Multi-task learning framework built on top of scikit-learn, SciPy and NumPy.
pysvmlight
Python wrapper around the SVMLight support vector machine library, implemented in Cython
RecSys
A Recommender System Demo(Include CF\SVD\Graph-Model....)
Research-Gap-Mining
We propose a system to capture the topology of topic evolution inherent in a domain specific corpus and highlight the gaps in the existing study.
RicherPathSIM
Repository for research on an improvement for PathSIM with heterogeneous graph mining with node and edge attributes.
RootedTreeMiner
an algorithm for mining frequent rooted unordered trees
SCMiner
A Python Implementation of the SCMiner proposed in SIGKDD 2012
sdm
Implementation of Support Distribution Machines
slda
Supervised Latent Dirichlet Allocation for Classification