wzjmail's repositories
nlp-in-practice
NLP, Text Mining and Machine Learning starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, keyword extraction with TF-IDF, Text Classification with Logistic Regression, word count with PySpark, simple text preprocessing, pre-trained embeddings and more.
TextRank
A PageRank-based TextRank method that can be applied to Chinese keyword, phrase, and summary extraction. The code is written in Scala.
stringdistance
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
pytorch-beginner
PyTorch tutorial for beginners
ML-NLP
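This project covers the concepts and code implementations most often tested in Machine Learning, Deep Learning, and NLP interviews, which are also the theoretical foundations every algorithm engineer needs to know.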
CodingInterviews2-ByPython
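This project is a Python 3 implementation of the algorithm interview problems in the second edition of 《剑指offer》. It is a classic book, worth picking up now and then to read, browse, and memorize. The project is also meant to help Python programmers pass company technical interviews and land the offer they want.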
code-of-learn-deep-learning-with-pytorch
Code for the book "Learn Deep Learning with PyTorch"
Machine-Learning-Algorithms
Python implementations of classic classification, regression, association analysis, clustering, and recommendation algorithms
FPgrowth
FP-growth code from "Machine Learning in Action"
TraClusAlgorithm
A Java implementation of the TraClus algorithm, with an added GUI.
scala-tfidf
Keyword extraction
traminer
A Java library for preprocessing, managing and mining spatial trajectory data