JFanZhao's repositories
feature_extraction
Text feature extraction algorithms: implementations of the chi-square test and information gain for extracting text features
technology-talk
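A roundup of commonly used technologies in the Java ecosystem: frameworks, open-source middleware, system architecture, project management, classic architecture case studies, databases, common third-party libraries, production operations, and more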
alchemy
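A web system built for Flink. Supports defining UDFs in the web page and submitting SQL and JAR jobs; supports managing sources, sinks, and jobs; can manage Flink clusters on OpenShift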
apollo_demo
Demo of calling Ctrip's Apollo configuration center from Java
chinese-name-score
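Name-scoring project for the httpcn.com website: five-grid (Wuge) and three-talents (Sancai) name analysis, Bazi and five-elements (Wuxing) analysis, Wuge numerology name scoring, name-and-Bazi scoring, and more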
sylph-ivan
Stream computing platform for big data
DicSentimentAnalysis
Dictionary-based text sentiment analysis with a user interface, "小白"
flink
Apache Flink
flink-streaming-platform-web
A real-time stream computing web platform based on flink-sql
hudi
Upserts, Deletes And Incremental Processing on Big Data.
JavaEE-Framework-Sample
Code samples for Java EE (Java web)
learning-spark
Example code from Learning Spark book
myblog
An in-depth Java technical blog
notes-python
Python notes in Chinese
pddSpider
Pinduoduo crawler that scrapes all products, reviews, and other information
PersonalShare
Personal stuff shared with others
Pinduoduo
Pinduoduo product information crawler
pinduoduo-ivan
pdd crawler: JS decryption, decrypting the anti_content parameter, and an implementation of the approach to full-site scraping
pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
QQSpider
Qzone (QQ空间) crawler (blog posts, status updates, profile information)
scala
The Scala programming language
seasonal
Robustly estimate trend and periodicity in a timeseries.
simhash-1
simhash value computation for Chinese documents
spark
Mirror of Apache Spark
spark-programming-guide-zh-cn
Spark Programming Guide, Simplified Chinese edition
UnbalancedDataset
Python module to perform under sampling and over sampling with various techniques.
xhamster_analysis
A data analyzer and predictor for https://xhamster.com/