JFanZhao's repositories

spider

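Uses Java, httpclient, and httpcleaner for multithreaded, distributed crawling of product information from e-commerce sites. Data is stored in HBase, Solr indexes the products, a Redis queue holds the shared URL repository, and ZooKeeper monitors the lifecycle of the crawler nodes.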

feature_extraction

Text feature extraction algorithms: implementations of the chi-square test and information gain for extracting text features.

technology-talk

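A summary of commonly used technical frameworks and open-source middleware in the Java ecosystem, covering system architecture, project management, classic architecture case studies, databases, common third-party libraries, production operations, and more.

Stargazers: 2 | Issues: 0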

alchemy

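A web system built for Flink. It supports defining UDFs in the web UI and submitting SQL and JAR jobs, supports managing sources, sinks, and jobs, and can manage Flink clusters on OpenShift.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0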

apollo_demo

Demo of calling Ctrip's Apollo configuration center from Java

Language: Java | Stargazers: 0 | Issues: 0
Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

chinese-name-score

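A name-scoring project based on the httpcn.com website: Five Grids and Three Talents (五格三才) name analysis, BaZi and Five Elements analysis, Five Grids numerology name scoring, BaZi-based name scoring, and more.

Language: Python | Stargazers: 0 | Issues: 0
Language: Java | Stargazers: 0 | Issues: 0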

sylph-ivan

Stream computing platform for bigdata

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

DicSentimentAnalysis

Dictionary-based text sentiment analysis with a user interface named "小白" (Xiaobai)

Language: Java | Stargazers: 0 | Issues: 0

flink

Apache Flink

License: Apache-2.0 | Stargazers: 0 | Issues: 0

flink-streaming-platform-web

A real-time stream computing web platform based on flink-sql

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0

hudi

Upserts, Deletes And Incremental Processing on Big Data.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

JavaEE-Framework-Sample

Code samples for Java EE (Java web)

Language: Java | Stargazers: 0 | Issues: 0

learning-spark

Example code from Learning Spark book

Language: Java | License: MIT | Stargazers: 0 | Issues: 0

myblog

An in-depth Java technology blog

Stargazers: 0 | Issues: 0

notes-python

Python notes in Chinese

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

pddSpider

Pinduoduo crawler that scrapes all products, reviews, and related information

Stargazers: 0 | Issues: 0

PersonalShare

Personal Stuff Share With Others

Stargazers: 0 | Issues: 0

Pinduoduo

Pinduoduo product information crawler

Stargazers: 0 | Issues: 0

pinduoduo-ivan

pdd crawler with JS decryption: decrypting the anti_content parameter and implementing an approach to full-site crawling

Stargazers: 0 | Issues: 0

pydata-book

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0

QQSpider

Qzone (QQ空间) crawler (blog posts, status updates, profile information)

Language: Python | Stargazers: 0 | Issues: 0

scala

The Scala programming language

Language: Scala | Stargazers: 0 | Issues: 0

seasonal

Robustly estimate trend and periodicity in a timeseries.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

simhash-1

Simhash value computation for Chinese documents

Language: C++ | Stargazers: 0 | Issues: 0

spark

Mirror of Apache Spark

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0

spark-programming-guide-zh-cn

Simplified Chinese edition of the Spark Programming Guide

License: NOASSERTION | Stargazers: 0 | Issues: 0

UnbalancedDataset

Python module to perform under sampling and over sampling with various techniques.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

xhamster_analysis

Data analyzer and predictor for https://xhamster.com/

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0