dawsongzhao's repositories
Anti-Anti-Spider
越来越多的网站具有反爬虫特性,有的用图片隐藏关键数据,有的使用反人类的验证码,建立反反爬虫的代码仓库,通过与不同特性的网站做斗争(无恶意)提高技术。(欢迎提交难以采集的网站)(因工作原因去TX写验证码了,项目暂停)
awesome-design-cn
设计师资源大全,包含:ICON图标、Logo设计、PhotoShop插件、交互设计工具、流程图、线框图/原型图、设计博客等
big-data-plugin
Kettle plugin that provides support for interacting within many "big data" projects including Hadoop, Hive, HBase, Cassandra, MongoDB, and others.
cdk
Cloudera Development Kit
cm_api
Cloudera Manager API Client
cm_ext
Cloudera Manager Extensibility Tools and Documentation.
etl-light
A light Kafka to HDFS/S3 ETL library based on Apache Spark
grappa
Grappa: scaling irregular applications on commodity clusters
hadoop-pcap
Hadoop library to read packet capture (PCAP) files
Hydrograph
A visual ETL development and debugging tool for big data
jackson-databind
General data-binding package for Jackson (2.x): works on streaming API (core) implementation(s)
java-user-agent-detection
Some code to deduce an OS/Platform/Browser out of a user-agent string
JsonPath
Java JsonPath implementation
kite
Kite SDK
kylin
Mirror of Apache Kylin
loppo
an extremely easy static site generator of markdown documents
pentaho-kettle
Pentaho Data Integration ( ETL ) a.k.a Kettle
python-goose
Html Content / Article Extractor, web scrapping lib in Python
riv.vim
Take Notes in rst.
scrapple
A framework for creating semi-automatic web content extractors
specs
ODPi specifications
spring-shell
Spring based interactive shell
streamingpro
Build Spark Streaming Application by SQL
superset
Superset is a data exploration platform designed to be visual, intuitive, and interactive
thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
thingsboard-gateway
Open-source IoT Gateway - integrates devices connected to legacy and third-party systems with Thingsboard IoT Platform using OPC-UA and MQTT protocols
udger-java
Java agent string parser based on Udger https://udger.com/products/local_parser
vim-markdown
Markdown Vim Mode
WhereHows
Data Discovery and Lineage for Big Data Ecosystem