0xqq's repositories
bigdata-sql-parser
基于antlr4 解析器,支持spark sql, tidb sql, flink sql, Spark/flink jar 运行命令解析器
flink-sql-lineage
FlinkSQL字段血缘解决方案及源码。FlinkSQL field lineage solution and source code, The core idea is to parse SQL through Calcite to generate a RelNode tree of relational expressions. Then get the optimized logical paln through optimization stage, and finally call Calcite RelMetadataQuery to get the lineage relationship at the field level.
Adlik
Adlik: Toolkit for Accelerating Deep Learning Inference
AI_Tutorial
精华机器学习,NLP,图像识别, 深度学习等人工智能领域学习资料,搜索,推荐,广告系统架构及算法技术资料整理
algorithm-1
常用的图算法 JS 实现,提供给 G6 及 Graphin 用于图分析场景使用。
BigDataAudit
The security vulns detector for Hadoop and Spark(大数据安全检测工具)
chineseaddressanalyzer
本项目是基于Word分词插件实现的中文地址解析功能, 可解析出地址的省市区、行政区划代码和详细地址。地址是前置模糊匹配
data-integration
基于kettle实现的web版数据集成平台,致力于提供web可拖拽的数据集成平台。
eagle
Real time data processing system based on flink and CEP
flink-http-connector
Flink Http Connector
flink-sql-cookbook
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
flink-table-store-102
Playground for Flink Table Store with use cases and performance features
GRU-CRF
本项目将演示如何从用户的快递单中,提取姓名、电话、省、市、区、详细地址等内容,形成结构化信息。辅助物流行 业从业者进行有效的信息提取,简化客户填写表单的流程。本项目采用了Bi-GRU+CRF网络模型来进行序列化标注,使用Bi-GRU 来解决长期记忆和反向传播中梯度问题,能够有效对长序列建模,但是无法解决标签之间的依赖性,于是将Bi-GRU标注的结果喂给 CRF得到新的序列标注。
iamQA
中文wiki百科QA问答系统,使用了CCKS2016数据的NER模型和CMRC2018的阅读理解模型,还有W2V词向量搜索,使用torchserve部署
incubator-teaclave
Apache Teaclave (incubating) is an open source universal secure computing platform, making computation on privacy-sensitive data safe and simple.
LogiKM
一站式Apache Kafka集群指标监控与运维管控平台
MathModel-Pretrain
研究生数学建模,华为杯数学建模,2021D题,乳腺癌,机器学习,数据分析
mystars
很棒的列表,主要是机器学习、深度学习、NLP、GNN、推荐系统、生物医药、机器视觉等内容。持续更新!欢迎star!欢迎star!😀😀😀
o2k
oracle to kafka cdc tools, Synchronize Oracle online redo log to kafka or other big data platforms in realtime
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Pcap-Analyzer
Python编写的可视化的离线数据包分析器
PersonGraphDataSet
PersonGraphDataSet, nearly 10 thousand person2person relationship facts。 人物图谱数据集,近十万的人物关系图谱事实数据库,通过人物关系抽取算法抽取+人工整理得出,可用于人物关系搜索、查询、人物关系多跳问答,以及人物关系推理等场景提供基础数据。
pulsar-flink
Elastic data processing with Apache Pulsar and Apache Flink
questdb
An open source SQL database designed to process time series data, faster
secretflow
A unified framework for privacy-preserving data analysis and machine learning
streamx
Make Flink|Spark easier!!!
txtai
Build AI-powered semantic search applications
unif
仿 Scikit-Learn 设计的深度学习自然语言处理框架, 支持 40+ 种模型类, 涵盖语言模型、文本分类、NER、MRC、机器翻译等各个领域