0xqq's repositories

bigdata-sql-parser

An ANTLR4-based parser supporting Spark SQL, TiDB SQL, and Flink SQL, plus a parser for Spark/Flink jar run commands.

Language: Java | Stargazers: 30 | Issues: 3 | Issues: 0

flink-sql-lineage

FlinkSQL field lineage solution and source code. The core idea is to parse SQL through Calcite to generate a RelNode tree of relational expressions, then obtain the optimized logical plan through the optimization stage, and finally call Calcite's RelMetadataQuery to get the lineage relationship at the field level.

Language: Java | Stargazers: 1 | Issues: 0 | Issues: 0

Adlik

Adlik: Toolkit for Accelerating Deep Learning Inference

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

AI_Tutorial

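A curated collection of learning materials for AI fields such as machine learning, NLP, image recognition, and deep learning, together with technical materials on the architecture and algorithms of search, recommendation, and advertising systems.

Stargazers: 0 | Issues: 0 | Issues: 0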

algorithm-1

JS implementations of common graph algorithms, provided for G6 and Graphin to use in graph analysis scenarios.

Language: TypeScript | Stargazers: 0 | Issues: 1 | Issues: 0

BigDataAudit

A security vulnerability detector for Hadoop and Spark (a big data security detection tool).

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

chineseaddressanalyzer

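This project implements Chinese address parsing based on the Word word-segmentation plugin. It can extract the province, city, and district, the administrative division code, and the detailed address from an address. Addresses are matched by prefix fuzzy matching.

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0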

data-integration

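A web-based data integration platform built on Kettle, aiming to provide a drag-and-drop data integration platform on the web.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0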

eagle

Real-time data processing system based on Flink and CEP

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

flink-http-connector

Flink Http Connector

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

flink-sql-cookbook

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

flink-table-store-102

Playground for Flink Table Store with use cases and performance features

Language: Shell | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

GRU-CRF

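This project demonstrates how to extract the name, phone number, province, city, district, detailed address, and other fields from a user's express waybill to form structured information. It helps logistics practitioners extract information effectively and simplifies the form-filling process for customers. The project uses a Bi-GRU+CRF network model for sequence labeling. Bi-GRU addresses the long-term memory and backpropagation gradient problems and can model long sequences effectively, but it cannot capture dependencies between labels, so the Bi-GRU labeling results are fed to a CRF to obtain a new sequence labeling.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0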

iamQA

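A Chinese Wikipedia QA system that uses an NER model trained on CCKS2016 data and a CMRC2018 reading comprehension model, along with W2V word-vector search, deployed with torchserve.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0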

incubator-teaclave

Apache Teaclave (incubating) is an open source universal secure computing platform, making computation on privacy-sensitive data safe and simple.

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LogiKM

A one-stop platform for Apache Kafka cluster metrics monitoring and operations management.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MathModel-Pretrain

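Graduate mathematical modeling, Huawei Cup mathematical modeling contest, 2021 Problem D, breast cancer, machine learning, data analysis

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0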

mystars

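An awesome list, mainly covering machine learning, deep learning, NLP, GNN, recommender systems, biomedicine, machine vision, and more. Continuously updated! Stars welcome! Stars welcome! 😀😀😀

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0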

o2k

Oracle-to-Kafka CDC tool that synchronizes the Oracle online redo log to Kafka or other big data platforms in real time

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Pcap-Analyzer

A visual offline packet analyzer written in Python.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

PersonGraphDataSet

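PersonGraphDataSet, nearly 10 thousand person2person relationship facts. A person-graph dataset: a database of nearly one hundred thousand person-relationship graph facts, obtained through person-relation extraction algorithms plus manual curation. It provides base data for person-relationship search, querying, multi-hop question answering, and relationship reasoning.

Stargazers: 0 | Issues: 0 | Issues: 0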

pulsar-flink

Elastic data processing with Apache Pulsar and Apache Flink

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

questdb

An open source SQL database designed to process time series data, faster

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

secretflow

A unified framework for privacy-preserving data analysis and machine learning

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

streamx

Make Flink|Spark easier!!!

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

txtai

Build AI-powered semantic search applications

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

unif

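A deep learning NLP framework designed in the style of Scikit-Learn, supporting 40+ model classes and covering language modeling, text classification, NER, MRC, machine translation, and other areas.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0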