witnesslq's starred repositories

hertzbeat

A real-time monitoring system with agentless, performance cluster, prometheus-compatible, custom monitoring and status page building capabilities.

Language: Java | License: Apache-2.0 | Stargazers: 4468 | Issues: 54 | Issues: 538

bigdata_analyse

Big data analysis projects

Language: Python | License: MIT | Stargazers: 3586 | Issues: 50 | Issues: 8

pimcore

Core Framework for the Open Source Data & Experience Management Platform (PIM, MDM, CDP, DAM, DXP/CMS & Digital Commerce)

Language: PHP | License: NOASSERTION | Stargazers: 3224 | Issues: 180 | Issues: 6617

cube-studio

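Cube Studio is an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. It supports SSO login, multi-tenancy, big data platform integration, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, VGPU inference services, edge computing, serverless, a labeling platform, automated labeling, dataset management, large-model fine-tuning, vllm large-model inference, llmops, private knowledge bases, and an AI model app store. It also supports one-click model development/inference/fine-tuning, domestic (Chinese) cpu/gpu/npu chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano.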

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2691 | Issues: 68 | Issues: 141

C-OCR

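C-OCR is an OCR project developed in-house by Ctrip (携程). It mainly covers recognition of travel-related documents and materials such as ID cards, passports, train tickets, and visas. The project has four parts: rejection, detection, recognition, and post-processing.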

amoro

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

Language: Java | License: Apache-2.0 | Stargazers: 664 | Issues: 29 | Issues: 1181

indexr

An open-source columnar data format designed for fast & realtime analytic with big data.

Language: Java | License: Apache-2.0 | Stargazers: 451 | Issues: 65 | Issues: 35

samoa

SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.

Language: Java | License: Apache-2.0 | Stargazers: 427 | Issues: 61 | Issues: 0

COVID19-data-analysis

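This series of resources covers Python-based big data analysis of the epidemic, including web crawling, visual analysis, GIS maps, sentiment analysis, public opinion analysis, topic mining, threat intelligence tracing, knowledge graphs, prediction and early warning, and AI and NLP applications. Reading it alongside the author's CSDN blog is recommended. Wuhan will prevail, Hubei will prevail, ** will prevail!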

DzFilter

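[No longer maintained] Content safety implemented with the DFA algorithm: anti-spam, intelligent pornography detection, sensitive-word filtering, harmful-content detection, text validation, and sensitive-word detection, including keyword extraction, HTML tag filtering, and more.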

CanalX

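A data-aware service framework based on `Canal`. It can be used for all kinds of data-related business development around the `Mysql` database, and for building a variety of service platforms.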

weekly

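The weekly report system's tech stack is mainly node+vue+redis+mysql+es6. It is an enterprise management system in which employees report their weekly work and its completion status, and managers at each level can view reports and remind staff who have not yet submitted theirs. The backend is written entirely in nodeJS on the thinkjs framework, with mysql as the database. It is suited to those who like nodeJS backends and who support the "big front end" and full-stack development. URL: http://weekly.mwcxs.top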

Language: Vue | License: MIT | Stargazers: 188 | Issues: 8 | Issues: 9

fili

Easily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.

Language: Java | License: Apache-2.0 | Stargazers: 172 | Issues: 38 | Issues: 485

wakatime-sync

A tool for syncing and displaying wakatime data

BigdataAi

An introduction to some of Liao Wenzhe's main representative works, including AIOPS, anomaly detection, root cause analysis, alert noise reduction, correlation analysis, data security, data mining, machine learning, deep learning, text matching, public talks, ways of thinking, learning methods, and reading. Stars are welcome.

Database-SQL-Actual-Combat

A collection of hands-on database SQL practice problems from Nowcoder (牛客网)

License: MIT | Stargazers: 49 | Issues: 2 | Issues: 0

superBI

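SuperBI is an enterprise-grade rapid BI application developed by CloudMinds (达闼科技) on top of the open-source project superset. Its extensible framework design supports multiple DBMS data sources, making data BI simpler. superbi offers an intuitive UI, a drag-and-drop editing experience, and configuration-based chart creation, making it easy to build data visualization dashboards.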

Language: Python | License: Apache-2.0 | Stargazers: 44 | Issues: 6 | Issues: 1

bindbg

Bringing Dynamic Analysis to Java

Language: Java | License: GPL-3.0 | Stargazers: 32 | Issues: 4 | Issues: 2

ev-gb-gateway

A TSP data ingestion gateway implemented on the GB32960 standard

Language: Java | Stargazers: 25 | Issues: 1 | Issues: 0

log

Root cause analysis for cluster anomaly alerts

Language: Python | Stargazers: 20 | Issues: 4 | Issues: 0

ohmydata

Data services: write a SQL statement and publish it as an API

palo-deploy-k8s

Deploying Apache Doris (incubating) (formerly Baidu palo) with K8S

pdf_annotate

Edit PDF documents online

Language: Java | Stargazers: 11 | Issues: 0 | Issues: 0

DocxToPDFWithWatermark

A small batch-processing tool: (1) batch-convert Word documents to PDF; (2) batch-add watermarks to PDF documents

Language: Python | Stargazers: 9 | Issues: 1 | Issues: 0

bigData

Beijing big data governance service project

Language: Vue | Stargazers: 4 | Issues: 0 | Issues: 0
Language: Shell | Stargazers: 4 | Issues: 0 | Issues: 0

link-visitors-engine

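link-visitors-engine is a customer acquisition engine that can obtain the mobile phone numbers, regions, postal codes, and QQ information of customers in a specific area. A professional online marketing assistant.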

Language: Java | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

DataTagger

Lightweight & x-platform dataset markup tool

Language: Java | Stargazers: 2 | Issues: 0 | Issues: 0

watermark

Add watermarks to PDF files using Java code

Language: Java | Stargazers: 2 | Issues: 0 | Issues: 0

pdfWater

Batch-add a specified watermark to PDF files.

Language: Java | Stargazers: 1 | Issues: 0 | Issues: 0