youyoude's starred repositories

chinese-poetry

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。

Language:JavaScriptLicense:MITStargazers:47645Issues:0Issues:0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:165623Issues:0Issues:0

petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

Language:PythonLicense:Apache-2.0Stargazers:1773Issues:0Issues:0

gridstudio

Grid studio is a web-based application for data science with full integration of open source data science frameworks and languages.

Language:JavaScriptLicense:AGPL-3.0Stargazers:8873Issues:0Issues:0

h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6839Issues:0Issues:0

flink-boot

懒松鼠Flink-Boot 脚手架让Flink全面拥抱Spring生态体系,使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序,懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本(不需要理解分布式计算的理论知识和Flink框架的细节)便可以快速编写业务代码实现。为了进一步提升开发者使用懒松鼠脚手架开发大型项目的敏捷的度,该脚手架默认集成Spring框架进行Bean管理,同时将微服务以及WEB开发领域中经常用到的框架集成进来,进一步提升开发速度。比如集成Mybatis ORM框架,Hibernate Validator校验框架,Spring Retry重试框架等,具体见下面的脚手架特性。

Language:JavaLicense:BSD-3-ClauseStargazers:802Issues:0Issues:0

LarkMidTable

LarkMidTable 是一站式开源的数据中台,实现中台的 基础建设,数据治理,数据开发,监控告警,数据服务,数据的可视化,实现高效赋能数据前台并提供数据服务的产品。

Language:JavaLicense:Apache-2.0Stargazers:1754Issues:0Issues:0

flink-streaming-platform-web

基于flink的实时流计算web平台

Language:JavaLicense:MITStargazers:1785Issues:0Issues:0

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language:ScalaLicense:Apache-2.0Stargazers:7348Issues:0Issues:0

APIJSON

🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can be customized by Frontend(Client) users

Language:JavaLicense:NOASSERTIONStargazers:17008Issues:0Issues:0

God-Of-BigData

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

Stargazers:9554Issues:0Issues:0
Language:JavaLicense:NOASSERTIONStargazers:1Issues:0Issues:0

datax-web

DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、数据源信息加密等。

Language:JavaLicense:MITStargazers:5496Issues:0Issues:0

data-lineage

Generate and Visualize Data Lineage from query history

Language:PythonLicense:MITStargazers:306Issues:0Issues:0

PandasGUI

A GUI for Pandas DataFrames

Language:PythonLicense:MIT-0Stargazers:3161Issues:0Issues:0

eladmin

eladmin jpa 版本:项目基于 Spring Boot 2.6.4、 Jpa、 Spring Security、Redis、Vue的前后端分离的后台管理系统,项目采用分模块开发方式, 权限控制采用 RBAC,支持数据字典与数据权限管理,支持一键生成前后端代码,支持动态路由

Language:JavaLicense:Apache-2.0Stargazers:21115Issues:0Issues:0

ServerManagement

服务器管理工具,目前有文件管理器、进程监控、计划任务、webSSH、多主机管理等,准备在自己服务器上用,后续会加入更多运维相关,本项目后端python+flask,前端使用layui+jquery,代码在线编辑使用codemirror,webSSH后端使用paramiko前端xterm

Language:PythonStargazers:600Issues:0Issues:0

joyful-pandas

pandas中文教程

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4522Issues:0Issues:0

pravega

Pravega - Streaming as a new software defined storage primitive

Language:JavaLicense:Apache-2.0Stargazers:1980Issues:0Issues:0

mongo-connector

MongoDB data stream pipeline tools by YouGov (adopted from MongoDB)

Language:PythonLicense:Apache-2.0Stargazers:1877Issues:0Issues:0

debezium

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Language:JavaLicense:Apache-2.0Stargazers:10296Issues:0Issues:0

spring-boot-examples

about learning Spring Boot via examples. Spring Boot 教程、技术栈示例代码,快速简单上手教程。

Language:JavaStargazers:30122Issues:0Issues:0

superman

supervisor多机器管理工具,supervisor, superman, flask

Language:CSSStargazers:4Issues:0Issues:0

weld

High-performance runtime for data analytics applications

Language:RustLicense:BSD-3-ClauseStargazers:2989Issues:0Issues:0

datahub

The Metadata Platform for your Data Stack

Language:JavaLicense:Apache-2.0Stargazers:9544Issues:0Issues:0

flink-learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

Language:JavaLicense:Apache-2.0Stargazers:14430Issues:0Issues:0

996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

License:NOASSERTIONStargazers:269589Issues:0Issues:0

data-warehouse

The book of data warehouse

Stargazers:196Issues:0Issues:0

hadoop_study

定期更新Hadoop生态圈中常用大数据组件文档 重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图 印象笔记 Scala版本简单demo 常用工具类 去敏后的train code 持续更新!!!)

Language:JavaStargazers:916Issues:0Issues:0

DataLink

DataLink是一个满足各种异构数据源之间的实时增量同步、离线全量同步,分布式、可扩展的数据交换平台。

Language:JavaLicense:Apache-2.0Stargazers:1070Issues:0Issues:0