Luke Nguyen's repositories

robust-data-analytics-platform-with-duckdb-dbt-iceberg

Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution for valuable insights.

Language:ShellStargazers:23Issues:1Issues:0

modern-data-warehouse-modeling-and-data-quality-with-dbt-openmetadata

This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools

Language:PythonStargazers:20Issues:1Issues:0

open-source-modern-data-stack

This repo demonstrate a comprehensive modern data stack using popular open-source tools.

real-time-analytic-stack

This repo demonstrate a comprehensive real-time analytic stack using popular open-source tools.

private-generative-AI-model-for-data-warehouse

This repository helps to build a private AI in SQL analytics with generative models

Language:PythonStargazers:7Issues:1Issues:0

streaming-analytics-with-risingwave-and-dbt

This repo assists in building streaming analytics platform using RisingWave and dbt, empowering your real-time data insights.

Language:GoStargazers:7Issues:1Issues:0

openmetadata-duckdb-connector

This repository is OpenMetadata's custom DuckDB Connector

Language:PythonLicense:MITStargazers:6Issues:1Issues:0

airbyte-platform

The platform fundament of Airbyte powering all your ELT pipelines. Please file issues in https://github.com/airbytehq/airbyte

Language:JavaLicense:NOASSERTIONStargazers:1Issues:0Issues:0

DB-GPT

Revolutionizing Database Interactions with Private LLM Technology

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

dbt-server

A web API for dbt.

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

NSQL

Numbers Station Text to SQL model code.

License:Apache-2.0Stargazers:1Issues:0Issues:0

OpenMetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

License:Apache-2.0Stargazers:1Issues:0Issues:0

risingwave

The distributed streaming database: SQL stream processing with Postgres-like experience 🪄. 10X faster and more cost-efficient than Apache Flink 🚀.

Language:RustLicense:Apache-2.0Stargazers:1Issues:0Issues:0

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:TypeScriptLicense:MITStargazers:1Issues:0Issues:0

seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Language:JavaLicense:AGPL-3.0Stargazers:1Issues:0Issues:0

airbyte

Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

confluence-docker

The simplest docker file of Confluence. Support v8.7.2(latest) and v8.5.5(lts)

Language:DockerfileStargazers:0Issues:0Issues:0

dbt-athena

The athena adapter plugin for dbt (https://getdbt.com)

License:Apache-2.0Stargazers:0Issues:0Issues:0

dbt-dremio

dbt (data build tool) adapter for the Dremio

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dbt-run-remote-repository

This repo is a python library to run dbt project which store on S3 or Git server

Stargazers:0Issues:0Issues:0

dify

An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.

License:NOASSERTIONStargazers:0Issues:0Issues:0

jira

The simplest docker file of JIRA. Support v9.17.0(latest) and v9.12.11(lts)

Stargazers:0Issues:0Issues:0

openmetadata-dremio-connector

Openmetadata connector for Deremio data source

Stargazers:0Issues:0Issues:0

sqlmesh

Efficient data transformation and modeling framework that is backwards compatible with dbt.

License:Apache-2.0Stargazers:0Issues:0Issues:0

weaviate

Weaviate is an open source vector database that stores both objects and vectors, allowing for combining vector search with structured filtering with the fault-tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and various language clients.

Language:GoLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0