Khoi Nguyen Anh (KoiDev13)

Company: @UnifiedPost

Location: Viet Nam

Home Page: www.linkedin.com/in/knguyenanh8194

Khoi Nguyen Anh's starred repositories

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python | License: Apache-2.0 | Stargazers: 36149 | Issues: 756 | Issues: 9563

data-engineering-zoomcamp

Free Data Engineering course!

Language: Jupyter Notebook | Stargazers: 24434 | Issues: 451 | Issues: 125

duckdb

DuckDB is an analytical in-process SQL database management system

daily

daily.dev is a professional network for developers to learn, collaborate, and grow together 👩🏽‍💻 👨‍💻

google-api-python-client

🐍 The official Python client library for Google's discovery based APIs.

Language: Python | License: Apache-2.0 | Stargazers: 7644 | Issues: 285 | Issues: 1073

marp

The entrance repository of Markdown presentation ecosystem

Language: TypeScript | License: MIT | Stargazers: 7578 | Issues: 66 | Issues: 0

cloudquery

The open source high performance ELT framework powered by Apache Arrow

Language: Go | License: MPL-2.0 | Stargazers: 5773 | Issues: 61 | Issues: 2197

ByConity

ByConity is an open source cloud data warehouse

Language: C++ | License: Apache-2.0 | Stargazers: 2182 | Issues: 58 | Issues: 575

Language: TeX | Stargazers: 1363 | Issues: 37 | Issues: 0

data-engineering-wiki

The best place to learn data engineering. Built and maintained by the data engineering community.

Language: CSS | License: CC0-1.0 | Stargazers: 1300 | Issues: 27 | Issues: 27

data-engineering-interview-questions

More than 2000 data engineer interview questions.

google-data-analytics

Google Data Analytics Professional Certificate

Language: Jupyter Notebook | Stargazers: 689 | Issues: 20 | Issues: 1

HashtagCashtag

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture that aggregates Twitter and US stock market data for user sentiment analysis using open source tools: Apache Kafka for data ingestion, Apache Spark and Spark Streaming for batch and real-time processing, Apache Cassandra for storage, and Flask, Bootstrap, and HighCharts for the frontend.

Language: Python | License: Apache-2.0 | Stargazers: 385 | Issues: 50 | Issues: 451

astro-sdk

Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.

Language: Python | License: Apache-2.0 | Stargazers: 338 | Issues: 13 | Issues: 829

open-data-contract-standard

Home of the Open Data Contract Standard (ODCS).

Language: Shell | License: Apache-2.0 | Stargazers: 291 | Issues: 14 | Issues: 6

scala-at-light-speed

The repository for the free Scala at Light Speed mini-course

spark-essentials

The official repository for the Rock the JVM Spark Essentials with Scala course

NOOBS-CMDR

A tool to create macros for OBS

cookiecutter-pypackage

Cookiecutter template for a poetry-managed Python package.

Language: Python | License: BSD-3-Clause | Stargazers: 71 | Issues: 2 | Issues: 17

hack-your-pipe

Efficient streaming data ingestion, transformation & activation

Language: Python | License: Apache-2.0 | Stargazers: 27 | Issues: 2 | Issues: 0

C

C Programming

Language: C++ | Stargazers: 22 | Issues: 0 | Issues: 0

TodoListBlazorWasm

Source code for the TEDU Blazor for Beginners course

Language: C# | Stargazers: 18 | Issues: 2 | Issues: 0

Big-Data-Installation

The Complete Big Data Installation Solutions

Platformus-Sample-Ecommerce

Platformus CMS sample ecommerce

Language: TSQL | License: NOASSERTION | Stargazers: 8 | Issues: 3 | Issues: 1

gcs-blob-store

Blob store for Google Cloud Storage

Language: TypeScript | License: MIT | Stargazers: 2 | Issues: 2 | Issues: 0

ApacheSparkDataKickstart

Notebooks and artifacts to support the free YouTube course Apache Spark DataKickstart.

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0