Chih-Yu Yeh's repositories
tensorflow-tutorials
Coursera TensorFlow Specializations
codebase-understanding
Understand your python codebase effortlessly using natural language
data-systems-playground
Here is an experimentation playground to help me explore various tools related to data systems.
geospatial-datascience
Geospatial Data Science Learning Resources
awesome-db-tools
Everything that makes working with databases easier
awesome-duckdb
🦆 A curated list of awesome DuckDB resources
clickhouse-docs
Official documentation for the ClickHouse database management system
data-engineering-zoomcamp
Free Data Engineering course!
demo-kg-build
demo app for Knowledge Graph Build with LLM LlamaIndex and NebulaGraph
dspy
Stanford DSPy: The framework for programming—not prompting—foundation models
duckdb
DuckDB is an in-process SQL OLAP Database Management System
effective-typescript
Effective TypeScript: 62 Specific Ways to Improve Your TypeScript
FinMind
Open Data, more than 50 financial data. 提供超過 50 個金融資料(台股為主),每天更新 https://finmind.github.io/
how-query-engines-work
This is the companion repository for the book How Query Engines Work.
jaffle_shop
A self-contained dbt project for testing purposes
LLM-Text-to-SQL-Architectures
A collection of architectural patterns leveraging Large Language Models (LLMs) for efficient Text-to-SQL generation.
llms-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
smol-course
A course on aligning smol models.
spark-playground
A repository for building local spark cluster.
vulcan-sql
Open-source Analytical Data API Framework for data apps. It turns SQL queries into RESTful APIs in no time!