Yutaro's starred repositories
semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
mountpoint-s3
A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.
Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
setup-python
Set up your GitHub Actions workflow with a specific version of Python
data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
streamsync
No-code in the front, Python in the back. An open-source framework for creating data apps.
japanese-addresses
全国の町丁目レベル(277,191件)の住所データのオープンデータ
Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.
matplotlib-venn
Area-weighted venn-diagrams for Python/matplotlib
wasminspect
An interactive debugger for WebAssembly
datavault4dbt
Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.
QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning PaLM with only five examples per language. We use the synthetic data to finetune downstream QA models leading to improved accuracy in comparison to English-only and translation-based baselines.
instruction_ja
Japanese instruction data (日本語指示データ)