Keiji Yoshida's starred repositories

tensorflow

An Open Source Machine Learning Framework for Everyone

Language: C++ | License: Apache-2.0 | Stargazers: 184140 | Issues: 7617 | Issues: 39499

keras

Deep Learning for humans

Language: Python | License: Apache-2.0 | Stargazers: 61336 | Issues: 1914 | Issues: 11998

scikit-learn

scikit-learn: machine learning in Python

Language: Python | License: BSD-3-Clause | Stargazers: 58934 | Issues: 2142 | Issues: 11013

fastai

The fastai deep learning library

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 25902 | Issues: 609 | Issues: 1792

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Language: C++ | License: Apache-2.0 | Stargazers: 25820 | Issues: 912 | Issues: 5221

learning-spark

Example code from Learning Spark book

Language: Java | License: MIT | Stargazers: 3878 | Issues: 396 | Issues: 27

gluon-api

A clear, concise, simple yet powerful and efficient API for deep learning.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2300 | Issues: 149 | Issues: 14

LearningSparkV2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Language: Scala | License: Apache-2.0 | Stargazers: 1139 | Issues: 40 | Issues: 18

RedisAI

A Redis module for serving tensors and executing deep learning graphs

Language: C | License: NOASSERTION | Stargazers: 814 | Issues: 29 | Issues: 319
Language: Python | License: Apache-2.0 | Stargazers: 765 | Issues: 26 | Issues: 103

tech-talks

This repository contains the notebooks and presentations we use for our Databricks Tech Talks

Language: Go | License: Apache-2.0 | Stargazers: 658 | Issues: 20 | Issues: 13

databricks-cli

(Legacy) Command Line Interface for Databricks

Language: Python | License: NOASSERTION | Stargazers: 382 | Issues: 38 | Issues: 237

spark-knowledgebase

Spark Knowledge Base

dotaconstants

Constant data for Dota applications

Language: JavaScript | License: MIT | Stargazers: 285 | Issues: 21 | Issues: 43

glow

An open-source toolkit for large-scale genomic analysis

Language: Scala | License: Apache-2.0 | Stargazers: 262 | Issues: 19 | Issues: 157

mlflow-example

An example MLflow project

Language: Python | License: Apache-2.0 | Stargazers: 229 | Issues: 22 | Issues: 9

healthcare-data-harmonization

This is an engine that converts data of one structure to another, based on a configuration file which describes how. There is an accompanying syntax to make writing mappings easier and more robust.

Language: Java | License: Apache-2.0 | Stargazers: 198 | Issues: 39 | Issues: 27
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 172 | Issues: 25 | Issues: 4

bigquery-data-lineage

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Language: Java | License: Apache-2.0 | Stargazers: 141 | Issues: 19 | Issues: 17

datashare-toolkit

DIY commercial datasets on Google Cloud Platform

Language: JavaScript | License: Apache-2.0 | Stargazers: 87 | Issues: 41 | Issues: 289

oozie-to-airflow

Oozie Workflow to Airflow DAGs migration tool

Language: Python | License: Apache-2.0 | Stargazers: 83 | Issues: 28 | Issues: 184

datacatalog-connectors-rdbms

Sample code with integration between Data Catalog and RDBMS data sources.

Language: Python | License: Apache-2.0 | Stargazers: 72 | Issues: 8 | Issues: 23

hydrator-plugins

Cask Hydrator Plugins Repository

Language: Java | License: NOASSERTION | Stargazers: 64 | Issues: 59 | Issues: 0

redis-dataflow-realtime-analytics

Build a real-time website analytics dashboard on GCP using Dataflow, Cloud Memorystore (Redis) and Spring Boot

Language: Java | License: Apache-2.0 | Stargazers: 26 | Issues: 15 | Issues: 0

add-field

Plugin that adds a new field with a configurable value or UUID

Language: Java | Stargazers: 3 | Issues: 6 | Issues: 0

kafka-plugins

Kafka Source/Sink for reading/writing to kafka topic

WoWAH-parser

Reads the WoWAH files into one big csv.

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0