Andrew Shinohara (ashinohara)

ashinohara

Geek Repo

Company:@invitae

Location:San Francisco, CA

Github PK Tool:Github PK Tool

Andrew Shinohara's starred repositories

openai-cookbook

Examples and guides for using the OpenAI API

datahub

The Metadata Platform for your Data Stack

Language:JavaLicense:Apache-2.0Stargazers:9607Issues:254Issues:2134

PySyft

Perform data science on data that remains in someone else's server

Language:PythonLicense:Apache-2.0Stargazers:9417Issues:199Issues:3392

pdfGPT

PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!

Language:PythonLicense:MITStargazers:6901Issues:51Issues:97

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language:ScalaLicense:Apache-2.0Stargazers:3236Issues:81Issues:333

laravel-migrations-generator

Laravel Migrations Generator: Automatically generate your migrations from an existing database schema.

Language:PHPLicense:MITStargazers:2421Issues:44Issues:100

SDV

Synthetic data generation for tabular data

Language:PythonLicense:NOASSERTIONStargazers:2277Issues:44Issues:1278

awesome-he

✨ Awesome - A curated list of amazing Homomorphic Encryption libraries, software and resources

moode

moOde sources and configs

Language:PHPLicense:GPL-3.0Stargazers:975Issues:33Issues:292

bayeslite

BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.

Language:PythonLicense:Apache-2.0Stargazers:919Issues:59Issues:495
Language:GoLicense:Apache-2.0Stargazers:662Issues:20Issues:13

camilladsp

A flexible cross-platform IIR and FIR engine for crossovers, room correction etc.

Language:RustLicense:GPL-3.0Stargazers:527Issues:24Issues:183

datacompy

Pandas, Polars, and Spark DataFrame comparison for humans and more!

Language:PythonLicense:Apache-2.0Stargazers:458Issues:22Issues:132
Language:PythonLicense:Apache-2.0Stargazers:313Issues:26Issues:46

terraform-controller

Use K8s to Run Terraform

Language:GoLicense:Apache-2.0Stargazers:292Issues:21Issues:36

glow

An open-source toolkit for large-scale genomic analysis

Language:ScalaLicense:Apache-2.0Stargazers:263Issues:19Issues:159

healthcare-data-harmonization

This is an engine that converts data of one structure to another, based on a configuration file which describes how. There is an accompanying syntax to make writing mappings easier and more robust.

Language:JavaLicense:Apache-2.0Stargazers:201Issues:39Issues:31

fbpcf

Private computation framework library allows developers to perform randomized controlled trials, without leaking information about who participated or what action an individual took. It uses secure multiparty computation to guarantee this privacy. It is suitable for conducting A/B testing, or measuring advertising lift and learning the aggregate statistics without sharing information on the individual level.

Language:C++License:MITStargazers:139Issues:28Issues:1

snowflake-kafka-connector

Snowflake Kafka Connector (Sink Connector)

Language:JavaLicense:Apache-2.0Stargazers:135Issues:15Issues:221

bunsen

Explore, transform, and analyze FHIR data with Apache Spark

Language:JavaLicense:Apache-2.0Stargazers:114Issues:24Issues:60

terraform-provider-kafka-connect

Terraform provider for managing Apache Kafka Connect

Language:GoLicense:MITStargazers:107Issues:6Issues:33

talk-kafka-zipkin

Demo material from talk about tracing Kafka-based applications with Zipkin

Language:JavaLicense:MITStargazers:73Issues:5Issues:1

MegaSparkDiff

A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations of possible data sources. Multiple execution modes in multiple environments enable the user to generate a diff report as a Java/Scala-friendly DataFrame or as a file for future use. Comes with out of the box SparkFactory and SparkCompare tools.

Language:ScalaLicense:Apache-2.0Stargazers:49Issues:12Issues:23

lforms-fhir-app

A SMART on FHIR app that uses lforms widget to handle Questionnaire and QuestionnaireResponse

Language:JavaScriptLicense:NOASSERTIONStargazers:44Issues:8Issues:25

kafka-connectors-tests

Test suite for Kafka Connect connectors based on Landoop's Coyote and docker.

Language:DockerfileLicense:NOASSERTIONStargazers:32Issues:32Issues:1

esque

esque - an operational kafka tool.

Language:PythonLicense:MITStargazers:24Issues:8Issues:76

kafka-connect-rabbitmq

Fork of the RabbitMQ Source/Sink for Kafka Connect

Language:JavaLicense:Apache-2.0Stargazers:22Issues:4Issues:5

omoponfhir-main-r4-sql

OMOP v53 on FHIR R4

Language:BatchfileLicense:Apache-2.0Stargazers:18Issues:7Issues:10

kafka-interceptors

Set of interceptors to integrate to your Kafka Clients.

Language:JavaLicense:MITStargazers:9Issues:4Issues:5