lenamax2355's starred repositories

cockroach

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

Language: Go | License: NOASSERTION | Stargazers: 29905 | Issues: 694 | Issues: 65467

presto

The official home of the Presto distributed SQL query engine for big data

Language: Java | License: Apache-2.0 | Stargazers: 15926 | Issues: 859 | Issues: 6580

sqlmodel

SQL databases in Python, designed for simplicity, compatibility, and robustness.

Language: Python | License: MIT | Stargazers: 13793 | Issues: 148 | Issues: 333

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java | License: Apache-2.0 | Stargazers: 5335 | Issues: 1166 | Issues: 3214

sqlpad

Web-based SQL editor. Legacy project in maintenance mode.

Language: JavaScript | License: MIT | Stargazers: 5036 | Issues: 157 | Issues: 493

UK_Biobank_GWAS

Overview of the data QC, code, and GWAS summary output from the 2017 UK Biobank data release

trino-python-client

Python client for Trino

Language: Python | License: Apache-2.0 | Stargazers: 323 | Issues: 14 | Issues: 177

trino-the-definitive-guide

Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)

License: Apache-2.0 | Stargazers: 204 | Issues: 11 | Issues: 0

OCT-Converter

Tools for extracting the raw optical coherence tomography (OCT) and fundus data from proprietary file formats.

Language: Python | License: MIT | Stargazers: 195 | Issues: 18 | Issues: 75

Frank-Kanes-Taming-Big-Data-with-Apache-Spark-and-Python

Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt

Language: Python | License: MIT | Stargazers: 118 | Issues: 16 | Issues: 1

tofu

Tofu is a Python tool for generating synthetic UK Biobank data.

ukbiobank-resources

A curated list for preprocessing, cleaning, mapping and analyzing UK Biobank data.

ukbrest

ukbREST: efficient and streamlined data access for reproducible research of large biobanks

Language: Python | License: MIT | Stargazers: 38 | Issues: 6 | Issues: 6

ukbb_parser

A Python module for loading phenotypic and genetic data from the UK Biobank.

Language: Jupyter Notebook | Stargazers: 33 | Issues: 1 | Issues: 4

ehr-rwe

Weak supervision methods for extracting real world evidence from EHRs

Language: Python | License: Apache-2.0 | Stargazers: 32 | Issues: 7 | Issues: 1

BiobankRead-Bash

Python scripts to extract and pre-process UKB data

Language: Python | License: GPL-3.0 | Stargazers: 30 | Issues: 3 | Issues: 12

omop-etl

ETL Tool for converting datasets to OMOP CDM

Language: Python | License: GPL-3.0 | Stargazers: 28 | Issues: 2 | Issues: 1

phemap

Functions to map between ICD-10 terms and PheCodes for UK Biobank hospital electronic health records

Language: Python | License: Apache-2.0 | Stargazers: 28 | Issues: 1 | Issues: 1

ukbschemas

Use R to generate a database containing the UK Biobank data schemas from http://biobank.ctsu.ox.ac.uk/crystal/schema.cgi

Language: R | License: NOASSERTION | Stargazers: 20 | Issues: 5 | Issues: 21

BiobankRead

Python for UK Biobank data analysis. Author: saphir746

Language: Python | License: Apache-2.0 | Stargazers: 14 | Issues: 2 | Issues: 0

Genomic-Data-Science-Specialization

The Genomic Data Science Specialization teaches genomic data science, including Python, R, Bioconductor, and Galaxy. It is offered on Coursera by Johns Hopkins University.

ETL-Synthea-Python

ETL from Synthea to OMOP format using python pandas

Language: Python | License: Apache-2.0 | Stargazers: 8 | Issues: 4 | Issues: 0

datalad-ukbiobank

Resources for working with UKBiobank as a DataLad dataset

Language: Python | License: MIT | Stargazers: 6 | Issues: 6 | Issues: 54

omero.biobank

Biobank: large scale biomedical computation on OMERO

Language: Python | License: GPL-2.0 | Stargazers: 5 | Issues: 11 | Issues: 0
Language: Jupyter Notebook | Stargazers: 5 | Issues: 3 | Issues: 0

ohdsi-etl-abucasis

Transformation scripts for Inclivia's Abucasis EHR data | BigData@Heart consortium

Time-Series-Prediction

Solve time series and forecasting problems in TensorFlow, prepare data for time series learning using best practices, explore how RNNs and ConvNets can be used for predictions, and build a sunspot prediction model using real-world data.

Language: Jupyter Notebook | Stargazers: 4 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0

nextstrain-hcov-19

Genomic Epidemiology of Novel Coronavirus (COVID-19) | Nextstrain

Language: Shell | Stargazers: 1 | Issues: 8 | Issues: 0