drhchen's starred repositories

gff3toembl

Converts Prokka GFF3 files to EMBL files for uploading annotated assemblies to EBI

Language:PythonLicense:NOASSERTIONStargazers:29Issues:0Issues:0

ProNA2020

ProNA2020: System predicting protein-DNA, protein-RNA and protein-protein binding sites from sequence

Language:PythonStargazers:14Issues:0Issues:0

pydca

Direct coupling analysis software for protein and RNA sequences

Language:PythonLicense:MITStargazers:45Issues:0Issues:0

Predicting-ICU-Patient-Clinical-Deterioration---Report

For this project, I used publicly available Electronic Health Records (EHRs) datasets. The MIT Media Lab for Computational Physiology has developed MIMIC-IIIv1.4 dataset based on 46,520 patients who stayed in critical care units of the Beth Israel Deaconess Medical Center of Boston between 2001 and 2012. MIMIC-IIIv1.4 dataset is freely available to researchers across the world. A formal request should be made directly to www.mimic.physionet.org, to gain access to the data. There is a required course on human research ‘Data or Specimens Only Research’ prior to data access request. I have secured one here -www.citiprogram.org/verify/?kb6607b78-5821-4de5-8cad-daf929f7fbbf-33486907. We built flexible and better performing model using the same 17 variables used in the SAPS II severity prediction model. The question ‘Can we improve the prediction performance of widely used severity scores using a more flexible model?’ is the central question of our project. I used the exact 17 variables used to develop the SAPS II severity prediction algorithm. These are 13 physiological variables, three underlying (chronic) disease variables and one admission variable. The physiological variables includes demographic (age), vital (Glasgow Comma Scale, systolic blood pressure, Oxygenation, Renal, White blood cells count, serum bicarbonate level, blood sodium level, blood potassium level, and blood bilirubin level). The three underlying disease variables includes Acquired Immunodeficiency Syndrome (AIDS), metastatic cancer, and hematologic malignancy. Finally, whether admission was scheduled surgical or unscheduled surgical was included in the model. The dataset has 26 relational tables including patient’s hospital admission, callout information when patient was ready for discharge, caregiver information, electronic charted events including vital signs and any additional information relevant to patient care, patient demographic data, list of services the patient was admitted or transferred under, ICU stay types, diagnoses types, laboratory measurments, microbiology tests and sensitivity, prescription data and billing information. Although I have full access to the MIMIC-IIIv1.4 datasets, I can not share any part of the data publicly. If you are interested to learn more about the data, there is a MIMIC III Demo dataset based on 100 patients https://mimic.physionet.org/gettingstarted/demo/. If you are interested to requesting access to the data - https://mimic.physionet.org/gettingstarted/access/. Linked repositories: Exploratory-Data-Analysis-Clinical-Deterioration, Data-Wrangling-MIMICIII-Database, Clinical-Deterioration-Prediction-Model--Inferential-Statistics, Clinical-Deterioration-Prediction-Model--Ensemble-Algorithms-, Clinical-Deterioration-Prediction-Model--Logistic-Regression, Clinical-Deterioration-Prediction-Model---KNN © 2020 GitHub, Inc.

Language:Jupyter NotebookStargazers:17Issues:0Issues:0

breast-cancer-gene-expression

This project aims to predict people who will survive breast cancer using machine learning models with the help of clinical data and gene expression profiles of the patients.

Language:Jupyter NotebookStargazers:38Issues:0Issues:0

autoprognosis

A system for automating the design of predictive modeling pipelines tailored for clinical prognosis.

Language:PythonLicense:Apache-2.0Stargazers:97Issues:0Issues:0

BacterialDuplicates

Identification of putative duplicated genes among bacterial genomes

Language:PerlLicense:GPL-3.0Stargazers:4Issues:0Issues:0

asap

A scalable bacterial genome assembly, annotation and analysis pipeline

Language:GroovyLicense:GPL-3.0Stargazers:66Issues:0Issues:0

SCORE

SCORE - a consensus tool for detecting differentially expressed genes in bacteria

Language:RLicense:GPL-3.0Stargazers:5Issues:0Issues:0

phastaf

Identify phage regions in bacterial genomes for masking purposes

Language:PerlLicense:GPL-3.0Stargazers:30Issues:0Issues:0

BIGSdb

Bacterial Isolate Genome Sequence Database (BIGSdb): A platform for gene-by-gene bacterial population annotation and analysis.

Language:PerlLicense:NOASSERTIONStargazers:56Issues:0Issues:0

GenoVi

GenoVi, an automated customizable circular genome visualizer for bacteria and archaea

Language:PythonLicense:NOASSERTIONStargazers:73Issues:0Issues:0

PPanGGOLiN

Build a partitioned pangenome graph from microbial genomes

Language:PythonLicense:NOASSERTIONStargazers:229Issues:0Issues:0

PopPUNK

PopPUNK 👨‍🎤 (POPulation Partitioning Using Nucleotide Kmers)

Language:PythonLicense:Apache-2.0Stargazers:88Issues:0Issues:0

piggy

Pipeline for analysing intergenic regions in bacteria

Language:PerlLicense:GPL-3.0Stargazers:36Issues:0Issues:0

AMRprediction_HomologyModeling_Docking

Using Machine learning framework to predict putative novel AMR genes and molecular docking studies to study the interaction between antibiotics

Language:Jupyter NotebookStargazers:3Issues:0Issues:0

PanPred

A library of Machine Learning tools to predict antibiotic resistance from NGS data

Language:CStargazers:13Issues:0Issues:0

scatterpie

:art: scatter pie plot

Language:RStargazers:57Issues:0Issues:0

autoprokka

A script that automatically invokes prokka on a directory of FASTA genomes and neatly organises the output.

Language:PythonLicense:Apache-2.0Stargazers:7Issues:0Issues:0

autosnippy

A snakemake script to run core snp phylogeny to a batch of fasta files using snippy.

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

Assembly-Dereplicator

A tool for removing redundant genomes from a set of assemblies

Language:PythonLicense:GPL-3.0Stargazers:64Issues:0Issues:0

pyseer

SEER, reimplemented in python 🐍🔮

Language:PythonLicense:Apache-2.0Stargazers:106Issues:0Issues:0

nullarbor

:floppy_disk: :page_with_curl: "Reads to report" for public health and clinical microbiology

Language:PerlLicense:GPL-2.0Stargazers:131Issues:0Issues:0

rpv

Microbial resistance, plasmid and virulence genes identification

Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0

ResBlaster

find antimicrobial resistance/ virulence genes in assembled genomes

Language:PythonStargazers:5Issues:0Issues:0

BactDating

Bayesian inference of ancestral dates on bacterial phylogenetic trees

Language:RLicense:MITStargazers:79Issues:0Issues:0

BioinformaticsTools

A set of tools for common bioinformatics projects

Language:PythonStargazers:1Issues:0Issues:0

Bioinformatic_Scripts

common bioinformatic scripts

Language:PerlStargazers:14Issues:0Issues:0