John St. John (jstjohn)

jstjohn

Geek Repo

Company:@NVIDIA

Location:Santa Clara, CA

Home Page:https://www.linkedin.com/in/johnstjohn/

Github PK Tool:Github PK Tool

John St. John's repositories

SimSeq

An illumina paired-end and mate-pair short read simulator. This project attempts to model as many of the quirks that exist in Illumina data as possible. Some of these quirks include the potential for chimeric reads, and non-biotinylated fragment pull down in mate-pair libraries . Additionally the program provides the ability to model both site and base specific error, and scripts are provided to train this error model on real datasets. My hope in creating this program is to generate as realistic data as possible to assist in assessing the accuracy of genome assembly tools.

Language:CLicense:NOASSERTIONStargazers:66Issues:6Issues:11

KentLib

Subset of the kent source libraries (perhaps out of date) that are easily built and installed on OSX and Linux. These libraries provide usefull utilities for bioinformatics programming in C. This may contain some of my own libraries for bioinformatics utilities as well as long as they install easily on both my mac and linux box.

GrNMF

An R/Rcpp/RcppArmadillo implementation of Deng Cai's Non-negative Matrix Factorization on Manifold, sometimes called GNMF or GrNMF.

Language:RLicense:NOASSERTIONStargazers:5Issues:2Issues:1

Jellyfish

Fork of the jellyfish kmer counter. Here is the description copied from their site: JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers using an order of magnitude less memory and an order of magnitude faster than other k-mer counting packages by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism. JELLYFISH is a command-line program that reads FASTA and multi-FASTA files containing DNA sequences. It outputs its k-mer counts in an binary format, which can be translated into a human-readable text format using the "jellyfish stats" command. See the documentation below for more details.

Language:C++License:GPL-3.0Stargazers:5Issues:6Issues:2

re-pair

Program to re-do the pairing of fastq reads. This program is modified from http://code.google.com/p/ngopt/source/browse/trunk/tools/pair_reads/repair.cpp?r=85

Language:C++Stargazers:5Issues:2Issues:0

cscripts

Miscalanious C and C++ scripts.

Language:CStargazers:2Issues:2Issues:0

mia

MIA is an reference guided assembler for DNA reads as generated by recent sequencing technologies. It is designed to support unusual short reads like they come from ancient, fragmented DNA.

Language:CLicense:Artistic-2.0Stargazers:2Issues:2Issues:0

gatk

GATK Official Release Repository

Language:JavaLicense:MITStargazers:1Issues:2Issues:0

homebrew

The missing package manager for OS X.

Language:RubyStargazers:1Issues:2Issues:0

KinectOrbit

A fork of http://www.arduinoandkinectprojects.com/kinectorbit with an attempt at upgrading to Processing v2.

adam

A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

amazonaccess

Amazon Employee Access Challenge

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

cloneHD

High-definition reconstruction of clonal composition from next-generation sequencing data

Language:C++License:GPL-3.0Stargazers:0Issues:2Issues:0

flickribbon

Automatically exported from code.google.com/p/flickribbon

Stargazers:0Issues:1Issues:10

fonolo4android

Automatically exported from code.google.com/p/fonolo4android

Language:JavaStargazers:0Issues:1Issues:3

genefamilyfinder

Automatically exported from code.google.com/p/genefamilyfinder

Language:RubyStargazers:0Issues:1Issues:0

ggplot

ggplot for python

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:2Issues:0

puppet-playground

A Vagrant MultiOS environment to test Puppet code and modules

Language:PuppetStargazers:0Issues:2Issues:0
Language:RubyStargazers:0Issues:2Issues:0

seizure-detection

Kaggle competition winning submission for the UPenn and Mayo Clinic's Seizure Detection Challenge

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

StarCluster

StarCluster is a utility for creating and managing computing clusters hosted on Amazon's Elastic Compute Cloud (EC2).

Language:PythonLicense:LGPL-3.0Stargazers:0Issues:2Issues:0

textmate

TextMate is a graphical text editor for OS X 10.7+

Language:C++License:GPL-3.0Stargazers:0Issues:2Issues:0