Manuel Amunategui (amunategui)

amunategui

Geek Repo

Company:VP, SpringML

Home Page:https://amunategui.github.io

Github PK Tool:Github PK Tool

Manuel Amunategui's repositories

BetterCrossValidation

Companion code for YouTube video: https://www.youtube.com/watch?v=Og7CGAfSr_Y&feature=youtu.be

Read-and-Process-Files-Larger-Than-RAM

Using the function read.table() to break file into chunks to loop and process them. This allows processing files of any size beyond what the machine's RAM can handle. Companion code for youtube: https://www.youtube.com/watch?v=Z5rMrI1e4kM

Language:RStargazers:11Issues:0Issues:0

SimpleEnsembleBlending

Ensemble/Blender example in R using Caret (companion code for YouTube video: https://www.youtube.com/watch?v=k7sTiTWWCXM)

Language:RStargazers:11Issues:0Issues:0

SMOTE-Oversample-Rare-Events

Youtube companion (https://www.youtube.com/watch?v=1Mt7EuVJf1A&feature=youtu.be) - Brief introduction to the SMOTE R package to super-sample/ over-sample imbalanced data sets. SMOTE will use bootstrapping and k nearest neighbor to synthetically create additional observations. SMOTE white paper: https://www.jair.org/media/953/live-953-2037-jair.pdf

Language:RStargazers:11Issues:0Issues:0

pca-dimension-reduction

Reducing High Dimensional Data with Principle Component Analysis (PCA)

Language:RStargazers:7Issues:0Issues:0

FSCARET-Feature-Selection-On-Steroids

Code for Youtube video: https://www.youtube.com/watch?v=dTRDZBltCTg&list=UUq4pm1i_VZqxKVVOz5qRBIA&index=1

Language:RStargazers:3Issues:0Issues:0
Language:RStargazers:2Issues:0Issues:0

MultinomWalkThru

Companion code for YouTube video: https://www.youtube.com/watch?v=zTlbMHw9CeY&feature=youtu.be

Language:RStargazers:2Issues:0Issues:0

Walkthrough-of-the-dummyVars-Function

Shows how to transform categorical and textual data into dummy variables using Caret's dummyVar function. Code for YouTube presention: https://www.youtube.com/watch?v=7rgzCjrIA-o

Language:RStargazers:2Issues:0Issues:0
Stargazers:1Issues:0Issues:0

CSV2Vowpal

My take on the fastest possible translator for CSV to VW (vowpal wabbit) files using R. This is a work in progress

Language:RStargazers:1Issues:0Issues:0

dummyfier

Function wrapper for Caret's dummyVars to quickly break out every factor for a given data.frame column. Handles one column formula on 2 or more levels and returns altered data.frame with new columns minus original.

Language:RStargazers:1Issues:0Issues:0

excel-geocoding-tool

Easy to use Geocoding tool for Excel.

Stargazers:1Issues:0Issues:0

ExData_Plotting1

Plotting Assignment 1 for Exploratory Data Analysis

Stargazers:1Issues:0Issues:0

GettingAndCleaningData

Getting and Cleaning Data - John Hopkins

Stargazers:1Issues:0Issues:0

greekmedia

Kaggle Competion - Multi-label prediction

Language:PythonStargazers:1Issues:0Issues:0

pdfkit

A JavaScript PDF generation library for Node and the browser

License:MITStargazers:1Issues:0Issues:0

phraug

A set of simple Python scripts for pre-processing large files

License:BSD-2-ClauseStargazers:1Issues:0Issues:0

PracticalMachineLearningProject

Coursera Practical Machine Learning Rep.

Stargazers:1Issues:0Issues:0

ProgrammingAssignment2

Repository for Programming Assignment 2 for R Programming on Coursera

Language:RStargazers:1Issues:0Issues:0

RepData_PeerAssessment1

Peer Assessment 1 for Reproducible Research

Stargazers:1Issues:0Issues:0

Sparse-Matrices-And-GLMNET-Demo

Walk-through of sparse matrices in R and basic use of them in GLMNET. Companion code for Youtube talk: https://www.youtube.com/watch?v=Ysh2gs8VKvQ

Language:RStargazers:1Issues:0Issues:0

vowpal_wabbit

John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm

License:NOASSERTIONStargazers:1Issues:0Issues:0

xgboost

eXtreme Gradient Boosting (Tree) Library

Language:C++License:NOASSERTIONStargazers:1Issues:0Issues:0

ZipToZipInMiles

(R) Calculates distance in miles between two US zip codes using the R language

Language:RStargazers:1Issues:0Issues:0
Stargazers:0Issues:0Issues:0