Win Vector LLC (WinVector)

Expert data science training and consulting.

Location: San Francisco, California

Home Page: http://www.win-vector.com/

Win Vector LLC's repositories

vtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.

Language: HTML | License: NOASSERTION | Stargazers: 281 | Issues: 23 | Issues: 23

Examples

Various examples for different articles

wrapr

Wrap R for Sweet R Code

Language: R | License: NOASSERTION | Stargazers: 135 | Issues: 8 | Issues: 15

PDSwR2

Code, Data, and Examples for Practical Data Science with R 2nd edition (Nina Zumel and John Mount) https://github.com/WinVector/PDSwR2

Language: HTML | License: NOASSERTION | Stargazers: 131 | Issues: 18 | Issues: 5

pyvtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.

Language: Python | License: NOASSERTION | Stargazers: 114 | Issues: 9 | Issues: 20

data_algebra

Codd method-chained SQL generator and Pandas data processing in Python.

Language: Python | License: BSD-3-Clause | Stargazers: 113 | Issues: 10 | Issues: 3

rquery

Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.

Language: HTML | License: NOASSERTION | Stargazers: 108 | Issues: 17 | Issues: 16

WVPlots

Pre-packaged plots in R

Language: R | License: NOASSERTION | Stargazers: 84 | Issues: 13 | Issues: 4

replyr

Patches for using dplyr with Databases and Big Data

Language: HTML | License: NOASSERTION | Stargazers: 66 | Issues: 14 | Issues: 10

seplyr

Improved Standard Evaluation Interfaces for Common Data Manipulation Tasks

Language: R | License: NOASSERTION | Stargazers: 48 | Issues: 10 | Issues: 4

cdata

Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.

Language: R | License: NOASSERTION | Stargazers: 43 | Issues: 7 | Issues: 7

rqdatatable

Implement the rquery piped query algebra in R using data.table. Distributed under choice of GPL-2 or GPL-3 license.

Language: R | License: NOASSERTION | Stargazers: 37 | Issues: 0 | Issues: 0

Logistic

Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimization, L2 regularization and more.

addinexamplesWV

Add-ins and keyboard shortcuts for building calculation pipelines in R

Language: R | License: NOASSERTION | Stargazers: 32 | Issues: 6 | Issues: 0

sigr

Concise formatting of significances in R (GPL3 license).

Language: HTML | License: NOASSERTION | Stargazers: 27 | Issues: 9 | Issues: 2

ExploreModels

Code and data for "The Geometry of Classifiers"

Language: R | License: GPL-3.0 | Stargazers: 26 | Issues: 0 | Issues: 0

WinVector.github.io

Viewable pages from WinVector LLC; view at: http://winvector.github.io

Language: HTML | Stargazers: 23 | Issues: 0 | Issues: 0

WVLPSolver

Experimental pure Java revised simplex linear program solver (Apache 2.0 license)

Language: Java | Stargazers: 15 | Issues: 9 | Issues: 0

Locality-Sensitive-Hashing-Example

Simple example of Locality Sensitive Hashing

Language: Java | Stargazers: 14 | Issues: 7 | Issues: 0

RcppDynProg

Dynamic Programming implemented in Rcpp. Includes example partition and out-of-sample fitting applications.

Language: C++ | License: NOASSERTION | Stargazers: 14 | Issues: 6 | Issues: 0

wvpy

Tools to convert Jupyter notebooks to and from Python .py files, and to render them.

Language: HTML | License: NOASSERTION | Stargazers: 8 | Issues: 0 | Issues: 0

ExampleRPackage

Example of how to build a simple R package

Language: R | Stargazers: 2 | Issues: 4 | Issues: 0

Importance-Sampling

Importance Sampling Example

Language: Java | Stargazers: 2 | Issues: 0 | Issues: 0

LStep

Trivial demonstration of a diverging Newton-Raphson step when solving a logistic regression

Language: Java | License: NOASSERTION | Stargazers: 2 | Issues: 5 | Issues: 0

OutOfCore

Example of out-of-core coding techniques

Language: Java | Stargazers: 2 | Issues: 6 | Issues: 0

ATasteOfDataScience

Working an example of supervised machine learning in Python

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1 | Issues: 4 | Issues: 0

ExperimentInspector

Java code to build synthetic data sets that match reported summary totals. Helps explore possible range of variation.

Language: Java | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

SessionExample

Example code for articles on sessionizing data.

License: GPL-2.0 | Stargazers: 1 | Issues: 4 | Issues: 0

wvu

Win Vector LLC Python data science teaching tools (graphs and data manipulation)

Language: HTML | License: NOASSERTION | Stargazers: 1 | Issues: 3 | Issues: 0

TypicalityCoding

Simple example of how to use an embedding plus sphering/whitening transform to measure difference in distribution.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 3 | Issues: 0