Win Vector LLC (WinVector)

Win Vector LLC

WinVector

Organization data from Github https://github.com/WinVector

Expert data science training and consulting.

Location:San Francisco, California

Home Page:http://www.win-vector.com/

GitHub:@WinVector

Win Vector LLC's repositories

vtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.

Language:HTMLLicense:NOASSERTIONStargazers:286Issues:22Issues:23

Examples

Various examples for different articles

Language:Jupyter NotebookStargazers:184Issues:17Issues:1

wrapr

Wrap R for Sweet R Code

Language:RLicense:NOASSERTIONStargazers:138Issues:8Issues:15

PDSwR2

Code, Data, and Examples for Practical Data Science with R 2nd edition (Nina Zumel and John Mount) https://github.com/WinVector/PDSwR2

Language:HTMLLicense:NOASSERTIONStargazers:137Issues:17Issues:5

pyvtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.

Language:PythonLicense:NOASSERTIONStargazers:119Issues:9Issues:20

data_algebra

Codd method-chained SQL generator and Pandas data processing in Python.

Language:PythonLicense:BSD-3-ClauseStargazers:118Issues:9Issues:3

rquery

Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.

Language:HTMLLicense:NOASSERTIONStargazers:110Issues:17Issues:16

WVPlots

Pre-packaged plots in R

Language:RLicense:NOASSERTIONStargazers:85Issues:13Issues:5

replyr

Patches for using dplyr with Databases and Big Data

Language:HTMLLicense:NOASSERTIONStargazers:67Issues:12Issues:10

seplyr

Improved Standard Evaluation Interfaces for Common Data Manipulation Tasks

Language:RLicense:NOASSERTIONStargazers:51Issues:9Issues:4

cdata

Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.

Language:RLicense:NOASSERTIONStargazers:45Issues:7Issues:7

rqdatatable

Implement the rquery piped query algebra in R using data.table. Distributed under choice of GPL-2 or GPL-3 license.

Language:RLicense:NOASSERTIONStargazers:38Issues:9Issues:5

Logistic

Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimization, L2 regularization and more.

addinexamplesWV

Ad-ins and keyboard shortcuts for building calculation pipelines in R

Language:RLicense:NOASSERTIONStargazers:34Issues:4Issues:0

sigr

Concise formatting of significances in R (GPL3 license).

Language:HTMLLicense:NOASSERTIONStargazers:28Issues:7Issues:2

ExploreModels

Code and data for "The Geometry of Classifiers"

Language:RLicense:GPL-3.0Stargazers:26Issues:8Issues:0

WinVector.github.io

Viewable pages from WinVector LLC view at: http://winvector.github.io

Language:HTMLStargazers:23Issues:12Issues:0

RcppDynProg

Dynamic Programming implemented in Rcpp. Includes example partition and out of sample fitting applications.

Language:C++License:NOASSERTIONStargazers:15Issues:6Issues:0

WVLPSolver

Experimental pure Java revised simplex linear program solver (Apache 2.0 license)

Language:JavaStargazers:15Issues:8Issues:0

Locality-Sensitive-Hashing-Example

Simple example of Locality Sensitive Hashing

Language:JavaStargazers:14Issues:6Issues:0

wvpy

Tools to convert from Jupyter notebooks to and from Python .py files, and render.

Language:HTMLLicense:NOASSERTIONStargazers:10Issues:3Issues:0

ATasteOfDataScience

Working an example of supervised machine learning in Python

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2Issues:3Issues:0

ExampleRPackage

Example of how to build a simple R package

Language:RStargazers:2Issues:3Issues:0

Importance-Sampling

Importance Sampling Example

Language:JavaStargazers:2Issues:6Issues:0

LStep

Trivial demonstration of a diverging Newton-Raphson step when solving a logistic regression

Language:JavaLicense:NOASSERTIONStargazers:2Issues:5Issues:0

OutOfCore

Example of out of core coding techniques

Language:JavaStargazers:2Issues:5Issues:0

ExperimentInspector

Java code to build synthetic data sets that match reported summary totals. Helps explore possible range of variation.

Language:JavaLicense:NOASSERTIONStargazers:1Issues:4Issues:0

wvu

Win Vector LLC Python data science teaching tools (graphs and data manipulation)

Language:HTMLLicense:NOASSERTIONStargazers:1Issues:3Issues:0

TypicalityCoding

Simple example of how to use an embedding plus sphering/whitening transform to measure difference in distribution.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:3Issues:0

WVExamples

Win Vector technical articles and example code

Language:HTMLStargazers:0Issues:0Issues:0