anonymousDataSet
Files are as follows:
First Problem
CapitalOneChallengeProblem1.ipynb (explanation of model building process)
output.txt (model prediction for unlabelled regression)
BuildModel.py (build model from command line using the following)
>> python BuildModel.py < codetest_train.txt
fitModel.py (predict the target values from the command line as follows)
>> python fitModel.py < codetest_test.txt
morph_data.py (import of some helper functions)
To use all of this you need several libraries that you may not have installed:
- py-earth (needed for the MARS model)
- xgboost (testing out the xgboost model)
- sklearn (testing out sklearn's linear regression model)
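Before running the scripts, it may help to check which of these libraries are importable. The following is a minimal sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and the import names (in particular `pyearth` for py-earth) are assumptions about how the packages are exposed once installed.

```python
import importlib.util

# Import names assumed for the libraries listed above
# (py-earth is assumed to import as `pyearth`).
REQUIRED_MODULES = ["pyearth", "xgboost", "sklearn"]


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are available.")
```

The check only looks up each module without importing it, so it should be quick to run before BuildModel.py or fitModel.py.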
Second Problem
Babynames.ipynb (work for answering questions and exploring the data)
BabyNames.pdf (write-up explaining answers and explorations)
ken.pdf, karen.pdf, linda.png (images for this write-up)
To run the notebook you need these less common packages:
- fuzzy (for exploratory work)
- graphviz (for graphing the exploratory work)