What will it take to get this working on regression problems as well?

ClimbsRocks opened this issue 7 years ago

Looks like an awesome project!

We've found a fun use case for deep learning for regression problems with auto_ml. We train the neural network, then instead of getting its output from the final layer (just a linear model), we get the features it learned in its penultimate layer, then feed those into a gradient boosted model (which is typically better at turning features into predictions than a linear model).

Your library looks like a great way to optimize the deep learning model.

Any thoughts on what it would take to get regression models supported?

Joe Davison · Answer 1 · Thu Jun 08 2017 05:44:15 GMT+0800 (China Standard Time)

Thanks! That's definitely something that I've wanted to get added. I think the thing we could do that would make the most sense would be to allow the user the option to specify their own output layer. That's not something that the GP can evolve anyway, and that way the user has 100% control over the types of outputs they can have, allowing them to eventually use DEvol on any kind of deep net - regressors, autoencoders, potentially DQNs, etc. Obviously some more work would have to be done on layer parameters to make that work right, but I think it would at least allow for regression vs. classification as you've brought up.

Preston Parry · Answer 2 · Thu Jun 08 2017 06:07:14 GMT+0800 (China Standard Time)

Great idea!

Elegant solution to many problems. Doesn't add much (any?) complexity to the project itself, while opening up tons of flexibility for users.

Big fan of this idea.

Preston Parry · Answer 3 · Tue Jun 13 2017 08:09:23 GMT+0800 (China Standard Time)

Any quick thoughts on what it will take to implement this @joeddav ?

Joe Davison · Answer 4 · Wed Jun 14 2017 08:09:01 GMT+0800 (China Standard Time)

It should be very straightforward. Just pass in the output layer to the GenomeHandler initializer and then at the end of the decode method add that instead of the softmax. You could probably do it in 2 minutes on your own branch - we just would have to update the readme, demos, etc. before merging to master.
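The change described above could look roughly like the sketch below. It is not DEvol's actual code: `SketchGenomeHandler`, its constructor arguments, and the dict-based layer descriptions are hypothetical stand-ins for the real `GenomeHandler` and for Keras layers. They are used only to show where a user-supplied output layer would take the place of the hard-coded softmax at the end of `decode`.

```python
class SketchGenomeHandler:
    """Hypothetical stand-in for a genome handler; layers are plain dicts."""

    def __init__(self, n_classes=None, output_layer=None):
        # With no user-supplied output layer, fall back to a softmax
        # classification head, which needs the number of classes.
        if output_layer is None:
            if n_classes is None:
                raise ValueError("provide n_classes or output_layer")
            output_layer = {
                "type": "dense",
                "units": n_classes,
                "activation": "softmax",
            }
        self.output_layer = output_layer

    def decode(self, genome):
        """Turn a genome (here: a list of hidden-layer widths) into layers."""
        layers = [
            {"type": "dense", "units": units, "activation": "relu"}
            for units in genome
        ]
        # The evolved part ends here; the head is whatever the user supplied.
        layers.append(dict(self.output_layer))
        return layers


# Default behaviour: a 10-class softmax head.
classifier = SketchGenomeHandler(n_classes=10).decode([64, 32])

# Regression: the user passes a single linear output unit instead.
regressor = SketchGenomeHandler(
    output_layer={"type": "dense", "units": 1, "activation": "linear"}
).decode([64, 32])

print(classifier[-1])
print(regressor[-1])
```

Passing the head in as a parameter keeps the evolved part of the network untouched, which is why the same idea would also cover other output types.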

Preston Parry · Answer 5 · Fri Jun 16 2017 05:47:22 GMT+0800 (China Standard Time)

That's exactly the response I was hoping for.

Would we have to modify the scoring functions at all?

Joe Davison · Answer 6 · Fri Jun 16 2017 22:41:06 GMT+0800 (China Standard Time)

Mmm, I'm not sure, as I've never done a regression problem with Keras. I would guess that you'd just need to use loss instead of accuracy as your metric and you'd be fine, but you may have to experiment.
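One practical consequence of switching the metric is that the direction of optimization flips: accuracy is maximized, while loss is minimized. A minimal sketch of that selection step follows. `pick_best` and the result values are illustrative assumptions, not part of DEvol or Keras, and the numbers are made up for the example.

```python
def pick_best(results, metric):
    """Return the name of the best candidate for the given metric."""
    if metric == "accuracy":
        # Higher accuracy is better.
        return max(results, key=lambda name: results[name][metric])
    if metric == "loss":
        # Lower loss is better.
        return min(results, key=lambda name: results[name][metric])
    raise ValueError(f"unknown metric: {metric!r}")


# Made-up validation losses for three candidate networks.
regression_results = {
    "genome_a": {"loss": 4.0},
    "genome_b": {"loss": 1.5},
    "genome_c": {"loss": 2.25},
}

print(pick_best(regression_results, "loss"))  # genome_b
```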

Arinc Elhan · Answer 7 · Fri Jun 23 2017 06:36:33 GMT+0800 (China Standard Time)

Hi there,

First of all, regression tasks should expect only one output class, so change the output layer's class number to 1. Second, minimum squared error should be used when dealing with regression tasks, and in Keras it is already implemented under losses with the name 'mse'. Finally, as @joeddav mentioned, using loss instead of accuracy is much more suitable.

Joe Davison · Answer 8 · Tue Jun 27 2017 01:16:16 GMT+0800 (China Standard Time)

Right, so I guess the user also needs to control the parameters of the compile function in addition to the output layer, or at least just the loss function (also, small typo: it's mean squared error). They should probably have access to them either way.
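For reference, mean squared error is the average of the squared differences between targets and predictions. The plain-Python sketch below computes it by hand and shows one hypothetical way to bundle compile settings per task. The dict keys mirror the `loss` and `metrics` keyword arguments of Keras's `compile`, but the variable names are illustrative and not DEvol parameters.

```python
def mean_squared_error(y_true, y_pred):
    """Average of squared differences between targets and predictions."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical per-task compile settings a user might pass through.
classification_compile = {
    "loss": "categorical_crossentropy",
    "metrics": ["accuracy"],
}
regression_compile = {"loss": "mse", "metrics": []}

# Small hand-checkable example.
targets = [3.0, -0.5, 2.0, 7.0]
predictions = [2.5, 0.0, 2.0, 8.0]
print(mean_squared_error(targets, predictions))  # 0.375
```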

charlie · Answer 9 · Thu Jul 06 2017 23:12:36 GMT+0800 (China Standard Time)

Any movement on this? If not, I can attempt it.

Joe Davison · Answer 10 · Fri Jul 07 2017 04:37:57 GMT+0800 (China Standard Time)

I haven't myself - I would go for it unless @ClimbsRocks has.

charlie · Answer 11 · Fri Jul 07 2017 05:16:52 GMT+0800 (China Standard Time)

I'll get started. I know we will want to add LSTM layers. For coding style, should I just subclass GenomeHandler, call it RegressionGenomeHandler, and add the new regression-specific things?

Joe Davison · Answer 12 · Fri Jul 07 2017 07:06:14 GMT+0800 (China Standard Time)

That might be a good approach, depending on exactly how you're doing it. It may be easier to just pass it as a parameter, but your way would be cleaner in the long run. If I were doing it I would probably just extend the current functionality to allow the user to pass an output layer of their own choosing.

Preston Parry · Answer 13 · Wed Jul 12 2017 02:18:58 GMT+0800 (China Standard Time)

@qorrect thanks for taking this on! Any updates?

charlie · Answer 14 · Sat Jul 15 2017 07:43:56 GMT+0800 (China Standard Time)

No, I'm stumbling on the genome layout. I'm not quite sure how to calculate the offsets given the new parameters I introduced. Let me dig it up and post the specific problem.

Joe Davison · Answer 15 · Wed Jul 19 2017 00:17:58 GMT+0800 (China Standard Time)

I have a simple PR I can commit tonight once I work out a few kinks. It won't do anything with added LSTM layers, just allow for regression, but that should probably be separate anyway.

Preston Parry · Answer 16 · Wed Jul 19 2017 02:16:11 GMT+0800 (China Standard Time)

I like simple PRs, and iterative improvements that build on each other. Thanks for adding that in! Looking forward to trying it out soon.

Manuel Garrido · Answer 17 · Tue Jan 09 2018 07:16:10 GMT+0800 (China Standard Time)

Any progress on this, guys?

nforck · Answer 18 · Tue Jan 16 2018 00:37:42 GMT+0800 (China Standard Time)

Hi, I would also be interested if there has been any progress.

sohrabtowfighi · Answer 19 · Sun Jul 15 2018 04:43:51 GMT+0800 (China Standard Time)

@joeddav, could you commit the PR you wrote up?

Danilo Reinert · Answer 20 · Fri Feb 22 2019 21:07:24 GMT+0800 (China Standard Time)

Passing by to check if there are any plans to add both Regression and LSTM support ... ?

Manuel Garrido · Answer 21 · Fri Feb 22 2019 21:58:17 GMT+0800 (China Standard Time)

@reinert I think this project is dead.

Gavin Fernandes · Answer 22 · Fri Feb 22 2019 22:32:05 GMT+0800 (China Standard Time)

@reinert You could probably check out TPOT. It seems like it has a similar purpose and I think it supports regression, but I'm not sure.

Manuel Garrido · Answer 23 · Thu Feb 28 2019 18:22:37 GMT+0800 (China Standard Time)

@gavinfernandes2012 yeah tpot is a much more mature project.

Joe Davison · Answer 24 · Thu Feb 28 2019 21:55:49 GMT+0800 (China Standard Time)

As I've said, I'm not presently able to actively work on this project, so short of occasional PRs from others, it is not being actively developed.

Check out TPOT for genetic optimization. If you're just looking for hyperparameter optimization (i.e. not specifically genetic), there are numerous other algorithms and open source projects to look at, like Google Vizier. Bayesian optimization is typically a better approach in any case, since it tends to need fewer evaluations, which matters when each one is expensive. Many of these other tools only tune hyperparameters and not architectures (adding and removing layers) as this one was intended to, but many accept a generic black-box function that you can define any way you'd like.
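To make the "black box function" idea concrete, here is a minimal sketch in which the optimizer only ever sees a function from a parameter dict to a score. Random search is used as the simplest possible stand-in; it is not Bayesian optimization and not the API of Vizier or any other tool. The `objective` is a toy formula standing in for "train a model and return its validation loss", and all names and ranges are assumptions.

```python
import math
import random


def objective(params):
    """Toy stand-in for training a model and returning validation loss."""
    lr_term = (math.log10(params["learning_rate"]) + 3) ** 2
    depth_term = (params["n_layers"] - 3) ** 2
    return lr_term + depth_term


def random_search(black_box, n_trials, seed=0):
    """Sample parameters at random and keep the lowest-scoring trial."""
    rng = random.Random(seed)
    trials = []
    for _ in range(n_trials):
        params = {
            # Log-uniform learning rate between 1e-5 and 1e-1.
            "learning_rate": 10 ** rng.uniform(-5, -1),
            # Architecture choice: number of layers between 1 and 6.
            "n_layers": rng.randint(1, 6),
        }
        trials.append((black_box(params), params))
    best_loss, best_params = min(trials, key=lambda trial: trial[0])
    return best_loss, best_params, trials


best_loss, best_params, trials = random_search(objective, n_trials=50)
print(best_loss, best_params)
```

Because the search loop never looks inside `objective`, the function is free to build a different architecture for each call, which is how a hyperparameter tool can be stretched to cover adding and removing layers.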