Problem about classification of MNIST dataset

Question

tsenggordon opened this issue 10 years ago

Dear all,
I'm trying to use the HTM spatial pooler as a feature extractor for an image recognition problem, with
the MNIST dataset as a starting point. As far as I know, the input to the spatial pooler should be binary data,
but MNIST pixel values range from 0 to 255. How should we convert those values to a binary input?
Should we threshold each pixel to 0 or 1, or represent each pixel value in binary with 8 bits
(28x28x8)?

David Ray · Answer 1 · Sat Mar 07 2015 21:48:41 GMT+0800 (China Standard Time)

Hi!

I believe this repository is a bleeding edge repository (for research only and provided as a courtesy) and is not supported, therefore you cannot open issues regarding this code.

Please come see us at: https://gitter.im/numenta/public or the Mailing List and we'll be very happy to help! :-)

breznak · Answer 2 · Sun Mar 08 2015 06:53:00 GMT+0800 (China Standard Time)

I think it's fine to open issues and ask, you just might not always get an
answer.

Have a look in the nupic.vision repo, there's an ImageRegion that serves as
an encoder for image data. Anyways, I think both mentioned approaches would
be interesting to try out.

0/1 threshold - loses a significant amount of information in the color domain
28x28x8 - on the other hand, breaks the spatial relationship between
neighboring bits

So... personally, I think the B/W reduction would work quite well for
MNIST.
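
To make the two options concrete, here is a minimal NumPy sketch of both encodings. It is an illustration only, not code from nupic or nupic.vision: the function names `threshold_encode` and `bitplane_encode`, the threshold of 128, and the random stand-in image are all assumptions made for the example.

```python
import numpy as np

def threshold_encode(image, threshold=128):
    """Map each 0-255 pixel to one bit: 1 if pixel >= threshold, else 0.

    The default threshold of 128 is an arbitrary choice for illustration.
    """
    image = np.asarray(image, dtype=np.uint8)
    return (image >= threshold).astype(np.uint8)

def bitplane_encode(image):
    """Expand each 0-255 pixel into its 8 binary digits, most significant first."""
    image = np.asarray(image, dtype=np.uint8)
    # Add a trailing axis of length 1, then unpack each byte into 8 bits along it.
    return np.unpackbits(image[..., np.newaxis], axis=-1)

# Random stand-in for a single 28x28 MNIST digit (hypothetical data).
rng = np.random.default_rng(0)
digit = rng.integers(0, 256, size=(28, 28), dtype=np.uint8)

bw = threshold_encode(digit)      # shape (28, 28), 784 bits in total
planes = bitplane_encode(digit)   # shape (28, 28, 8), 6272 bits in total

# Either result can be flattened into a 1-D binary vector before being
# passed on as input (assuming the consumer expects a flat bit array).
bw_input = bw.ravel()
planes_input = planes.ravel()
print(bw_input.size, planes_input.size)
```

The bit-plane variant also shows why similar inputs can end up with dissimilar codes: the neighboring intensities 127 (`01111111`) and 128 (`10000000`) differ in every one of their 8 bits, whereas thresholding keeps nearby pixels comparable at the cost of discarding the gray levels.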

Scott Purdy · Answer 3 · Fri Jun 05 2015 06:44:44 GMT+0800 (China Standard Time)

@tsenggordon feel free to look at the image sensor for how it handles images:
https://github.com/numenta/nupic/blob/master/nupic/regions/ImageSensor.py