raamana / neuropredict

Easy and comprehensive assessment of predictive power, with support for neuroimaging features

Home Page:https://raamana.github.io/neuropredict/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

IO error, unable to load features

johnaeanderson opened this issue · comments

commented

Hi Pradeep,

I finally got neuropredict installed with the dependencies using python 2.7.13! I'm having some issues with running the command, see the error message below,

Cheers,

John

(py2713) Johns-MacBook-Pro-2:KNN John$ neuropredict -m /Users/John/Desktop/KNN/METADATAFILE.csv -o /Users/John/Desktop/KNN/features/results -u /Users/John/Desktop/KNN/features

Requested features for analysis:
get_dir_of_dirs from /Users/John/Desktop/KNN/features
Traceback (most recent call last):
File "/Users/John/anaconda3/envs/py2713/lib/python2.7/site-packages/neuropredict/neuropredict.py", line 333, in getfeatures
data, feat_names = getmethod(featdir, subjid)
File "/Users/John/anaconda3/envs/py2713/lib/python2.7/site-packages/neuropredict/neuropredict.py", line 274, in get_dir_of_dirs
raise IOError('Unable to load features from \n{}'.format(featfile))

Looks like your folder doesn't have the right format. Can you show me the output for ls -R /Users/John/Desktop/KNN/features

commented

I just realized the metadata file names didn't exactly match the subject folder names - it seems to be running now ... will keep you posted :)

Ok works - turns out my features are terrible though. Will need to think about better ones. Thanks for putting this together!

J

Great. Try different training percentage just to be sure. Default is 50%, which can be low with small sample sizes.

[ps: i wont get notifications if you edit your previous comment, send new comments!]

commented

Good advice! It works better with -t .95. I only have 50 subjects, so this may be ok, I'll try .9 as well to see if I get comparable results. Thanks again! Cheers,

John

Glad to hear that. As you already can see, we need to keep in mind that these estimates are coming from small sample sizes and the performance will change (higher or lower, we don't now) on a much bigger sample..