documentcloud / cloud-crowd

Parallel Processing for the Rest of Us

Home Page: https://github.com/documentcloud/cloud-crowd/wiki

File downloads twice (code attached)

wnoronha opened this issue

Is this intended functionality?

Code and logs
http://gist.github.com/458155

Yep. You're returning the input as the output of each step, and they're the same URL, so the next step downloads that URL as a file again. Return from split the array of what you want to pass to process, and return from process what you want to pass to merge (although merge will get an array of the outputs from all process steps).
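
To make the data flow concrete, here is a rough sketch in plain Ruby. It does not use the cloud-crowd gem: `SketchAction` and `run_pipeline` are hypothetical stand-ins invented for illustration, and the `page-N` strings are made-up work items. The only point is that each step returns something new for the next step instead of echoing its own input.

```ruby
# Hypothetical stand-in for an action class. This is NOT the real
# CloudCrowd base class, just a toy with the same three step names.
class SketchAction
  attr_reader :input

  def initialize(input)
    @input = input
  end

  # split: return an array of work items, one per process step,
  # rather than returning the original input URL.
  def split
    (1..3).map { |n| "page-#{n}" }
  end

  # process: return the result for this one item, not the item itself.
  def process
    "#{input}: done"
  end

  # merge: here the input is an array of every process step's output.
  def merge
    input.join("; ")
  end
end

# Toy driver (an assumption, not how the real scheduler works):
# feeds each step's return value in as the next step's input.
def run_pipeline(action_class, job_input)
  parts   = action_class.new(job_input).split
  results = parts.map { |part| action_class.new(part).process }
  action_class.new(results).merge
end

puts run_pipeline(SketchAction, "http://example.com/doc.pdf")
```

If `split` or `process` returned `input` unchanged here, the next step would receive the same URL again, which matches the double download described in the issue.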