drain event called multiple times

Question

drain event called multiple times

villesau opened this issue 4 years ago · comments

Seems that with current version of Bottleneck drain event might be called multiple times. This happens at least if some of the requests fail.

Other case when drain is called more than once is when using grouping.

Mike Chen · Answer 1 · Wed May 13 2020 11:12:13 GMT+0800 (China Standard Time)

I believe you do not queue the new task before done called in callback. You should check this in your code, but if not you could paste the code in your convenience.
What do you mean by 'grouping'?

Ville Saukkonen · Answer 2 · Wed May 13 2020 15:53:39 GMT+0800 (China Standard Time)

This probably happens because limited queueSize can go negative. This would be fixed by updating to latest Bottleneck: https://github.com/SGrondin/bottleneck I have a fork which have it updated and this issue is fixed, but it also gets rid of unnecessary features that I don't need: https://github.com/villesau/node-crawler/tree/update-bottleneck

By grouping I mean limiter prop. Every time any limiter empties, drain is called.

Mike Chen · Answer 3 · Thu May 14 2020 11:52:39 GMT+0800 (China Standard Time)

Thank you for your reply,

we customized bottleneck package named bottleneckP which has priority included
we have thousands of scripts written base on crawler and this won't happen if code is written in the right way
better to post more details will help to find out the root cause. e.g. in what situation the unfinishedClients will go negative. As you can see we use it a lot in daily work, so it's good for me to know a corner case in which probably we'll have trouble.

Ville Saukkonen · Answer 4 · Thu May 14 2020 15:53:00 GMT+0800 (China Standard Time)

@mike442144 the current version of Bottleneck also supports priority.

queueSize can apparently go negative when the system retries failed calls. Also if multiple limiters are used, drain seems to be called for each of them separately. By updating the library I couldn't see queueSize to go negative anymore, so the issue has to be in bottleneckP, except with multiple limiters.

Mike Chen · Answer 5 · Fri May 15 2020 14:48:31 GMT+0800 (China Standard Time)

Actually I don't think retry failed task will cause negative queueSize, each time we get error done will be called and queueSize - 1, then enqueue the task as usual, that means queueSize +1.
Which version are you using? I suggest you to adopt the latest version to test again and post your testing result here.

Mike Chen · Answer 6 · Tue Jun 09 2020 14:49:55 GMT+0800 (China Standard Time)

close due to inactive, will reopen if any update