How to train this on a custom dataset ?

Question

How to train this on a custom dataset ?

dmenig opened this issue 5 years ago · comments

I've been looking at this for the past few days and I can't seem to figure out how I can use this to train on any other dataset than Thumos or ActivityNet. How would you train this in the end-to-end fashion proposed in the article please ?

guancheng817 · Answer 1 · Thu May 23 2019 17:41:28 GMT+0800 (China Standard Time)

same question..

95xueqian · Answer 2 · Tue Jun 11 2019 14:00:10 GMT+0800 (China Standard Time)

@hyperfraise have you know how to train this model on other dataset? thanks

Damien Menigaux · Answer 3 · Tue Jun 11 2019 16:25:50 GMT+0800 (China Standard Time)

I have, I also added a classification part, and made it really end-to-end. I'm not sure t's the most proper way of doing it but I'll show you soon.

guancheng817 · Answer 4 · Tue Jun 11 2019 16:49:18 GMT+0800 (China Standard Time)

I have, I also added a classification part, and made it really end-to-end. I'm not sure t's the most proper way of doing it but I'll show you soon.

Thanks for your works, look forward to your open source..

Damien Menigaux · Answer 5 · Tue Jun 11 2019 17:54:17 GMT+0800 (China Standard Time)

Here you have my work. I welcome criticism and error pointing !

https://github.com/hyperfraise/end-to-end-BSN.pytorch

shawxiao · Answer 6 · Thu Jul 18 2019 22:35:35 GMT+0800 (China Standard Time)

Hello, I would like to ask what kind of input data you have implemented for end-to-end BSN. I don't understand some of your source code. I'm sorry!

Damien Menigaux · Answer 7 · Thu Jul 18 2019 22:52:50 GMT+0800 (China Standard Time)

Videos of such shape (batch_size, channels, video_length, image_size, image_size)

Damien Menigaux · Answer 8 · Thu Jul 18 2019 22:53:29 GMT+0800 (China Standard Time)

And labels of shape [class, t_start, t_end]

Damien Menigaux · Answer 9 · Thu Jul 18 2019 22:53:58 GMT+0800 (China Standard Time)

See the https://github.com/hyperfraise/end-to-end-BSN.pytorch/blob/master/example.py file for demo on how to use it.

shawxiao · Answer 10 · Thu Jul 18 2019 23:39:57 GMT+0800 (China Standard Time)

Is there any script file processed? Take the thmos2014 data set as an example.

…

------------------ 原始邮件 ------------------ 发件人: "hyperfraise"<notifications@github.com>; 发送时间: 2019年7月18日(星期四) 晚上10:52 收件人: "wzmsltw/BSN-boundary-sensitive-network"<BSN-boundary-sensitive-network@noreply.github.com>; 抄送: "摸摸头^^"<2804597917@qq.com>;"Comment"<comment@noreply.github.com>; 主题: Re: [wzmsltw/BSN-boundary-sensitive-network] How to train this on acustom dataset ? (#35) Videos of such shape (batch_size, channels, video_length, image_size, image_size) — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

Damien Menigaux · Answer 11 · Fri Jul 19 2019 04:49:01 GMT+0800 (China Standard Time)

Custom datasets will have the particularity of having custom dataloader and custom formats of data storage, label storage etc... So I thought that the only thing I could actually do is tell the reader the format of the data that should be passed to the loss functions, and let the reader decide how (s)he needed to transform his(er) jsons and videos to this end.

However, you're right that it would allow for more flexibility on Thumos14 architectures. I'll try to implement a simple dataloader for this as soon as possible.

shawxiao · Answer 12 · Fri Jul 19 2019 08:32:00 GMT+0800 (China Standard Time)

Thank you very much.

shawxiao · Answer 13 · Sat Jul 20 2019 19:48:22 GMT+0800 (China Standard Time)

I would like to ask your model features or the features of brother tianwei. The video feature extraction was not added to the end-to-end architecture.

…

------------------ 原始邮件 ------------------ 发件人: "hyperfraise"; 发送时间: 2019年7月18日(星期四) 晚上10:53 收件人: "wzmsltw/BSN-boundary-sensitive-network"; 抄送: "shawxiao"<2804597917@qq.com>;"Comment"; 主题: Re: [wzmsltw/BSN-boundary-sensitive-network] How to train this on acustom dataset ? (#35) And labels of shape [class, t_start, t_end] — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

Damien Menigaux · Answer 14 · Sat Jul 20 2019 19:58:12 GMT+0800 (China Standard Time)

Hey. i'll respond to issues on my code, on the aforementionned repo. If you've got suggestions, I'll be glad to adress them there. I don't think here is the best place :)

https://github.com/hyperfraise/end-to-end-BSN.pytorch

Malithi-gif · Answer 15 · Wed Jan 06 2021 00:16:16 GMT+0800 (China Standard Time)

@hyperfraise have you apply this into custom video dataset?

Damien Menigaux · Answer 16 · Wed Jan 06 2021 00:22:17 GMT+0800 (China Standard Time)

Yes I failed to go above baseline accuracy. But I never could with temporal detection framework anyways :/

Malithi-gif · Answer 17 · Wed Jan 06 2021 00:25:09 GMT+0800 (China Standard Time)

@hyperfraise, Thanks for your reply. What method use for get the video features? TSN?

Damien Menigaux · Answer 18 · Wed Jan 06 2021 00:35:05 GMT+0800 (China Standard Time)

I trained it fully end to end with 2d resnet 18 feature extractor with the code I linked. So I extracted features on the fly.