wzmsltw / BSN-boundary-sensitive-network

Codes of our paper: "BSN: Boundary Sensitive Network for Temporal Action Proposal Generation"

How to train this on a custom dataset?

dmenig opened this issue · comments

I've been looking at this for the past few days and I can't figure out how to use it to train on any dataset other than Thumos or ActivityNet. How would you train this in the end-to-end fashion proposed in the article, please?

Same question.

@hyperfraise, have you figured out how to train this model on another dataset? Thanks.

I have. I also added a classification part and made it really end-to-end. I'm not sure it's the most proper way of doing it, but I'll show you soon.

Thanks for your work. Looking forward to your open-source release.

Here is my work. I welcome criticism and error reports!

https://github.com/hyperfraise/end-to-end-BSN.pytorch

Hello, I would like to ask what kind of input data your end-to-end BSN implementation expects. I don't understand some of your source code. I'm sorry!

Videos of shape (batch_size, channels, video_length, image_size, image_size)

And labels of shape [class, t_start, t_end]

Custom datasets come with their own dataloaders and their own formats for storing data and labels. So I thought the only thing I could actually do was tell the reader what format the data should be in when it is passed to the loss functions, and let them decide how to transform their JSONs and videos to that end.

However, you're right that it would allow for more flexibility on Thumos14 architectures. I'll try to implement a simple dataloader for this as soon as possible.
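For readers converting their own annotations, the following is a minimal sketch of turning a JSON file into `[class, t_start, t_end]` rows. The JSON layout, the `build_labels` helper, and the `class_to_index` mapping are all hypothetical and not taken from the linked repo. The thread also does not say whether `t_start` and `t_end` are in seconds, frames, or fractions of the video length, so normalization is left as an option.

```python
import json

# Hypothetical annotation layout; the thread does not specify one.
EXAMPLE_JSON = """
{
  "video_001": {
    "duration": 20.0,
    "segments": [
      {"label": "jump", "start": 2.0, "end": 5.0},
      {"label": "run", "start": 10.0, "end": 15.0}
    ]
  }
}
"""


def build_labels(annotations, class_to_index, normalize=True):
    """Return {video_id: [[class, t_start, t_end], ...]}.

    With normalize=True, times are divided by the video duration
    (an assumption; use normalize=False to keep seconds).
    """
    labels = {}
    for video_id, info in annotations.items():
        duration = float(info["duration"])
        rows = []
        for seg in info["segments"]:
            start, end = float(seg["start"]), float(seg["end"])
            # Reject segments that are empty or fall outside the video.
            if not 0.0 <= start < end <= duration:
                raise ValueError(f"bad segment in {video_id}: {seg}")
            if normalize:
                start, end = start / duration, end / duration
            rows.append([class_to_index[seg["label"]], start, end])
        labels[video_id] = rows
    return labels


annotations = json.loads(EXAMPLE_JSON)
class_to_index = {"jump": 0, "run": 1}
labels = build_labels(annotations, class_to_index)
print(labels["video_001"])
```

Each video can have a different number of rows, so a custom dataloader would still need to decide how to batch them (for example, padding or keeping a list per video).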

Thank you very much.

Hey. I'll respond to issues about my code on the aforementioned repo. If you've got suggestions, I'll be glad to address them there. I don't think this is the best place :)

https://github.com/hyperfraise/end-to-end-BSN.pytorch

@hyperfraise, have you applied this to a custom video dataset?

Yes, but I failed to get above baseline accuracy. Then again, I never could with a temporal detection framework anyway :/

@hyperfraise, thanks for your reply. What method did you use to get the video features? TSN?

I trained it fully end-to-end with a 2D ResNet-18 feature extractor, using the code I linked. So I extracted features on the fly.