I am using gluon’s I3D and SlowFast models for activity recognition on my own dataset (training and testing). Both models work fine out of the box, but there is something I don’t understand and would like to clear up.
Both I3D and SlowFast are supposed to be two-stream models. In the case of I3D, the two streams process the RGB and optical-flow modalities; in the case of SlowFast, one pathway operates on a small number of frames sampled over time, while the other operates on a larger number of frames but uses a lighter architecture.
I guess gluon’s implementations are one-stream, and one has to manually combine two instances in order to obtain a two-stream model (as in the original papers)? Is there any example of that?
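To make the question concrete, here is a minimal sketch of what I mean by "manually combining two instances": test-time late fusion of per-class scores from two separately trained one-stream networks, as in the original two-stream papers. The function and variable names are hypothetical, and it assumes you already have softmax outputs from an RGB network and a flow network.

```python
import numpy as np

def fuse_scores(rgb_scores, flow_scores, flow_weight=1.5):
    """Weighted average of per-class softmax scores from two streams.

    Hypothetical sketch: the two-stream papers often weight the flow
    stream somewhat higher at test time; flow_weight is that knob.
    """
    rgb = np.asarray(rgb_scores, dtype=float)
    flow = np.asarray(flow_scores, dtype=float)
    fused = rgb + flow_weight * flow
    # Renormalize so the fused scores still sum to 1.
    return fused / (1.0 + flow_weight)

# Toy example with 4 classes (made-up scores, not real model output):
rgb_scores = [0.1, 0.6, 0.2, 0.1]
flow_scores = [0.2, 0.5, 0.2, 0.1]
fused = fuse_scores(rgb_scores, flow_scores)
predicted_class = int(np.argmax(fused))
```

Is score-level averaging like this the intended way to pair up two gluon model instances, or is there a supported feature-level fusion path?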
And what would an implementation of the original SlowFast look like, with two different architectures (one per pathway)?