Algorithm for video classification and labelling

Hi, I am a newbie to ML and currently involved in developing a ML project in which I am trying to identify pets in a video. Can you tell me if there are any MXNet deep learning algorithms both CNN and RNN which can take the video feed directly and do video classification and labelling. Appreciate your help on this.