Hypothesis

3 Matching Annotations

May 2021
arxiv.org

1711.11248.pdf

1
1. dominik.lewy 26 May 2021
  
  in Public
  
  The second spatiotemporal variant is a “(2+1)D” convolutional block, which explicitly factorizes 3D convolution into two separate and successive operations, a 2D spatial convolution and a 1D temporal convolution.
  
  More nonlinearities and an easier optimization task
  
  action_recognition r3d_18 CV_for_HC
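
One way to see the factorization in the quoted passage is to count weights. The sketch below is an illustration only, not code from the paper or from any library: the function names are hypothetical, biases are ignored, and the rule for choosing the intermediate channel count `mid` is the parameter-matching formula as recalled from the R(2+1)D paper, so it should be treated as an assumption.

```python
def params_3d(n_in, n_out, t=3, d=3):
    """Weights in one full 3D convolution with a t x d x d kernel (no bias)."""
    return n_in * n_out * t * d * d


def matched_mid_channels(n_in, n_out, t=3, d=3):
    """Intermediate channels that keep the (2+1)D block near the 3D weight budget.

    Assumed formula: floor(t*d^2*n_in*n_out / (d^2*n_in + t*n_out)).
    """
    return (t * d * d * n_in * n_out) // (d * d * n_in + t * n_out)


def params_2plus1d(n_in, n_out, mid, t=3, d=3):
    """Weights in a 2D spatial conv (1 x d x d) followed by a 1D temporal conv (t x 1 x 1)."""
    spatial = n_in * mid * d * d   # n_in -> mid channels, spatial kernel only
    temporal = mid * n_out * t     # mid -> n_out channels, temporal kernel only
    return spatial + temporal


if __name__ == "__main__":
    n_in, n_out = 64, 64
    mid = matched_mid_channels(n_in, n_out)
    print("3D conv weights:      ", params_3d(n_in, n_out))
    print("intermediate channels:", mid)
    print("(2+1)D block weights: ", params_2plus1d(n_in, n_out, mid))
```

With a nonlinearity (for example a ReLU) placed between the spatial and the temporal convolution, the factorized block gains an extra nonlinearity at a comparable weight count, which is presumably what the annotator's note refers to.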

Tags

action_recognition

r3d_18

CV_for_HC

Annotators

dominik.lewy

URL

arxiv.org/pdf/1711.11248.pdf
Mar 2021
arxiv.org

1711.11248.pdf

2
1. dominik.lewy 17 Mar 2021
  
  in Public
  
  Regardless of the size of output produced by the last convolutional layer, each network applies global spatiotemporal average pooling to the final convolutional tensor, followed by a fully-connected (fc) layer performing the final classification (the output dimension of the fc layer matches the number of classes, e.g., 400 for Kinetics).
  
  action_recognition r3d_18 CV_for_HC
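
The classification head described in this passage can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the paper's implementation: tensors are nested lists indexed as `[channel][time][height][width]`, and `global_avg_pool`, `fully_connected`, and `constant_map` are hypothetical helper names. The point it shows is that pooling over time, height, and width yields one value per channel, so the fc layer sees the same input length whatever the size of the final convolutional tensor.

```python
def global_avg_pool(feature_map):
    """Average each channel over all time, height, and width positions."""
    pooled = []
    for channel in feature_map:
        values = [v for frame in channel for row in frame for v in row]
        pooled.append(sum(values) / len(values))
    return pooled


def fully_connected(features, weights, bias):
    """One logit per class: dot product of the features with each weight row, plus bias."""
    return [sum(w * x for w, x in zip(row, features)) + b
            for row, b in zip(weights, bias)]


def constant_map(channels, t, h, w):
    """Toy tensor in which channel c is filled with the value c (hypothetical helper)."""
    return [[[[float(c)] * w for _ in range(h)] for _ in range(t)]
            for c in range(channels)]


if __name__ == "__main__":
    num_classes, channels = 4, 3   # toy sizes; the passage cites 400 classes for Kinetics
    weights = [[0.1 * (k + 1)] * channels for k in range(num_classes)]
    bias = [0.0] * num_classes
    for shape in [(2, 7, 7), (1, 4, 4)]:   # two different final tensor sizes
        pooled = global_avg_pool(constant_map(channels, *shape))
        logits = fully_connected(pooled, weights, bias)
        print(shape, "pooled length:", len(pooled), "logits:", len(logits))
```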
2. dominik.lewy 17 Mar 2021
  
  in Public
  
  The first formulation is named mixed convolution (MC) and consists in employing 3D convolutions only in the early layers of the network, with 2D convolutions in the top layers.
  
  r3d_18 CV_for_HC action_recognition
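
The mixed-convolution layout in this passage can be written down as a per-group plan. The sketch below is a hedged illustration: the function `mixed_conv_plan`, the default of five convolutional groups, and the switch point `first_2d_group=3` are assumptions chosen for the example and are not stated in the quoted text. A 2D convolution is represented as a kernel with temporal extent 1.

```python
def mixed_conv_plan(num_groups=5, first_2d_group=3, t=3, d=3):
    """Return (group index, kind, kernel shape) for each convolutional group.

    Groups before `first_2d_group` use 3D kernels (t, d, d);
    groups from `first_2d_group` onward use 2D kernels, written as (1, d, d).
    """
    plan = []
    for group in range(1, num_groups + 1):
        if group < first_2d_group:
            plan.append((group, "3D", (t, d, d)))
        else:
            plan.append((group, "2D", (1, d, d)))
    return plan


if __name__ == "__main__":
    for group, kind, kernel in mixed_conv_plan():
        print(f"group {group}: {kind} conv, kernel {kernel}")
```

Moving `first_2d_group` earlier or later shifts how much of the network models motion with 3D kernels before falling back to per-frame 2D filtering.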