The architecture is simple enough: image input, batch normalization, lstm, convolution, max pool, fully connected, output.
Obviously a flatten layer is needed between batch norm and lstm, however the flatten layer provided in matlab is not compatible with image input layers (both 2D and 3D).
Reading the Flatten.m source file, the comments list the basic details of image dimensions, however the FlattenLayer.m (class) and flattenLayer.m (function), only list sequence data dimensions in their comments.
This makes me wonder if it is at all possible to make a custom flatten layer that is compatible with image input.
0 Comments
Sign in to comment.