I'm getting this error while I'm trying to train using object_detection_recordio…_format.ipynb
Here's the error log:
Docker entrypoint called with argument(s): train
[07/25/2018 14:57:59 INFO 139826690209600] Reading default configuration from /opt/amazon/lib/python2.7/site-packages/algorithm/default-input.json: {u'lr_scheduler_step': u'', u'weight_decay': u'0.0005', u'optimizer': u'sgd', u'_tuning_objective_metric': u'', u'base_network': u'vgg-16', u'freeze_layer_pattern': u'', u'use_pretrained_model': u'0', u'_kvstore': u'device', u'label_width': u'350', u'kv_store': u'device', u'epochs': u'30', u'nms_threshold': u'0.45', u'momentum': u'0.9', u'overlap_threshold': u'0.5', u'lr_scheduler_factor': u'0.1', u'image_shape': u'300', u'_num_kv_servers': u'auto', u'mini_batch_size': u'32', u'learning_rate': u'0.001', u'num_classes': u'', u'num_training_samples': u''}
[07/25/2018 14:57:59 INFO 139826690209600] Reading provided configuration from /opt/ml/input/config/hyperparameters.json: {u'lr_scheduler_step': u'3,6', u'weight_decay': u'0.0005', u'mini_batch_size': u'32', u'optimizer': u'sgd', u'base_network': u'resnet-50', u'learning_rate': u'0.001', u'use_pretrained_model': u'0', u'label_width': u'350', u'epochs': u'20', u'overlap_threshold': u'0.5', u'num_training_samples': u'924', u'num_classes': u'10', u'nms_threshold': u'0.45', u'image_shape': u'224', u'momentum': u'0.9', u'lr_scheduler_factor': u'0.1'}
[07/25/2018 14:57:59 INFO 139826690209600] Final configuration: {u'label_width': u'350', u'epochs': u'20', u'overlap_threshold': u'0.5', u'lr_scheduler_factor': u'0.1', u'_num_kv_servers': u'auto', u'weight_decay': u'0.0005', u'mini_batch_size': u'32', u'use_pretrained_model': u'0', u'freeze_layer_pattern': u'', u'lr_scheduler_step': u'3,6', u'momentum': u'0.9', u'optimizer': u'sgd', u'_tuning_objective_metric': u'', u'learning_rate': u'0.001', u'kv_store': u'device', u'nms_threshold': u'0.45', u'num_classes': u'10', u'base_network': u'resnet-50', u'num_training_samples': u'924', u'_kvstore': u'device', u'image_shape': u'224'}
[07/25/2018 14:57:59 INFO 139826690209600] Using default worker.
[07/25/2018 14:57:59 INFO 139826690209600] Loaded iterator creator application/x-image for content type ('application/x-image', '1.0')
[07/25/2018 14:57:59 INFO 139826690209600] Loaded iterator creator application/x-recordio for content type ('application/x-recordio', '1.0')
[07/25/2018 14:57:59 INFO 139826690209600] Loaded iterator creator image/png for content type ('image/png', '1.0')
[07/25/2018 14:57:59 INFO 139826690209600] Loaded iterator creator image/jpeg for content type ('image/jpeg', '1.0')
[07/25/2018 14:57:59 WARNING 139826690209600] Training images are resized to image shape (3, 224, 224)
[14:57:59] /opt/brazil-pkg-cache/packages/AIAlgorithmsMXNet/AIAlgorithmsMXNet-1.1.x.200530.0/RHEL5_64/generic-flavor/src/src/io/iter_image_det_recordio.cc:281: ImageDetRecordIOParser: /opt/ml/input/data/train/mydata_train.rec, use 7 threads for decoding..
Algorithm Error: Internal Server Error
[14:57:59] /opt/brazil-pkg-cache/packages/AIAlgorithmsMXNet/AIAlgorithmsMXNet-1.1.x.200530.0/RHEL5_64/generic-flavor/src/src/io/iter_image_det_recordio.cc:315: Not enough label packed in img_list or rec file.
Stack trace returned 9 entries:
[bt] (0) /opt/amazon/lib/libaialgsdataiter.so(dmlc::StackTrace()+0x3d) [0x7f2bee78d46d]
[bt] (1) /opt/amazon/lib/libaialgsdataiter.so(dmlc::LogMessageFatal::~LogMessageFatal()+0x1a) [0x7f2bee78d70a]
[bt] (2) /opt/amazon/lib/libmxnet.so(+0x17bba60) [0x7f2be37a5a60]
[bt] (3) /opt/amazon/lib/libiomp5.so(__kmp_invoke_microtask+0x93) [0x7f2bd819aac3]
[bt] (4) /opt/amazon/lib/libiomp5.so(+0x84257) [0x7f2bd8169257]
[bt] (5) /opt/amazon/lib/libiomp5.so(+0x838d5) [0x7f2bd81688d5]
[bt] (6) /opt/amazon/lib/libiomp5.so(+0xb5fa4) [0x7f2bd819afa4]
[bt] (7) /lib64/libpthread.so.0(+0x7dc5) [0x7f2beff08dc5]
[bt] (8) /lib64/libc.so.6(clone+0x6d) [0x7f2bef3056ed]