For each image in the test dataset, you will predict a list of label and bounding boxes.

It contains two columns:

  • ImageId: the id of the test image, for example Adachi_test_00000001
  • PredictionString: the prediction string should be a space delimited of 5 integers. For example, 2 240 170 260 240 means it's label 2, with a bounding box of coordinates (x_min, y_min, x_max, y_max). We accept up to 5 predictions. For example, if you submit 3 42 24 170 186 1 292 28 430 198 4 168 24 292 190 5 299 238 443 374 2 160 195 294 357 6 3 214 135 356 which contains 6 bounding boxes, we will only take the first 5 into consideration.