Intercept the corresponding target image from the xml in the voc data set