Bag-of-visual-words vs global image descriptors on two-stage multimodal retrieval