Identifying visual attributes for object recognition from text and taxonomy

Attributes of objects such as “square”, “metallic”, and“red” allow a way for humans to explain or discriminate object categories. These attributes also provide a useful intermediate representation for object recognition, including support for zero-shot learning from textual descriptions of object appearance. However, manual selection of relevant attributes among thousands of potential candidates is labor intensive. Hence, there is increasing interest in mining attributes for object recognition. In this paper, we introduce two novel techniques for nominating attributes and a method for assessing the suitability of candidate attributes for object recognition. The first technique for attribute nomination estimates attribute qualities based on their ability to discriminateobjectsatmultiplelevels ofthetaxonomy.Thesecondtechnique leveragesthelinguistic concept of distributional similarity to further refine the estimated qualities. Attribute nomination is followed by our attribute assessment procedure, which assesses the quality of the candidate attributes based on their performance in object recognition. Our evaluations demonstrate that both taxonomy and distributional similarity serveasuseful sourcesofinformation forattribute nomination,andourmethodscaneffectively exploit them. We use the mined attributes in supervised and zero-shot learning settings to show the utility of the selected attributes in object recognition. Our experimental results show that in the supervised case we can improve on a state of the art classifier while in the zero-shot scenario we make accurate predictions outperforming previous automated techniques.

Author: Caglar Tirkaz, Jacob Eisenstein, T. Metin Sezgin and Berrin Yanikoglu.

Read the full paper.

Leave a Reply

Your email address will not be published. Required fields are marked *