The resulting categories contained a high proportion of related objects. For example, one category assigned the highest
weights for highway, car, sky, vehicle, and signpost—most likely corresponding to highways or ground transportation. Furthermore, the model assigned intuitive categories to the scenes in the database, tagging a harbor scene with nautical and cityscape categories. This is not surprising, given that LDA and its extensions have proven widely applicable in an analogous problem, determining categories from text documents (Blei et al., 2003). The LDA approach taken by Stansbury et al. (2013) has revealed hidden structure in natural images, but does the visual system exploit this structure in its representation http://www.selleckchem.com/products/cx-5461.html of visual scenes? One way to answer this question is to ask whether some aspect of brain activity correlates systematically with scene categories during the viewing of natural images. This would suggest that the brain encodes the scene categories in the same way that previous work has suggested an encoding of faces or orientations. To tackle this question, Stansbury et al. (2013) had subjects view a variety of different scenes and simultaneously recorded their brain activity with fMRI. Then, the authors attempted to predict the BOLD response in each voxel under the assumption that
the response to a scene was given by a weighted sum of the scene’s category AZD6244 clinical trial vector. Responses in low-level striate and extrastriate visual areas, which are sensitive to elementary features such as orientation and contrast, were poorly modulated by scene category. However, responses in anterior visual areas such as the fusiform face area (FFA) and the parahippocampal place area (PPA) could be accurately predicted by the encoding model. The authors found that the predictions were most accurate when
the LDA model contained 20 categories and 850 objects, indicating that there is substantially more categorical information available at the macroscopic fMRI scale than previously appreciated. Importantly, the number of voxels significantly predicted by the category-encoding model was larger than alternative models relying on elementary visual features, such as orientation or spatial frequency. This Dipeptidyl peptidase was a crucial test of the hypothesis that high-level visual areas actually represent scene categories rather than visual stimuli per se (Malach et al., 1995). Consistent with this idea, the model was also significantly more accurate than others that relied only on the presence of individual objects. Category preferences in different areas were, to some degree, consistent with previous literature. For example, the FFA showed a relative preference for the portraits category, whereas the PPA was most selective for categories that could be labeled “places.