Effective and efficient ROI-wise visual encoding using an end-to-end CNN regression model and selective optimization

Qiao, Kai; Zhang, Chi; Chen, Jian; Wang, Linyuan; Tong, Li; Yan, Bin

Abstract:Recently, visual encoding based on functional magnetic resonance imaging (fMRI) have realized many achievements with the rapid development of deep network computation. Visual encoding model is aimed at predicting brain activity in response to presented image stimuli. Currently, visual encoding is accomplished mainly by firstly extracting image features through convolutional neural network (CNN) model pre-trained on computer vision task, and secondly training a linear regression model to map specific layer of CNN features to each voxel, namely voxel-wise encoding. However, the two-step manner model, essentially, is hard to determine which kind of well features are well linearly matched for beforehand unknown fMRI data with little understanding of human visual representation. Analogizing computer vision mostly related human vision, we proposed the end-to-end convolution regression model (ETECRM) in the region of interest (ROI)-wise manner to accomplish effective and efficient visual encoding. The end-to-end manner was introduced to make the model automatically learn better matching features to improve encoding performance. The ROI-wise manner was used to improve the encoding efficiency for many voxels. In addition, we designed the selective optimization including self-adapting weight learning and weighted correlation loss, noise regularization to avoid interfering of ineffective voxels in ROI-wise encoding. Experiment demonstrated that the proposed model obtained better predicting accuracy than the two-step manner of encoding models. Comparative analysis implied that end-to-end manner and large volume of fMRI data may drive the future development of visual encoding.

Comments:	under review in Computational Intelligence and Neuroscience
Subjects:	Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1907.11885 [q-bio.NC]
	(or arXiv:1907.11885v1 [q-bio.NC] for this version)
	https://doi.org/10.48550/arXiv.1907.11885

Quantitative Biology > Neurons and Cognition

Title:Effective and efficient ROI-wise visual encoding using an end-to-end CNN regression model and selective optimization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators