https://www.ncbi.nlm.nih.gov/pubmed/29253864

2017 Dec 18;13(12):e1005859. doi: 10.1371/journal.pcbi.1005859. eCollection 2017 Dec.

Invariant recognition drives neural representations of action sequences.

Author information

1: Center for Brains Minds and Machines, Massachusetts Institute of Technology, Cambridge, MA, United States.

Abstract

Recognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual appearance of a scene. This ability to generalize across complex transformations is a hallmark of human visual intelligence. Advances in understanding action recognition at the neural level have not always translated into precise accounts of the computational principles underlying what representations of action sequences are constructed by human visual cortex. Here we test the hypothesis that invariant action discrimination might fill this gap. Recently, the study of artificial systems for static object perception has produced models, Convolutional Neural Networks (CNNs), that achieve human level performance in complex discriminative tasks. Within this class, architectures that better support invariant object recognition also produce image representations that better match those implied by human and primate neural data. However, whether these models produce representations of action sequences that support recognition across complex transformations and closely follow neural representations of actions remains unknown. Here we show that spatiotemporal CNNs accurately categorize video stimuli into action classes, and that deliberate model modifications that improve performance on an invariant action recognition task lead to data representations that better match human neural recordings. Our results support our hypothesis that performance on invariant discrimination dictates the neural representations of actions computed in the brain. These results broaden the scope of the invariant recognition framework for understanding visual intelligence from perception of inanimate objects and faces in static images to the study of human perception of action sequences.

PMID:: 29253864
PMCID:: PMC5749869
DOI:: 10.1371/journal.pcbi.1005859

MIT 컴퓨터생명공학 에서 낸 논문. 시공간 CNN (Spatiotemporal Convolutional Neural Networks model) 을 통해서, 사람의 움직임 (마시기, 먹기, 점프하기, 달리기, 걷기)을 판별해보니, 시신경 신호와 근접하게 나왔다...

201801 PLoS _Invariant recognition drives neural representations of action sequences.pdf

저작자표시 비영리 변경금지

'Others' 카테고리의 다른 글

Time to regenerate: the doctor in the age of artificial intelligence (0)	2018.04.26
Network Configurations in the Human Brain Reflect Choice Bias during Rapid Face Processing. (0)	2018.01.27
Machine learning in cardiovascular medicine: are we there yet? (0)	2018.01.27
A Removal of Eye Movement and Blink Artifacts from EEG Data Using Morphological Component Analysis. (0)	2017.10.16
Conflicting results between the analysis of skin lesions using a mobile-phone application and a dermatologist's clinical diagnosis: a pilot study. (0)	2017.10.16

의료와 인공지능

Invariant recognition drives neural representations of action sequences

Invariant recognition drives neural representations of action sequences.

Author information

Abstract

'Others' 카테고리의 다른 글

티스토리툴바

Invariant recognition drives neural representations of action sequences

Invariant recognition drives neural representations of action sequences.

Author information

Abstract

'Others' 카테고리의 다른 글

'Others' Related Articles

티스토리툴바