https://www.ncbi.nlm.nih.gov/pubmed/29735249

2018 Feb 22. pii: S0893-6080(18)30049-2. doi: 10.1016/j.neunet.2018.02.010. [Epub ahead of print]

An adaptive deep Q-learning strategy for handwritten digit recognition.

Author information

1: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100124, China. Electronic address: junfeiq@bjut.edu.cn.
2: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100124, China. Electronic address: xiaowangqsd@163.com.
3: Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing 100124, China. Electronic address: wenjing.li@bjut.edu.cn.
4: Department of Obstetrics Gynecology, Civil Aviation General Hospital, Beijing 100123, China. Electronic address: cmcajh@163.com.

Abstract

Handwritten digits recognition is a challenging problem in recent years. Although many deep learning-based classification algorithms are studied for handwritten digits recognition, the recognition accuracy and running time still need to be further improved. In this paper, an adaptive deep Q-learning strategy is proposed to improve accuracy and shorten running time for handwritten digit recognition. The adaptive deep Q-learning strategy combines the feature-extracting capability of deep learning and the decision-making of reinforcement learning to form an adaptive Q-learning deep belief network (Q-ADBN). First, Q-ADBN extracts the features of original images using an adaptive deep auto-encoder (ADAE), and the extracted features are considered as the current states of Q-learning algorithm. Second, Q-ADBN receives Q-function (reward signal) during recognition of the current states, and the final handwritten digits recognition is implemented by maximizing the Q-function using Q-learning algorithm. Finally, experimental results from the well-known MNIST dataset show that the proposed Q-ADBN has a superiority to other similar methods in terms of accuracy and running time.

KEYWORDS:

Adaptive Q-learning deep belief network; Adaptive deep auto-encoder; Deep learning; Handwritten digits recognition; Reinforcement learning

PMID:

29735249

DOI:

10.1016/j.neunet.2018.02.010

adaptive Q-learning deep belief network (Q-ADBN)

201805 Neural Netw An adaptive deep Q-learning strategy for handwritten digit recognition.pdf

저작자표시 비영리 변경금지

'Reinforcement Learning' 카테고리의 다른 글

★Randomised controlled trial of WISENSE, a real-time quality improving system for monitoring blind spots during esophagogastroduodenoscopy. (0)	2019.03.16
Model-based and model-free pain avoidance learning. (0)	2018.11.07
Introduction to the special issue on deep reinforcement learning: An editorial. (0)	2018.08.24
Encouraging Physical Activity in Patients With Diabetes: Intervention Using a Reinforcement Learning System. (0)	2017.10.16

의료와 인공지능

An adaptive deep Q-learning strategy for handwritten digit recognition.

An adaptive deep Q-learning strategy for handwritten digit recognition.

Author information

Abstract

KEYWORDS:

'Reinforcement Learning' 카테고리의 다른 글

티스토리툴바

An adaptive deep Q-learning strategy for handwritten digit recognition.

An adaptive deep Q-learning strategy for handwritten digit recognition.

Author information

Abstract

KEYWORDS:

'Reinforcement Learning' 카테고리의 다른 글

'Reinforcement Learning' Related Articles

티스토리툴바