Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/108959
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Applied Physics-
dc.contributorMainland Development Office-
dc.contributorResearch Institute for Smart Energy-
dc.creatorGuo, F-
dc.creatorIo, WF-
dc.creatorDang, Z-
dc.creatorDing, R-
dc.creatorPang, SY-
dc.creatorZhao, Y-
dc.creatorHao, J-
dc.date.accessioned2024-09-11T08:33:55Z-
dc.date.available2024-09-11T08:33:55Z-
dc.identifier.issn2051-6347-
dc.identifier.urihttp://hdl.handle.net/10397/108959-
dc.language.isoenen_US
dc.publisherRoyal Society of Chemistryen_US
dc.rightsThis journal is © The Royal Society of Chemistry 2023en_US
dc.rightsThe following publication Guo, F., Io, W. F., Dang, Z., Ding, R., Pang, S.-Y., Zhao, Y., & Hao, J. (2023). Achieving reinforcement learning in a three-active-terminal neuromorphic device based on a 2D vdW ferroelectric material [10.1039/D3MH00714F]. Materials Horizons, 10(9), 3719-3728 is available at https://doi.org/10.1039/D3MH00714F.en_US
dc.titleAchieving reinforcement learning in a three-active-terminal neuromorphic device based on a 2D vdW ferroelectric materialen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage3719-
dc.identifier.epage3728-
dc.identifier.volume10-
dc.identifier.issue9-
dc.identifier.doi10.1039/d3mh00714f-
dcterms.abstractCurrently, for most three-terminal neuromorphic devices, only the gate terminal is active. The inadequate modes and freedom of modulation in such devices greatly hinder the implementation of complex neural behaviors and brain-like thinking strategies in hardware systems. Taking advantage of the unique feature of co-existing in-plane (IP) and out-of-plane (OOP) ferroelectricity in two-dimensional (2D) ferroelectric α-In2Se3, we construct a three-active-terminal neuromorphic device where any terminal can modulate the conductance state. Based on the co-operation mode, controlling food intake as a complex nervous system-level behavior is achieved to carry out positive and negative feedback. Specifically, reinforcement learning as a brain-like thinking strategy is implemented due to the coupling between polarizations in different directions. Compared to the single modulation mode, the chance of the agent successfully obtaining the reward in the Markov decision process is increased from 68% to 82% under the co-operation mode through the coupling effect between IP and OOP ferroelectricity in 2D α-In2Se3 layers. Our work demonstrates the practicability of three-active-terminal neuromorphic devices in handling complex tasks and advances a significant step towards implementing brain-like learning strategies based on neuromorphic devices for dealing with real-world challenges.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationMaterials horizons, 1 Sept 2023, v. 10, no. 9, p. 3719-3728-
dcterms.isPartOfMaterials horizons-
dcterms.issued2023-09-01-
dc.identifier.scopus2-s2.0-85165305518-
dc.identifier.pmid37403831-
dc.identifier.eissn2051-6355-
dc.description.validate202409 bcch-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumbera3188en_US
dc.identifier.SubFormID49758en_US
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Guo_Achieving_Reinforcement_Learning.pdfPre-Published version1.91 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

89
Citations as of Feb 9, 2026

Downloads

91
Citations as of Feb 9, 2026

SCOPUSTM   
Citations

16
Citations as of May 8, 2026

WEB OF SCIENCETM
Citations

15
Citations as of Apr 23, 2026

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.