STUDIA UNIVERSITATIS

AMBIENTUM BIOETHICA BIOLOGIA CHEMIA DIGITALIA DRAMATICA EDUCATIO ARTIS GYMNAST. ENGINEERING EPHEMERIDES EUROPAEA GEOGRAPHIA GEOLOGIA HISTORIA HISTORIA ARTIUM INFORMATICA IURISPRUDENTIA MATHEMATICA MUSICA NEGOTIA OECONOMICA PHILOLOGIA PHILOSOPHIA PHYSICA POLITICA PSYCHOLOGIA-PAEDAGOGIA SOCIOLOGIA THEOLOGIA CATHOLICA THEOLOGIA CATHOLICA LATIN THEOLOGIA GR.-CATH. VARAD THEOLOGIA ORTHODOXA THEOLOGIA REF. TRANSYLVAN ROMÂNA ENGLISH INFO PARTENERI ADRESE DE CONTACT ACCES PARTENERI FORMULAR ABONAMENT NEWSLETTER & DOWNLOAD CELE MAI NOI APARITII APARITII ÎN ANUL CURENT TOATA ARHIVA STUDIA CAUTARE ÎN ARHIVA ISTORIE PREZENT SCOP SI OBIECTIVE ECHIPA


	Rezumat articol ediţie STUDIA UNIVERSITATIS BABEŞ-BOLYAI În partea de jos este prezentat rezumatul articolului selectat. Pentru revenire la cuprinsul ediţiei din care face parte acest articol, se accesează linkul din titlu. Pentru vizualizarea tuturor articolelor din arhivă la care este autor/coautor unul din autorii de mai jos, se accesează linkul din numele autorului.


	STUDIA INFORMATICA - Ediţia nr.2 din 2020

	Articol:	A VIEW ON DEEP REINFORCEMENT LEARNING IN IMPERFECT INFORMATION GAMES. Autori: TIDOR-VLAD PRICOPE.


	Rezumat: DOI: 10.24193/subbi.2020.2.03 Published Online: 2020-12-09 Published Print: 2020-12-30 pp. 31-49 FULL PDF VIEW PDF Abstract. Many real-world applications can be described as large-scale games of imperfect information. This kind of games is particularly harder than the deterministic one as the search space is even more sizeable. In this paper, I want to explore the power of reinforcement learning in such an environment; that is why I take a look at one of the most popular game of such type, no limit Texas Hold’em Poker, yet unsolved, developing multiple agents with different learning paradigms and techniques and then comparing their respective performances. When applied to no-limit Hold’em Poker, deep reinforcement learning agents clearly outperform agents with a more traditional approach. Moreover, if these last agents rival a human beginner level of play, the ones based on reinforcement learning compare to an amateur human player. The main algorithm uses Fictitious Play in combination with ANNs and some handcrafted metrics. We also applied the main algorithm to another game of imperfect information, less complex than Poker, in order to show the scalability of this solution and the increase in performance when put neck in neck with established classical approaches from the reinforcement learning literature. Received by the editors: 27 July 2020. 2010 Mathematics Subject Classiffication. 68T05. 1998 CR Categories and Descriptors. I.2.1 [Artificial Intelligence]: Applications and Expert Systems - Games. Key words and phrases. Artificial Intelligence, Computer Poker, Adaptive Learning, Fictitious Play, Deep Reinforcement Learning, Neural Networks.




			Revenire la pagina precedentă