Historikk

Cristin-resultat-ID: 2104034

Sist endret: 23. januar 2023, 08:02

NVI-rapporteringsår: 2022

Resultat

Vitenskapelig artikkel

2022

The Hierarchical Discrete Learning Automaton Suitable for Environments with Many Actions and High Accuracy Requirements

Rebekka Olsson Omslandseter
Lei Jiao
Xuan Zhang
Anis Yazidi og
John Oommen

Tidsskrift Tidsskrift

Lecture Notes in Computer Science (LNCS)

ISSN 0302-9743
e-ISSN 1611-3349

NVI-nivå 1

Finn i kanalregisteret

Om resultatet Om resultatet

Vitenskapelig artikkel

Publiseringsår: 2022

Publisert online: 2022

Trykket: 2022

Volum: 13151

Sider: 507 - 518

Open Access

Lenker Lenker

original online (doi)

https://doi.org/10.1007/978-3-030-97546-3_41

Institusjonsarkiv

hdl.handle.net/11250/3069016

Beskrivelse Beskrivelse

Engelsk

Tittel

The Hierarchical Discrete Learning Automaton Suitable for Environments with Many Actions and High Accuracy Requirements

Sammendrag

Since its early beginning, the paradigm of Learning Automata (LA), has attracted much interest. Over the last decades, new concepts and various improvements have been introduced to increase the LA’s speed and accuracy, including employing probability updating functions, discretizing the probability space, and implementing the “Pursuit” concept. The concept of incorporating “structure” into the ordering of the LA’s actions is one of the latest advancements to the field, leading to the ϵ-optimal Hierarchical Continuous Pursuit LA (HCPA) that has superior performance to other LA variants when the number of actions is large. Although the previously proposed HCPA is powerful, its speed has a handicap when the required action probability of an action is approaching unity. The reason for this slow convergence is that the learning parameter operates in a multiplicative manner within the probability space, making the increment of the action probability smaller as its probability becomes close to unity. Therefore, we propose the novel Hierarchical Discrete Learning Automata (HDPA) in this paper, which does not possess the same impediment as the HCPA. The proposed machine infuse the principle of discretization into the action probability vector’s updating functionality, where this type of updating is invoked recursively at every depth within a hierarchical tree structure and we pursue the best estimated action in all iterations through utilization of the Estimator phenomenon. The proposed machine is ϵ-optimal, and our experimental results demonstrate that the number of iterations required before convergence is significantly reduced for the HDPA, when compared with the HCPA.

Vis fullstendig beskrivelse

Bidragsytere Bidragsytere

Rebekka Olsson Omslandseter

Forfatter
ved Institutt for informasjons- og kommunikasjonsteknologi ved Universitetet i Agder

Lei Jiao

Forfatter
ved Institutt for informasjons- og kommunikasjonsteknologi ved Universitetet i Agder

Xuan Zhang

Forfatter
ved NORCE Klima og miljø ved NORCE Norwegian Research Centre AS

Anis Yazidi

Forfatter
ved Institutt for informasjonsteknologi ved OsloMet - storbyuniversitetet

John Oommen

Forfatter
ved Carleton University
Forfatter
ved Institutt for informasjons- og kommunikasjonsteknologi ved Universitetet i Agder

1 - 5 av 5