. This problem can be modelled as a stochastic shortest path
search in a Markov Decision Process [1], where transitions between nodes...
... Scientific, 4th ed., 2012.
[2] Bandit Algorithms, T. Lattimore and C. Szepesvári, Cambridge University Press, 2020.
[3] Multi...
€41,000 to €45,000 per year