IntervalMDP.jl is a Julia package that provides tools for the analysis of Interval Markov Decision Processes (IMDPs). Key features of the package include:
The package solves the robust value iteration problem for IMDPs by iteratively computing the optimal pessimistic or optimistic probabilities of reaching a given set of states or optimizing a discounted reward. The authors introduce a GPU-accelerated algorithm that leverages parallel sorting and cumulative sum computations to significantly improve the efficiency of value iteration compared to existing tools.
Computational experiments on a set of 35 IMDP benchmarks show that the CPU implementation of IntervalMDP.jl is on average 2-4 times faster than the state-of-the-art, while the GPU implementation can achieve speedups of up to 200 times on larger models. Additionally, the package requires less memory compared to other tools due to its use of sparse matrices and the Julia type system.
A otro idioma
del contenido fuente
arxiv.org
Ideas clave extraídas de
by Frederik Bay... a las arxiv.org 04-30-2024
https://arxiv.org/pdf/2401.04068.pdfConsultas más profundas