Advanced search
1 file | 825.76 KB Add to list

Learning temporal task specifications from demonstrations

Mattijs Baert (UGent) , Sam Leroux (UGent) and Pieter Simoens (UGent)
Author
Organization
Project
Abstract
As we progress towards real-world deployment, the critical need for interpretability in reinforcement learning algorithms grows more pivotal, ensuring the safety and reliability of intelligent agents. This paper tackles the challenge of acquiring task specifications in linear temporal logic through expert demonstrations, aiming to alleviate the burdensome task of specification engineering. The rich semantics of temporal logics serve as an interpretable framework for delineating intricate, multi-stage tasks. We propose a method which iteratively learns a task specification and a nominal policy solving this task. In each iteration, the task specification is refined to better distinguish expert trajectories from trajectories sampled from the nominal policy. With this process we obtain a concise and interpretable task specification. Unlike previous work, our method is capable of learning directly from trajectories in the original state space and does not require predefined atomic propositions. We showcase the effectiveness of our method on multiple tasks in both an office and a Minecraft-inspired environment.

Downloads

  • (...).pdf
    • full text (Published version)
    • |
    • UGent only
    • |
    • PDF
    • |
    • 825.76 KB

Citation

Please use this url to cite or link to this publication:

MLA
Baert, Mattijs, et al. Learning Temporal Task Specifications from Demonstrations. Vol. 14847, Springer, 2024, pp. 81–98, doi:10.1007/978-3-031-70074-3_5.
APA
Baert, M., Leroux, S., & Simoens, P. (2024). Learning temporal task specifications from demonstrations. 14847, 81–98. https://doi.org/10.1007/978-3-031-70074-3_5
Chicago author-date
Baert, Mattijs, Sam Leroux, and Pieter Simoens. 2024. “Learning Temporal Task Specifications from Demonstrations.” In , 14847:81–98. Cham: Springer. https://doi.org/10.1007/978-3-031-70074-3_5.
Chicago author-date (all authors)
Baert, Mattijs, Sam Leroux, and Pieter Simoens. 2024. “Learning Temporal Task Specifications from Demonstrations.” In , 14847:81–98. Cham: Springer. doi:10.1007/978-3-031-70074-3_5.
Vancouver
1.
Baert M, Leroux S, Simoens P. Learning temporal task specifications from demonstrations. In Cham: Springer; 2024. p. 81–98.
IEEE
[1]
M. Baert, S. Leroux, and P. Simoens, “Learning temporal task specifications from demonstrations,” presented at the 6th International Workshop, EXTRAAMAS 2024, Auckland, New Zealand, 2024, vol. 14847, pp. 81–98.
@inproceedings{01J914A7MG4B2042HT11123XAH,
  abstract     = {{As we progress towards real-world deployment, the critical need for interpretability in reinforcement learning algorithms grows more pivotal, ensuring the safety and reliability of intelligent agents. This paper tackles the challenge of acquiring task specifications in linear temporal logic through expert demonstrations, aiming to alleviate the burdensome task of specification engineering. The rich semantics of temporal logics serve as an interpretable framework for delineating intricate, multi-stage tasks. We propose a method which iteratively learns a task specification and a nominal policy solving this task. In each iteration, the task specification is refined to better distinguish expert trajectories from trajectories sampled from the nominal policy. With this process we obtain a concise and interpretable task specification. Unlike previous work, our method is capable of learning directly from trajectories in the original state space and does not require predefined atomic propositions. We showcase the effectiveness of our method on multiple tasks in both an office and a Minecraft-inspired environment.}},
  author       = {{Baert, Mattijs and Leroux, Sam and Simoens, Pieter}},
  isbn         = {{9783031700736}},
  issn         = {{0302-9743}},
  language     = {{eng}},
  location     = {{Auckland, New Zealand}},
  pages        = {{81--98}},
  publisher    = {{Springer}},
  title        = {{Learning temporal task specifications from demonstrations}},
  url          = {{http://doi.org/10.1007/978-3-031-70074-3_5}},
  volume       = {{14847}},
  year         = {{2024}},
}

Altmetric
View in Altmetric