Follow
Thiago D. Simão
Title
Cited by
Cited by
Year
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
AAAI, 10639-10646, 2021
1022021
AlwaysSafe: Reinforcement learning without safety constraint violations during training
TD Simão, N Jansen, MTJ Spaan
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
452021
Safe Policy Improvement with an Estimated Baseline Policy
TD Simão, R Laroche, R Tachet des Combes
Proceedings of the 19th International Conference on Autonomous Agents and …, 2020
32*2020
Safety-constrained reinforcement learning with a distributional safety critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
Machine Learning 112 (3), 859-887, 2023
312023
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
TD Simão, MTJ Spaan
Proceedings of the AAAI Conference on Artificial Intelligence 33, 4967-4974, 2019
312019
Robust anytime learning of Markov decision processes
M Suilen, TD Simão, D Parker, N Jansen
Advances in Neural Information Processing Systems 35, 28790-28802, 2022
212022
Structure Learning for Safe Policy Improvement
TD Simão, MTJ Spaan
Proceedings of the 28th International Joint Conference on Artificial …, 2019
122019
Decision-making under uncertainty: beyond probabilities: Challenges and perspectives
T Badings, TD Simão, M Suilen, N Jansen
International Journal on Software Tools for Technology Transfer 25 (3), 375-391, 2023
102023
Safe policy improvement for POMDPs via finite-state controllers
TD Simão, M Suilen, N Jansen
Proceedings of the AAAI Conference on Artificial Intelligence 37 (12), 15109 …, 2023
92023
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Y Hogewind, TD Simão, T Kachman, N Jansen
arXiv preprint arXiv:2210.01801, 2022
82022
Reinforcement Learning by Guided Safe Exploration
Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan
arXiv preprint arXiv:2307.14316, 2023
7*2023
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ...
2022 IEEE 25th International Conference on Intelligent Transportation …, 2022
52022
More for Less: Safe Policy Improvement With Stronger Performance Guarantees
P Wienhöft, M Suilen, TD Simão, C Dubslaff, C Baier, N Jansen
arXiv preprint arXiv:2305.07958, 2023
42023
Act-then-measure: reinforcement learning for partially observable environments with active measuring
M Krale, TD Simão, N Jansen
Proceedings of the International Conference on Automated Planning and …, 2023
32023
Scalable Safe Policy Improvement via Monte Carlo Tree Search
A Castellini, F Bianchi, E Zorzi, TD Simão, A Farinelli, MTJ Spaan
International Conference on Machine Learning, 3732-3756, 2023
32023
Planejamento probabilístico com becos sem saída
TD Simão
Universidade de São Paulo, 2017
22017
Utilização de algoritmos genéticos para otimização de soluções para o timetabling escolar
TD SIMÃO
Tese apresentada ao Departamento de Ciência da Computação da Universidade …, 2013
22013
Risk-aware curriculum generation for heavy-tailed task distributions
C Koprulu, TD Simão, N Jansen, U Topcu
Uncertainty in Artificial Intelligence, 1132-1142, 2023
12023
Recursive small-step multi-agent A* for dec-POMDPs
W Koops, N Jansen, S Junges, TD Simão
Sl: IJCAI, 2023
12023
Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments.
TD Simão
IJCAI, 6460-6461, 2019
12019
The system can't perform the operation now. Try again later.
Articles 1–20