Theses
context of technical systems such as robots or autonomous vehicles, however, there are additional challenges, since it is not possible to perform arbitrarily many experiments on the real system, in particular [...] to the real system. However, there is a sim-to-real gap: the model is never 100% accurate, so the learned policy can be suboptimal or even infeasible for the real system. The task [...] modifying the action such that it works successfully on the real system while requiring only a very small number of interactions with the real system.
Reinforcement learning with continuous symmetries (MSc) Extension …