Update README.md

This commit is contained in:
Denis Yarats 2019-10-12 14:23:11 -04:00 committed by GitHub
parent be6aa03262
commit 82cee4746f
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -64,4 +64,4 @@ which just tells the expected reward `ER` evaluating current policy after `S` st
Our method demonstrates significantly improved performance over the baseline SAC:pixel. It matches the state-of-the-art performance of model-based algorithms, such as PlaNet (Hafner et al., 2018) and SLAC (Lee et al., 2019), as well Our method demonstrates significantly improved performance over the baseline SAC:pixel. It matches the state-of-the-art performance of model-based algorithms, such as PlaNet (Hafner et al., 2018) and SLAC (Lee et al., 2019), as well
as a model-free algorithm D4PG (Barth-Maron et al., 2018), that also learns from raw images. Our as a model-free algorithm D4PG (Barth-Maron et al., 2018), that also learns from raw images. Our
algorithm exhibits stable learning across ten random seeds and is extremely easy to implement. algorithm exhibits stable learning across ten random seeds and is extremely easy to implement.
![Results](https://lh5.googleusercontent.com/Q2sYkdKmNa7qgUX-XaH3i2Nl2b-YtcAnjTXfi46Abp_F2L2IgAelgEp6iqbK6RaJI0xt886yJM_92G5E9osCIQK_QgGnJntn41CdsxCHC76wjSxIZGM=w1175) ![Results](results/graph.png)