Mar 11, 2017
Hi Ryan,
It would certainly be possible to use some of the exploration techniques discussed in Part 7 with A3C. The issue is that each technique introduces new hyperparameters to adjust. Adding them on top of entropy regularization may help, but only if those extra hyperparameters are tuned properly.
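
To make the hyperparameter point concrete, here is a minimal NumPy sketch (not the actual A3C code from the tutorial) of an actor loss with an entropy bonus, plus an action sampler that layers epsilon-greedy and Boltzmann-style exploration on top. The function names and the defaults for `entropy_beta`, `temperature`, and `epsilon` are assumptions chosen purely for illustration.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    # Higher temperature flattens the distribution (more exploration).
    z = np.asarray(logits, dtype=np.float64) / temperature
    z = z - z.max()  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def entropy(probs, eps=1e-12):
    # Shannon entropy of the action distribution.
    return float(-np.sum(probs * np.log(probs + eps)))

def policy_loss(logits, action, advantage, entropy_beta=0.01):
    # Policy-gradient term minus an entropy bonus weighted by entropy_beta.
    probs = softmax(logits)
    log_prob = np.log(probs[action] + 1e-12)
    return float(-log_prob * advantage - entropy_beta * entropy(probs))

def sample_action(logits, rng, temperature=1.0, epsilon=0.0):
    # Hypothetical extra exploration on top of the stochastic policy:
    # with probability epsilon act uniformly at random, otherwise sample
    # from the temperature-scaled softmax.
    n = len(logits)
    if rng.random() < epsilon:
        return int(rng.integers(n))
    return int(rng.choice(n, p=softmax(logits, temperature)))

rng = np.random.default_rng(0)
logits = [2.0, 0.5, -1.0]
action = sample_action(logits, rng, temperature=2.0, epsilon=0.1)
loss = policy_loss(logits, action, advantage=1.0, entropy_beta=0.01)
```

Even in this toy version there are already three knobs (`entropy_beta`, `temperature`, `epsilon`) that interact with one another, which is why stacking techniques tends to make tuning harder.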