TD Learning with Linear Function Approximation

News

Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation

The focus is on decentralized temporal-difference (TD) learning with linear function approximation in the presence of unreliable or even malicious agents, termed as Byzantine agents. In order to ...

www.cs.utexas.edu10y

On the Convergence of Temporal-Difference Learning with Linear Function Approximation

@Article{Tadic:2001, author = "Tadi\'{c}, Vladislav", title = "On the Convergence of Temporal-Difference Learning with Linear Function Approximation", journal ...

Microsoft2y

Effective Multi-step Temporal-Difference Learning for Non-Linear Function Approximation

is one of the most popular forms of TD learning for linear function approximation. The reason is that multi-step methods often yield substantially better performance than their single-step ...

www.cs.utexas.edu10y

Fast gradient-descent methods for temporal-difference learning with linear function approximation

@InProceedings{Sutton+MPBSSW:2009, author = "Sutton, Richard S. and Maei, Hamid Reza and Precup, Doina and Bhatnagar, Shalabh and Silver, David and Szepesv{\'a}ri, Csaba and Wiewiora, Eric", title = ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results