TD Learning with Linear Function Approximation

News

Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation

The focus is on decentralized temporal-difference (TD) learning with linear function approximation in the presence of unreliable or even malicious agents, termed as Byzantine agents. In order to ...

www.cs.utexas.edu10y

On the Convergence of Temporal-Difference Learning with Linear Function Approximation

@Article{Tadic:2001, author = "Tadi\'{c}, Vladislav", title = "On the Convergence of Temporal-Difference Learning with Linear Function Approximation", journal ...

GitHub3y

linear-function-approximation

Chen, Z., Zhang, S., Doan, T. T., Clarke, J. P., & Maguluri, S. T. (2019). Finite-sample analysis of nonlinear stochastic approximation with applications in ...

www.cs.utexas.edu10y

Fast gradient-descent methods for temporal-difference learning with linear function approximation

@InProceedings{Sutton+MPBSSW:2009, author = "Sutton, Richard S. and Maei, Hamid Reza and Precup, Doina and Bhatnagar, Shalabh and Silver, David and Szepesv{\'a}ri, Csaba and Wiewiora, Eric", title = ...

Microsoft2y

Effective Multi-step Temporal-Difference Learning for Non-Linear Function Approximation

is one of the most popular forms of TD learning for linear function approximation. The reason is that multi-step methods often yield substantially better performance than their single-step ...

Microsoft4y

Optimism in reinforcement learning with generalized linear function approximation

We design a new provably efficient algorithm for episodic reinforcement learning with generalized linear function approximation. We analyze the algorithm under a new expressivity assumption that we ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results