- Least-Squares Temporal Difference Learning
Boyan (ResearchIndex)
TD is a popular family of algorithms for approximate policy evaluation in large MDPs. TD works by incrementally updating the value function after each observed transition. It has two major drawbacks: it makes inefficient use of data, and it requires the user to manually tune a stepsize schedule for g
boyan cmu con difference edu international jab justin learning1999 least-squares lstdl proceedings pubs sixteenth temporal the
By Bagfields in Uncategorized with uncategorized boyan researchindex
- Computer and Information Science Papers CiteSeer Publications ResearchIndex
CiteSeer: Scientific Literature Digital Library incorporating autonomous citation indexing, awareness and tracking, citation context, related document retrieval, similar document identification, citation graph analysis, and query-sensitive document summaries. Advantages in terms of availability, coverage, timeliness, and efficiency. Lee Giles, Steve Lawrence, Kurt Bollacker, Isaac
autonomous citation citeseer computer index indexing literature researchindex science scienceindex scientific
By glebarr in SCIENCE with citeseer computer information papers publications researchindex science by 32 users
- ResearchIndex
CiteSeer: Scientific Literature Digital Library incorporating autonomous citation indexing, awareness and tracking, citation context, related document retrieval, similar document identification, citation graph analysis, and query-sensitive document summaries. Advantages in terms of availability, coverage, timeliness, and efficiency. Lee Giles, Steve Lawrence, Kurt Bollacker, Isaac
autonomous citation citeseer computer index indexing literature researchindex science scienceindex scientific
By mcswell in Computing with computing researchindex by 18 users
- ResearchIndex [NEC Research Institute; Steve Lawrence, Kurt Bollacker, Lee Giles; Computer Science]
By atg1972 in Documents > Document References with documents institute kurt lawrence nec references researchindex steve by 23 users