subject

We would like to use a Q-learning agent for Pacman (as you did in PA3), but the state space for a large grid is too massive to hold in memory. To solve this, we will switch to feature-based representation of the Pacman game state (similar to PA3 Q10), where we assume that Q(s, a) can be expressed as a (weighted) linear combination of state-action features: Q(s, a)= Σωifi(s, a)

Required:
Suppose we design two features to represent the state (independent of actions): f (s) is the number of ghosts within one step of Pacman, and fp(s) is the number of food pellets within one step of Pacman. Note that we do not have any features dependent on a. Why might this be a bad idea?

ansver
Answers: 3

Another question on Computers and Technology

question
Computers and Technology, 23.06.2019 00:30
If joey was single and his taxable income was $9,500, how much would he pay in taxes each year?
Answers: 1
question
Computers and Technology, 23.06.2019 02:30
Experimental data that is expressed using numbers is said to be
Answers: 1
question
Computers and Technology, 23.06.2019 04:10
2pointswho was mikhail gorbachev? oa. a russian leader who opposed a coupob. a polish leader who founded the labor union "solidarityoc. a soviet leader who called for a closer relationship with the unitedstates, economic reform, and a more open societyd. a soviet leader who called for more oppression in the soviet union
Answers: 3
question
Computers and Technology, 23.06.2019 21:50
Description: write function lastfirst() that takes one argument—a list of strings of the format "lastname, firstname" —and returns a list consisting of two lists: (a) a list of all the last names (b) a list of all the first names
Answers: 2
You know the right answer?
We would like to use a Q-learning agent for Pacman (as you did in PA3), but the state space for a la...
Questions
question
History, 28.01.2020 16:06
question
History, 28.01.2020 16:06
Questions on the website: 13722363