subject
Mathematics, 14.02.2020 17:12 ldestl

In a coin game, you repeatedly toss a biased coin (0.4 for head, 0.6 for tail). Each head represents 3 points and the tail represents 1 point. You can either Toss or Stop if the total number of points you have tossed is no more than 7. Otherwise, you must Stop. When you Stop, your utility is equal to your total points (up to 7), or 0 if you get a total of 8 points or higher. When you Toss, you receive no utility. There is no discounting.
a. What are the states and the actions for this MDP? Which states are terminal?
b. What is the transition function and the reward function for this MDP? Hint: The problem may be simpler to formulate using the general version of rewards: R(s, a, s')
c. Run value iteration to find the optimal value function V* for the MDP. Show each Vk step (starting from Vo(s) = 0 for all states s). For a reasonable MDP formulation, this should converge in fewer than 10 steps. If you find it too tedious to do by hand, you may write a program to do this for you; however, there may be some benefit in seeing the calculation unfolding in front of you.
d. Using the V* you found, determine the optimal policy for this MDP.

ansver
Answers: 2

Another question on Mathematics

question
Mathematics, 21.06.2019 16:30
Acarnival sold tickets for $1.50 for adults and $1.00 for students. there were 54 tickets sold for a total of $70.50. write a system of equations to represent the number of adult tickets,x, and the number of student tickets,y. find the solution and explain what it means. i'll you if you me
Answers: 1
question
Mathematics, 21.06.2019 17:00
Orly uses 2 cups of raisins for every 12 cups of trail mix she makes. how many cups of trail mix will she make if she uses 8 cups of raisins?
Answers: 1
question
Mathematics, 21.06.2019 17:30
The table shows the balance of a money market account over time. write a function that represents the balance y(in dollars) after t years.
Answers: 2
question
Mathematics, 21.06.2019 19:00
Billy plotted −3 4 and −1 4 on a number line to determine that −3 4 is smaller than −1 4 .is he correct? explain why or why not
Answers: 3
You know the right answer?
In a coin game, you repeatedly toss a biased coin (0.4 for head, 0.6 for tail). Each head represents...
Questions
Questions on the website: 13722363