Given the Grid World as shown in Figure (a): The agent starts from point S and must reach the goal G(4,6). The gray areas represent walls, which the agent cannot pass through. Each step gives a reward of -1, so the objective is to reach the goal in the shortest path possible. Modify the QLearning.ipynb file as follows: Implement the gray wall solutionspile.com

Question

Given the Grid World as shown in Figure (a):  The agent starts from point S and must reach the goal G(4,6). The gray areas  represent walls, which the agent cannot pass through.  Each step gives a reward of -1, so the objective is to reach the goal in the shortest  path possible.  Modify the QLearning.ipynb file as follows:  Implement the gray wall areas in the class GridWorld().  Inside the training loop, after a fixed number of iterations (e.g., every 50 or 100  episodes),  print a grid that displays the best action (with the highest Q-value) for each state  using arrows, as in Figure (b).  Leave the wall cells blank, without any arrows.  Repeat this process every 50 or 100 iterations and generate about 8 to 10 grids in  total.   Q-learning  Algorithm 9-9: Q-Learning (State-Action Version of TD Control) - Input: Episode generator - Output: Optimal policy  (  hat{ pi}  ), optimal state-action value function  (  hat{ mathrm{q}}  ) ``` plaintext for(s  in S and a  in A) initialize q(s, a) with arbitrary values.     If s is a terminal state, set q(s, a) = 0 repeat     Initialize the starting state s.     repeat                 Choose an action a from state s according to q. // apply epsilon-greedy                 Take action a, observe the next state s' and reward r. // role of MDP q(s, a) = q(s,a) +  rho(r + v max_a' q(s', a') - q(s, a)) // Eq. (9.26)         s = s'     until (s' is a goal state) until (termination condition is met); q}= ```  Q-learning Flowchart 1. Start 2. I solutionspile.com

Accepted Answer

Expert Answer to - Given the Grid World as shown in Figure (a):  The agent starts from point S and must reach the goal

Answer

Solution for - Given the Grid World as shown in Figure (a):  The agent starts from point S and must reach the goal

Answer

This an additional answer to - Given the Grid World as shown in Figure (a):  The agent starts from point S and must reach the goal

(Solved): Given the Grid World as shown in Figure (a): The agent starts from point S and must reach the goal ...

View Expert Answer

Expert Answer

Buy This Answer $5

Place Order

We Provide Services Across The Globe