# CP4 Competition Instruction
📗 In this competition, you will implement the policy gradient algorithm to find and approximate an equilibrium to a Markov game. In particular, you are one of many players (influencers) whose house is at a fixed location in \(\left[0, 1\right] \times \left[0, 1\right]\) and your goal is to get the dog (receiver) to a location as close as possible to your house, while the other players with possibly different house locations want to do the same. In every round, the players move simultaneous to a location within a fixed radius of their current location. The dog will always move to the center (average location) of all players after every round.
📗 Submit a policy neural network to play against other students.
# Submission
Your submission should contain (i) your player name (not necessarily your real name), (ii) your player icon (single emoji from this list:
Link), (iii) your house location, (iv) your network to control the player.