📗 Number of periods (H):
📗 Number of states (|S|):
📗 Number of actions (|A_1|, |A_2|, ...):
📗 Range of reward: min, max
📗 Constraints: bounds, worst case, zero sum
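The parameters above specify the problem instance. As a rough sketch, they could be collected in a single container like the one below; the names (`GameSpec` and its fields) are illustrative assumptions, not the tool's actual interface:

```python
from dataclasses import dataclass

@dataclass
class GameSpec:
    # Illustrative container for the instance parameters listed above.
    H: int                 # number of periods
    n_states: int          # |S|
    n_actions: tuple       # (|A_1|, |A_2|, ...), one entry per player
    r_min: float           # reward lower bound
    r_max: float           # reward upper bound
    zero_sum: bool = False # whether players' rewards must sum to zero

# Example: a two-player, zero-sum instance with 3 periods and 2 states.
spec = GameSpec(H=3, n_states=2, n_actions=(2, 2),
                r_min=-1.0, r_max=1.0, zero_sum=True)
assert spec.r_min <= spec.r_max  # the bounds constraint
```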
(Uniform transitions)
📗 Mean rewards (R):
📗 Transition probabilities (T):
📗 Initial state (Mu):
📗 Number of episodes (K):
(Uniformly distributed actions)
📗 Policy (P0):
📗 Variance of reward (Gaussian):
(Coverage)
📗 Simulated data (E0, based on H, S, A, R, T, Mu, P0):
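The simulation step above draws K episodes of length H using the mean rewards R, transitions T, initial distribution Mu, and policy P0, with Gaussian noise on the rewards. A minimal single-agent sketch, assuming tabular arrays and a stationary policy (the function name `simulate` and its signature are assumptions, not the tool's API):

```python
import numpy as np

def simulate(K, H, R, T, Mu, P0, sigma, rng=None):
    """Draw K episodes of length H from a tabular MDP.

    R[s, a]  : mean reward, T[s, a] : next-state distribution over S,
    Mu       : initial-state distribution, P0[s] : action distribution,
    sigma    : std. dev. of the Gaussian reward noise.
    """
    rng = rng or np.random.default_rng(0)
    n_states, n_actions = R.shape
    episodes = []
    for _ in range(K):
        s = rng.choice(n_states, p=Mu)          # initial state from Mu
        traj = []
        for _ in range(H):
            a = rng.choice(n_actions, p=P0[s])  # action from policy P0
            r = R[s, a] + sigma * rng.standard_normal()
            s_next = rng.choice(n_states, p=T[s, a])
            traj.append((s, a, r))
            s = s_next
        episodes.append(traj)
    return episodes

# Toy 2-state, 2-action instance with uniform transitions
# and uniformly distributed actions, matching the options above.
R = np.array([[0.0, 1.0], [1.0, 0.0]])
T = np.full((2, 2, 2), 0.5)   # T[s, a] uniform over next states
Mu = np.array([0.5, 0.5])
P0 = np.full((2, 2), 0.5)
E0 = simulate(K=10, H=3, R=R, T=T, Mu=Mu, P0=P0, sigma=0.1)
```

Each episode in `E0` is a list of `(state, action, reward)` tuples, which is one plausible shape for the simulated dataset.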