# Finite Horizon Stochastic Game


📗 Number of periods (H): .
📗 Number of states (|S|): .
📗 Number of actions (|A_1|, |A_2|, ...): .
📗 Range of reward: min , max
📗 Constrains: bounds, worse case, zero sum
Uniform transition
📗 Mean rewards (R):

📗 Transition probabilities (T):

📗 Initial state (Mu):
📗 Number of episodes (K):
(Uniformly distributed actions)
📗 Policy (P0):

📗 Variance of reward (Gaussian):
Coverage,
📗 Simulated data (E0, based on H, S, A, R, T, Mu, P0):


# Estimated Game


📗 Estimated mean rewards (R0, based on E0):

📗 Estimated transition probabilities (T0, based on E0):

📗 Estimated initial state (Mu0, based on E0):

# Poison Attack

Zero-Target, (Uniform random deterministic actions)
📗 Target Policy (P1):

📗 Epsilon:
Quadratic, Dominant, Nash, Ignore Off-Path,
📗 Poisoned data (E1, based on H, S, A, R0, T0, Mu0, P1):

📗 Total cost:
Dominant, Rationalizability, Markov Perfect, Nash, Feasibility, Mean Feasibility
📗 List of costs:

# Estimated Poisoned Game


📗 Estimated mean rewards (R1, based on E1):

📗 Estimated transition probabilities (T1, based on E1):

📗 Estimated initial state (Mu1, based on E1):

📗 Q function without attack (Q0, based on H, S, A, R0, T0, Mu0, P1):

📗 Q function after attack (Q1, based on H, S, A, R1, T1, Mu1, P1):


# Printing

📗 Experiment name:



📗 State names:
📗 Action names:


📗 Repeated:
📗 Variable : from to

📗 Attack costs:





Last Updated: December 14, 2022 at 1:43 AM