Jiskani Ashfaq Muhammad


Highlighted by Jiskani Ashfaq Muhammad


From Deep Reinforcement Learning Demysitifed (Episode 2) — Policy Iteration, Value Iteration and… by Moustafa Alzantot

…s introduce another function, the state-action pair Q function. Q takes a state-action pair as input and returns a real value.
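
As a minimal sketch of what the highlight describes, a Q function over a small finite problem can be stored as a table keyed by (state, action) pairs. The state names, action names, stored numbers, and the helper `q_value` below are hypothetical, chosen only for illustration; they do not come from the quoted article.

```python
from typing import Dict, Tuple

State = str
Action = str

# Hypothetical toy table: every (state, action) pair maps to one real number.
q_table: Dict[Tuple[State, Action], float] = {
    ("s0", "left"): 0.0,
    ("s0", "right"): 1.0,
    ("s1", "left"): 0.5,
    ("s1", "right"): -0.25,
}


def q_value(state: State, action: Action) -> float:
    """Look up Q(state, action); pairs not in the table default to 0.0."""
    return q_table.get((state, action), 0.0)


if __name__ == "__main__":
    # Q is called with a state and an action together, and returns a float.
    print(q_value("s0", "right"))
    print(q_value("s1", "right"))
```

The point of the sketch is only the signature: the input is a pair, the output is a single real value. How those values are computed is outside what the highlight covers.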