Devise suitable features for reinforcement learning in stochastic grid worlds (generalizations of the $4\times 3$ world) that contain multiple obstacles and multiple terminal states with rewards of $+1$ or $-1$.
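
As a starting point, one possible feature set can be sketched in Python. Everything here is an assumption made for illustration and is not given in the exercise: the names `grid_features` and `linear_value`, the zero-based `(column, row)` coordinates, and the choice of features (a bias term, normalized position, Manhattan distance to the nearest $+1$ and nearest $-1$ terminal, and the fraction of blocked neighboring cells). Manhattan distance ignores obstacles, so it is only a rough proxy for true path length.

```python
from typing import List, Sequence, Set, Tuple

Cell = Tuple[int, int]  # (column, row), zero-based; an assumed convention


def manhattan(a: Cell, b: Cell) -> int:
    """Manhattan distance between two cells, ignoring obstacles."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def grid_features(state: Cell, width: int, height: int,
                  obstacles: Set[Cell], goals: Set[Cell],
                  pits: Set[Cell]) -> List[float]:
    """Hypothetical feature vector for one state of a grid world.

    goals are the +1 terminals and pits are the -1 terminals.
    """
    x, y = state
    scale = float(width + height)  # upper bound on any Manhattan distance

    # Normalized distance to the nearest terminal of each kind
    # (1.0 when the world has no terminal of that kind).
    d_goal = min((manhattan(state, g) for g in goals), default=scale) / scale
    d_pit = min((manhattan(state, p) for p in pits), default=scale) / scale

    # Fraction of the four neighbors that are walls or obstacles.
    neighbors = [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]
    blocked = sum(
        1 for (nx, ny) in neighbors
        if not (0 <= nx < width and 0 <= ny < height) or (nx, ny) in obstacles
    )

    return [
        1.0,                      # bias term
        x / max(width - 1, 1),    # normalized column
        y / max(height - 1, 1),   # normalized row
        d_goal,
        d_pit,
        blocked / 4.0,
    ]


def linear_value(theta: Sequence[float], features: Sequence[float]) -> float:
    """Linear utility estimate: the dot product of weights and features."""
    return sum(t * f for t, f in zip(theta, features))


# Usage on a layout resembling the 4x3 world: one obstacle at (1, 1),
# a +1 terminal at (3, 2) and a -1 terminal at (3, 1).
f = grid_features((0, 0), 4, 3, {(1, 1)}, {(3, 2)}, {(3, 1)})
```

Because each feature depends only on relative layout and is scaled to $[0, 1]$, the same weight vector can in principle be reused across grids of different sizes, although whether these particular features are sufficient for a given world is left open.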
