I designed a custom gridworld environment and trained an agent to navigate from a start point to an end point while avoiding obstacles and collecting rewards.
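
For illustration, here is a minimal sketch of what a setup like this could look like. It is not the original code. The 4x4 layout, the obstacle and bonus cells, the reward values, and the names `GridWorld` and `train_q_learning` are all assumptions. Tabular Q-learning is used only as one plausible training method, since the note above does not say which algorithm was used.

```python
import random
from collections import defaultdict

# Illustrative layout only: the grid size, obstacle cells, bonus cell, and
# reward values are assumptions, not the original environment's settings.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


class GridWorld:
    def __init__(self, size=4, start=(0, 0), goal=(3, 3),
                 obstacles=((1, 1), (2, 1)), bonuses=((0, 3),)):
        self.size, self.start, self.goal = size, start, goal
        self.obstacles = frozenset(obstacles)
        self.bonuses = frozenset(bonuses)
        self.reset()

    def _state(self):
        # Include the uncollected bonuses so the state stays Markov.
        return (self.pos, self.remaining)

    def reset(self):
        self.pos = self.start
        self.remaining = self.bonuses
        return self._state()

    def step(self, action):
        """Apply an action index and return (state, reward, done)."""
        dr, dc = ACTIONS[action]
        r, c = self.pos[0] + dr, self.pos[1] + dc
        in_bounds = 0 <= r < self.size and 0 <= c < self.size
        if not in_bounds or (r, c) in self.obstacles:
            return self._state(), -1.0, False  # blocked: stay put, pay a penalty
        self.pos = (r, c)
        if self.pos == self.goal:
            return self._state(), 10.0, True  # reaching the goal ends the episode
        if self.pos in self.remaining:
            self.remaining = self.remaining - {self.pos}
            return self._state(), 1.0, False  # one-time bonus reward
        return self._state(), -0.1, False  # small cost for an ordinary step


def train_q_learning(env, episodes=2000, alpha=0.5, gamma=0.95,
                     epsilon=0.2, max_steps=100, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = defaultdict(lambda: [0.0] * len(ACTIONS))
    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))  # explore
            else:
                action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
            next_state, reward, done = env.step(action)
            # Bootstrap from the best next action unless the episode ended.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q
```

Calling `q = train_q_learning(GridWorld())` returns a table that maps each visited state to four action values. A greedy policy picks the highest-valued action in each state. The step cost, obstacle penalty, and goal reward are arbitrary here and would need tuning for a real layout.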