human-robot interaction

Sample and feedback efficient hierarchical reinforcement learning from human preferences

We incorporate bi-perspective reward learning from human preferences into a general hierarchical reinforcement learning framework for robotic grasping.
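
The summary above does not spell out how a reward is learned from preferences. As a rough illustration only, the sketch below fits a reward with a Bradley-Terry model over pairwise comparisons of trajectory segments, which is one common formulation and not necessarily the one used in this work. The linear reward, the two-dimensional features, the simulated annotator, and all function names are assumptions made for the example.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def segment_return(w, segment):
    """Sum of linear rewards w . phi over the feature vectors in one segment."""
    return sum(sum(wi * xi for wi, xi in zip(w, phi)) for phi in segment)


def preference_prob(w, seg_a, seg_b):
    """Bradley-Terry probability that seg_a is preferred over seg_b."""
    return sigmoid(segment_return(w, seg_a) - segment_return(w, seg_b))


def fit_reward(pairs, dim, lr=0.1, epochs=200):
    """Stochastic gradient ascent on the preference log-likelihood.

    pairs: list of (seg_a, seg_b, label), label = 1 if seg_a was preferred.
    """
    w = [0.0] * dim
    for _ in range(epochs):
        for seg_a, seg_b, label in pairs:
            p = preference_prob(w, seg_a, seg_b)
            for i in range(dim):
                # Difference of summed features between the two segments.
                diff_i = sum(phi[i] for phi in seg_a) - sum(phi[i] for phi in seg_b)
                w[i] += lr * (label - p) * diff_i
    return w


def demo(seed=0, n_pairs=50, steps=3):
    """Simulate an annotator who prefers higher return under hidden weights."""
    rng = random.Random(seed)
    true_w = [1.0, -1.0]  # hypothetical ground-truth reward weights

    def random_segment():
        return [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(steps)]

    pairs = []
    for _ in range(n_pairs):
        a, b = random_segment(), random_segment()
        label = 1 if segment_return(true_w, a) > segment_return(true_w, b) else 0
        pairs.append((a, b, label))
    return fit_reward(pairs, dim=2)


if __name__ == "__main__":
    learned = demo()
    print("learned weights:", learned)
```

In a hierarchical setting, a reward fitted this way could be queried by whichever level of the policy the preferences were collected for; how the two perspectives mentioned above are combined is not specified in this summary and is left out of the sketch.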