The video shows a 7-DOF Barrett WAM manipulator learning to flip pancakes by reinforcement learning. The motion is encoded as a mixture of basis force fields through an extension of Dynamic Movement Primitives (DMP) that represents the synergies across the different variables through stiffness matrices. An inverse dynamics controller with variable stiffness is used for reproduction. The skill is first demonstrated via kinesthetic teaching and then refined with the Policy learning by Weighting Exploration with the Returns (PoWER) algorithm. After 50 trials, the robot learns that the first part of the task requires stiff behavior to throw the pancake into the air, while the second part requires the hand to be compliant in order to catch the pancake without it bouncing off the pan.

Video credits:
Dr. Petar Kormushev kormushev.com
Dr. Sylvain Calinon http

Affiliation:
Advanced Robotics dept., Italian Institute of Technology (IIT)

Link to publication: programming-by-demonstration.org
Link to publication on PoWER algorithm by Jens Kober and Jan Peters: books.nips.cc
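To give a feel for the refinement step mentioned above, here is a minimal sketch of a reward-weighted parameter update in the spirit of PoWER. It is not the controller from the video: the two-dimensional parameter vector, the `toy_return` function, the exploration noise `sigma`, and the choice of reusing the 5 best rollouts are all assumptions made for illustration. In this simplified form, the new parameters are the current ones plus the return-weighted average of the explored offsets, taken over the best rollouts seen so far.

```python
import numpy as np


def power_update(theta, rollouts, n_best=5):
    """Simplified reward-weighted update over the best rollouts so far.

    rollouts: list of (params, ret) pairs, with ret >= 0.
    Returns theta + sum_k (params_k - theta) * ret_k / sum_k ret_k.
    """
    best = sorted(rollouts, key=lambda r: r[1], reverse=True)[:n_best]
    params = np.array([p for p, _ in best])
    returns = np.array([r for _, r in best])
    total = returns.sum()
    if total <= 0:
        # No informative rollout yet: keep the current parameters.
        return theta
    return theta + ((params - theta) * returns[:, None]).sum(axis=0) / total


def toy_return(theta, target):
    """Hypothetical stand-in for the task return: highest at `target`."""
    return float(np.exp(-np.sum((theta - target) ** 2)))


def run(n_trials=50, sigma=0.3, seed=0):
    rng = np.random.default_rng(seed)
    target = np.array([1.0, -0.5])   # assumed optimum of the toy problem
    theta = np.zeros(2)              # stands in for the demonstrated policy
    rollouts = []
    for _ in range(n_trials):
        # Explore around the current parameters, then record the return.
        candidate = theta + sigma * rng.standard_normal(theta.shape)
        rollouts.append((candidate, toy_return(candidate, target)))
        theta = power_update(theta, rollouts, n_best=5)
    return theta, target


if __name__ == "__main__":
    theta, target = run()
    print("learned:", theta, "target:", target)
```

On the robot, the parameter vector would instead hold the policy parameters of the DMP-based encoding, and each return would come from one physical trial; the sketch only illustrates the weighting of exploration by returns.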
Note: This is a user-submitted video in the “User Submitted” category. Although submitted content is moderated, RealityPod does not guarantee the accuracy of information provided in posts in this category.