Goal: the ability of a legged robot to recover from a fall.
Contribution: a hierarchical behavior-based controller. The controller has 4 neural network policies: 3 behaviors and 1 behavior selector.
Multi-task deep reinforcement learning (DRL).
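
A minimal sketch of how such a hierarchy could be wired together: one selector policy picks which of the 3 behavior policies acts on the current observation. Everything here is an assumption made for illustration and not taken from the paper: the names `HierarchicalController` and `make_linear_policy`, the observation and action sizes, the argmax selection rule, and the fixed random linear maps standing in for trained neural networks.

```python
import random
from typing import Callable, List, Sequence, Tuple

Vector = List[float]
Policy = Callable[[Sequence[float]], Vector]


def make_linear_policy(obs_dim: int, out_dim: int, seed: int) -> Policy:
    """Hypothetical stand-in for a trained network: a fixed random linear map."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(obs_dim)]
               for _ in range(out_dim)]

    def policy(obs: Sequence[float]) -> Vector:
        # One output per weight row: dot product of the row with the observation.
        return [sum(w * x for w, x in zip(row, obs)) for row in weights]

    return policy


class HierarchicalController:
    """Behavior selector on top, behavior policies underneath."""

    def __init__(self, behaviors: List[Policy], selector: Policy) -> None:
        self.behaviors = behaviors
        self.selector = selector

    def select(self, obs: Sequence[float]) -> int:
        # Assumed rule: the selector scores each behavior, highest score wins.
        scores = self.selector(obs)
        return max(range(len(scores)), key=scores.__getitem__)

    def act(self, obs: Sequence[float]) -> Tuple[int, Vector]:
        index = self.select(obs)
        return index, self.behaviors[index](obs)


# Illustrative sizes only; the notes do not give the real dimensions.
OBS_DIM, ACT_DIM, NUM_BEHAVIORS = 6, 4, 3

behaviors = [make_linear_policy(OBS_DIM, ACT_DIM, seed=i)
             for i in range(NUM_BEHAVIORS)]
selector = make_linear_policy(OBS_DIM, NUM_BEHAVIORS, seed=99)
controller = HierarchicalController(behaviors, selector)

observation = [0.1, -0.2, 0.3, 0.0, 0.5, -0.4]
chosen, action = controller.act(observation)
print(f"selected behavior {chosen}, action has {len(action)} entries")
```

The count matches the notes: 3 behavior policies plus 1 selector gives 4 policies in total. In a real system each policy would be a trained network, and the selector might output something richer than a single index.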