The field of Human-Robot interaction (HRI) encompasses a vast of research, from psychology, to reinforcement learning, to action recognition. For our project, we are particularly interested in interaction in outdoor environments, where communication with both collaborators (such as policemen and firemen in an emergency response situations) or victims has to be established.
First, however, we need to create a real-time person recognition system, which is a challenge given the complexity in available recognition systems related to this topic. Fortunately, there are parts that are robust and real time. They are face and pedestrian detection. It is absolutely critical that the vision system detects the people around Beobot 2.0 efficiently and accurately. After that, the system can then continue and refine the information that we have about them, through processes such as pose, facial expression, gesture, and action recognition. This process is illustrated in the figure below
In the future, our target scenarios are situations where the robot walks into an urban outdoor setting and has to:
Once the robot is in close proximity with the person, we can then go to another (natural language processing based) interaction system to ask them their current situation. For an outdoor robot, the main problem is the initial communication stage.
Our research in this field differ from the traditional HRI and vision-related fields in two ways:
We are currently implementing our social awareness infrastructure that can robustly recognize the people surrounding Beobot 2.0. Afterwards we can focus on the more detailed people information. We are also interested in utilizing information about group activities.
Another research focus that we have is in implementing a biological motion detector (much like a saliency map approach):