Roboscopie, the robots take the stage! |
Watch it!
Short version (3 min)
Full-length version (18 min)
The storyline
Xavier and PR2, his robotic companion, try to find ways to understand each other.
Xavier and PR2 share a white, almost empty stage. For the robot to see his world, Xavier must keep being recognized by the human-tracking module mounted on the wall, and must stick 2D barcodes everywhere in place of the real objects. The robot can read and identify these barcodes, and as the stage gets covered with tags, it builds for itself a 3D world populated with the right-looking objects: a phone, a lamp, a hanger...
While Xavier draws more and more of these 2D tags by hand, the robot tries to offer its help. It first brings a bottle of water, then a fan... which blows away all of Xavier's codes. Angry, Xavier leaves, and PR2 remains alone.
Night falls, and the robot decides to explore the stage, now scattered with barcodes on the ground.
The next morning, Xavier enters, and as soon as he is recognized by the tracking system, he discovers that the robot's 3D model is a mess, full of random objects: an elephant, a boat, a van... Xavier resets the robot's model with a special black tag and starts to tidy up the place.
The robot decides to help and fetches a trash bin, but starts to behave strangely (or is it playing?), and Xavier and PR2 end up in a clumsy basketball game with paper balls.
The robot suddenly gives up, and a new program slowly starts: a home-training session. Xavier seems to be expecting it, changes his T-shirt, and starts the exercises. But as the program goes along, the robot looks more and more menacing, up to the point that Xavier shouts "Stop!".
Xavier shows the objects -- actually, the objects' barcodes -- to the robot one after the other, explaining that they are all fake, and one after the other, the robot connects the objects in its mind to the idea of being fake. Like the robot, we realize that everything was just an experiment.
Download the original script (in French!)
Making-of
[Photos and videos from the preparation, including the last inspection of the stage by Nicolas and a video of the first test of the 'home training' session: http://vimeo.com/27810716]
On the technical side
The PR2 robot was running software developed at LAAS/CNRS. While the performance tries to depict some of the challenges in the human-robot interaction field studied at LAAS/CNRS, including the autonomy required of a robot working with humans, the robot was partially pre-programmed for this theater performance.
Most of the behaviours were written in Python, relying both on the PR2's ROS middleware and on LAAS's Genom+Pocolibs modules.
What was pre-programmed?
- While the real perception routines were running (see below), the robot had no way to synchronize with the human during the play: each sequence was manually started by one of the engineers (a minimal sketch of such a manually triggered sequence runner is given after this list).
Click here to view the performance's master script
- Places on the stage were hard-coded: for instance, the position of the table was known to the robot from the beginning, as was the position of the entrance door, etc.
- Manipulation tasks (like grasping the fan or the paper bin) were much simplified: the robot would simply open its gripper in front of itself, wait for "something" to be detected in its hand, and then close the gripper (a sketch of this simplified grasp also follows the list below). Likewise, the special postures the robot used to enter or leave the stage with an object in hand (required to avoid collisions with the door) were all pre-defined.
- At the end of the play, when Xavier talks to the robot ("Stop!", "Look at this phone!", "Everything is fake", etc.), the sentences were manually typed into the system. We could have used speech recognition as we do in the lab, but converting speech to its textual form (which the robot can process) is relatively slow and error prone, so we decided to avoid it on stage.
- While what Xavier said was successfully understood by the robot (cf. below), the actions that followed (like looking at the phone, turning the head back to the audience, ...) were manually triggered.
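For illustration, a manually triggered master script can be as simple as the sketch below. This is not the actual Roboscopie script (linked above); the sequence names and behaviour functions are hypothetical placeholders.

{{{#!python
#!/usr/bin/env python
# Minimal sketch of a manually triggered sequence runner: the operator
# presses Enter to launch each pre-programmed sequence in sync with the
# play. Hypothetical placeholders, not the actual Roboscopie master script.

def enter_stage():
    print("PR2 enters the stage...")    # placeholder behaviour

def bring_fan():
    print("PR2 fetches the fan...")     # placeholder behaviour

def home_training():
    print("Home-training session...")   # placeholder behaviour

SEQUENCES = [("enter the stage", enter_stage),
             ("bring the fan", bring_fan),
             ("home training", home_training)]

if __name__ == "__main__":
    for name, run_sequence in SEQUENCES:
        # Python 2 style, as used with ROS at the time
        raw_input("Press Enter to start sequence '%s'... " % name)
        run_sequence()
}}}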
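The simplified grasp can likewise be sketched with the standard PR2 gripper action. This is an illustrative reconstruction, not the code used on stage: the action name, pause and effort values are assumptions, and the real detection of "something in the hand" (e.g. from the fingertip sensors) is not shown.

{{{#!python
#!/usr/bin/env python
# Illustrative sketch of the simplified "grasp": open the gripper, wait,
# then close it and check whether it stalled on an object. Not the actual
# Roboscopie code; names and thresholds are assumptions.
import rospy
import actionlib
from pr2_controllers_msgs.msg import Pr2GripperCommandAction, Pr2GripperCommandGoal

def gripper_command(client, position, max_effort):
    goal = Pr2GripperCommandGoal()
    goal.command.position = position      # gripper opening, in meters
    goal.command.max_effort = max_effort  # -1.0 means "no effort limit"
    client.send_goal(goal)
    client.wait_for_result()
    return client.get_result()

if __name__ == "__main__":
    rospy.init_node("simple_handover")
    client = actionlib.SimpleActionClient("r_gripper_controller/gripper_action",
                                          Pr2GripperCommandAction)
    client.wait_for_server()

    gripper_command(client, 0.08, -1.0)          # open wide
    rospy.sleep(5.0)                             # wait for "something" to be put in the hand
    result = gripper_command(client, 0.0, 50.0)  # close gently
    if result is not None and result.stalled:
        rospy.loginfo("Got it: the gripper stalled on an object.")
    else:
        rospy.logwarn("The gripper closed completely: nothing grasped.")
}}}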
What was autonomously managed by the robot?
All navigation tasks were computed live by the PR2, using the ROS navigation stack. The main script would just tell the robot to go, for instance, from the engineer's desk to the center of the stage; the robot would then find a path that avoids obstacles.
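Sending such a goal boils down to one call to the move_base action of the navigation stack. The sketch below is only an illustration: the stage coordinates, place names and map frame are made up.

{{{#!python
#!/usr/bin/env python
# Illustration: sending a hard-coded stage location to the ROS navigation
# stack through move_base. Coordinates, place names and frame are made up.
import rospy
import actionlib
from move_base_msgs.msg import MoveBaseAction, MoveBaseGoal

# Hypothetical hard-coded places on the stage: (x, y) in the map frame
PLACES = {"engineer_desk": (0.0, 0.0),
          "stage_center":  (3.5, 1.0)}

def goto(client, place):
    x, y = PLACES[place]
    goal = MoveBaseGoal()
    goal.target_pose.header.frame_id = "map"
    goal.target_pose.header.stamp = rospy.Time.now()
    goal.target_pose.pose.position.x = x
    goal.target_pose.pose.position.y = y
    goal.target_pose.pose.orientation.w = 1.0   # arbitrary final heading
    client.send_goal(goal)
    client.wait_for_result()   # move_base plans the path and avoids obstacles

if __name__ == "__main__":
    rospy.init_node("goto_stage_center")
    client = actionlib.SimpleActionClient("move_base", MoveBaseAction)
    client.wait_for_server()
    goto(client, "stage_center")
}}}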
The 3D world displayed above the stage during the show is a live capture of the Move3D and SPARK software. These tools are used daily on the robot to compute trajectories, symbolic locations, and the visibility and reachability of objects.
However, during the performance, we deactivated the computation of symbolic facts (like xavier looksAt jacket, RED_PHONE isOn table, ...), which is not yet reliable enough to be used on stage. Instead, we manually added several symbolic facts to a scenario-specific ontology.
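For illustration, such facts could be asserted in oro-server along the following lines. This sketch assumes the pyoro Python bindings and their add() method; the host, port and exact API may differ from what was actually scripted for the show.

{{{#!python
# Sketch only: manually asserting scenario-specific facts in oro-server.
# Assumes the pyoro bindings; host, port and method names are assumptions.
from pyoro import Oro

kb = Oro("localhost", 6969)   # oro-server address: an assumption

# Facts that the perception layer would normally produce, asserted by hand
kb.add(["xavier looksAt jacket",
        "RED_PHONE isOn table"])

kb.close()
}}}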
The 2D barcodes are actually a key perception mechanism for our PR2. In the lab, all our objects have such 2D tags attached to them. They are used to identify and localize (both in position and orientation) the objects surrounding the robot.
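Once a tag has been detected and the corresponding object pose published, other components can simply query where the object is. The sketch below is a generic ROS/tf illustration, not the LAAS perception pipeline used on stage, and the frame names are assumptions.

{{{#!python
#!/usr/bin/env python
# Generic illustration: querying the pose of a tagged object once the tag
# detector publishes it as a tf frame. Frame names are assumptions.
import rospy
import tf

if __name__ == "__main__":
    rospy.init_node("where_is_the_phone")
    listener = tf.TransformListener()
    rate = rospy.Rate(1.0)
    while not rospy.is_shutdown():
        try:
            # position and orientation of the RED_PHONE tag in the map frame
            trans, rot = listener.lookupTransform("/map", "/RED_PHONE",
                                                  rospy.Time(0))
            rospy.loginfo("RED_PHONE is at %s", trans)
        except (tf.LookupException, tf.ConnectivityException,
                tf.ExtrapolationException):
            pass   # the tag has not been seen yet
        rate.sleep()
}}}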
As can be seen in the 3D model of the world perceived by the robot, the PR2 knew where Xavier was and what his posture was. This is done using the Microsoft Kinect together with the OpenNI human tracker. On several occasions, the robot automatically tracked the human's head or hands with this system.
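Tracking the human then amounts to repeatedly pointing the robot's head at the frame published by the tracker. The sketch below uses the standard PR2 point_head action together with the head frame published by openni_tracker; the exact frame and action names used during the show are assumptions.

{{{#!python
#!/usr/bin/env python
# Illustrative sketch: make the PR2 look at the human head frame published
# by the OpenNI tracker. Frame and action names are assumptions.
import rospy
import actionlib
from pr2_controllers_msgs.msg import PointHeadAction, PointHeadGoal

if __name__ == "__main__":
    rospy.init_node("look_at_xavier")
    client = actionlib.SimpleActionClient("/head_traj_controller/point_head_action",
                                          PointHeadAction)
    client.wait_for_server()

    rate = rospy.Rate(2.0)
    while not rospy.is_shutdown():
        goal = PointHeadGoal()
        # Target: origin of the tracked head frame ("head_1" for the first user)
        goal.target.header.frame_id = "head_1"
        goal.pointing_frame = "high_def_frame"   # point the camera at the target
        goal.pointing_axis.x = 1.0
        goal.min_duration = rospy.Duration(0.3)
        goal.max_velocity = 1.0
        client.send_goal(goal)
        rate.sleep()
}}}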
At the end, Xavier talks to the robot. The textual version of what he said was fed to the system "as is". Natural language understanding is done by the Dialogs module, which makes extensive use of the oro-server knowledge base to make sense of the words in the current context. The result of the language processing was then added back to the knowledge base and automatically displayed by the oro-view OpenGL ontology viewer.
Hence, the sentence <look at this phone> gets translated into the symbolic facts [human desires action1, action1 type Look, action1 receivedBy RED_PHONE]. The robot is able to know that <this phone> is indeed the RED_PHONE by taking into account what the human is focusing on.
Press coverage
"PR2 takes the stage", Willow Garage blog