What's new? | Help | Directory | Sign in
Google
sentience
A 3D volumetric perception system for mobile robots
  
  
  
  
    
Search
for
Updated Jan 31, 2007 by fuzzgun
WhyStereoVision  
Why choose stereo vision ?

Multiple methods exist for determining the range to an object. The most popular for robotics use are active sensing methods such as infrared, ultrasonic and laser ranging devices. The trouble with all of these methods is that they only give quite low resolution information. For infrared and ultrasonics the area of space within which objects may be detected is quite wide (several degrees at least), so that the precise position of an object may only be known in very vague terms.

A laser ranger on the other hand has a far more specific narrow beam, but at any point in time only gives a tiny tunnel vision-like amount of information about the depth of the scene. To overcome this problem in recent years scanning laser rangefinders have become increasingly popular. By scanning in one or more directions using a spinning mirror to direct a laser beam it's possible to build up detailed information about the geometric structure of the environment. However, this scanning process is relatively slow and relies upon precision engineered mechanical parts. At the time of writing scanning laser rangefinders are still very expensive devices costing thousands of dollars, and remain only within the realm of high end academic, industrial or military applications.

Cameras can also be used as range sensing devices. When more than one camera are arranged in some configuration, such that the same features in the environment can be observed from multiple viewpoints, matches made between corresponding features may be used to calculate range values. Potentially, every pixel in the camera image can be used as a highly focussed ranging device.

A few possibilities for camera based range sensing exist.

  1. A single camera taking pictures as it moves through space, with features being tracked and correlated over time.
  2. Two cameras taking photos near symultaneously, calculating ranges from stereo correspondences under the epipolar constraint.
  3. Three cameras - a trinocular system - working in a similar manner to stereo but with greater possibilities for accuracy and dissambiguation of features. The only dissadvantage here is the greater computing resources needed to process three images rather than two, and calculate three dimensional light ray intersections.
  4. Use of two or three cameras together with tracking of features over time to produce a spatio-temporal correlation system. This is the ideal visual perception system, since tracking of features over time facilitates accurate long range measurements to be taken.

Key advantages of camera based systems.

  1. Although a trinocular system might be more accurate, using only two cameras gives a minimally complex solution.
  2. High speed. Unlike methods which require mechanical scanning cameras can gather a large amount of range data in a very short space of time.
  3. Very low cost of digital imaging devices. In the last five years digital imagers have become ubiquitous devices, used in digital cameras and mobile phones. Low cost means that robots using stereo vision easily fall within the means of the robotics hobbyist or even consumer robotics domains.
  4. Cameras are entirely solid state, whereas laser scanners have moving parts which could potentially become unreliable during the rough and tumble of robotic excursions.
  5. Colour information can be easily acquired at the same time as range data, helping to build realistic full colour 3D models of the environment.

Sign in to add a comment