Tested News

Why the 'Minority Report' Interface is Far from Practical

You can't trust a movie to predict the future of computing.

It's hard to believe that Minority Report was released 8 years ago. The Steven Spielberg film, while being a perfectly serviceable science fiction thriller and adaptation of Phillip K. Dick's work, is still most remembered in the technology community for introducing gesture-based computing to the masses. The one scene where Tom Cruise's character controls a computer and manipulates video playback by waving his hands in front of a screen stands out as an eye-opening moment in cinematic-technology history. It's far and away the most referenced film scene in modern tech journalism--bloggers can't seem to help but allude to it whenever any new touch or gesture-based interface innovation debuts.

But you may not know that the interface used in Minority Report isn't mere fiction. The science adviser on the film was inventor John Underkoffler, founder of Oblong Industries. And over the past decade, Underkoffler has been committed to bringing gesture interfaces--which he calls g-speak--to consumers as a new standard for computer interaction.
 
Underkoffler gave a presentation about g-speak and the future of the user interface at the TED2010 conference this past February. He demonstrated gesture-based interfaces and discussed its advantages and potential for growth. He showed off the non-movie version of g-speak to a packed auditorium, complete with image browsing and video editing.
 
 
G-speak is almost exactly as it's portrayed in Minority Report. The user wears gloves that a computer (in this case, using a camera) tracks in 3D space. Using various hand gestures, the user can move files and objects between different displays, opening and closing aspects of them as needed. In his demonstration, Underkoffler pulled the image of a girl out of a video clip and moved it onto a secondary, table-based display, and "steered" the perspective of the main display through a 3D space of cascading photographs.  
 
In a presentation space, with all the necessary equipment deployed and configured, g-speak looks like an incredibly intuitive and simple way to control information. However, like most next-generation interfaces, g-speak has to overcome several problems before it can be embraced by the masses. 
 
While they seem functional on stage, hand gestures aren't quite as easy to capture at a desk. If you have a webcam, consider how much of your body is on display when you sit comfortably at your desk, and how much you have to move to get your hands consistently into frame. For hand gesture interfaces, the device that records the gestures has to reliably and consistently monitor the user's hands at comfortable positions. Holding up "jazz hands" for minutes at a time just to browse the web is not an improvement to the computer interface.  
 
The leads to a more important ergonomic issue: arm motion. The mouse and keyboard interface works well because it requires very little arm or wrist movement. The user can cover the entire display and enter a full range of text commands by only moving a few inches. Compare that interface design with g-speak both in Minority Report and in Underkoffler's demonstration; while waving arms like you're guiding planes on an aircraft carrier might feel a bit more natural than tapping away with your fingers, it's not the sort of activity most office workers can sustain for hours at a time. It's a great interface for a 15-minute presentation, but for an 8-hour stint in a cubicle it's a recipe for cramps and soreness. 
 
Finally, gesture-based interfaces, and other so-called natural user interfaces, may not be as intuitive as researchers believe. We've written about this topic before, and you can read our analysis here. The upshot is that while these kinds of interfaces may be relatively more intuitive than a keyboard or remote control, there is no universal or innate standard for computing gestures--users are still going to have to learn a whole new lexicon of actions.
 
    
G-speak isn't the only gesture-based interface in development. Toshiba recently showed off its "AirSwing" interface, a gesture-based system for electronic advertising.  While it's designed more for digital billboards than computers, it gives the viewer the same sort of gesture control as g-speak. And from the video demo of the interface (below), you can tell it looks clumsy and slow in its current implementation.  
  
 
Microsoft and Sony are also working on their own gesture and motion-based control systems for the Xbox 360 and Playstation 3. Microsoft's Project Natal uses a multi-camera accessory to track user motion, translating it into input for video games. Playstation Move uses a glowing wand controller tracked by a camera to offer gesture-based controls similar to the Nintendo Wii.   
 
Advertising and gaming-oriented gesture controls potentially overcome many of the issues computer-oriented gesture-based interfaces face. In larger, more casual environments, gestures are much easier to make, and since billboards and gaming systems generally use much larger displays than workstations, gestures are more easily translated to scale. It's one of the reasons the Wii, with its remote-based gesture controls, has become so popular. It's easier to wave your arms from the couch or the middle of your living room than it is at a desk (though Wii fatigue is not unheard of). 
 
Underkoffler predicts that every computer will have g-speak or a similar gesture-based interface in the next five years, but that seems overly optimistic. Gesture-based controls have been in development for over a decade, and they've yet to replace the mouse and keyboard as the main input method for computer systems. Even if Underkoffler can make g-speak affordable and accessible, he can't change the fact that his system requires more room and movement than the current user interfaces available. That said, while it will take much more effort to make gesture interfaces popular in computers, it may pick up steam in the home entertainment sector, taking advantage of ever-growing HDTVs and the generous room most home theaters offer to fully embrace the technology. 
 
We're cautiously optimistic about the promise of gesture-based interfaces, but it's yet to prove its worth just yet. The future predicted by Minority Report is on the right track, but its gesture-based vision is hardly the holy grail of computer interfaces, and shouldn't be treated as such. 
 
Image credits: Dreamworks, TED2010
Zaphon June 2, 2010 at 5:02 p.m.
Call me old fashion, but the less feedback an interface has, the more I dislike it.
 
Mice click, keyboards depress etc. The more abstract the tactile response becomes, the less faith I have in what I'm doing.
 
I literately forcing myself to use the virtual keyboard on my Android phone currently, because I know whatever I upgrade to won't have a hardware one. This was only made possible because I found an App that makes the 'keys' vibrate and when pressed. Even that really isn't enough.
Rehehellyon June 2, 2010 at 5:12 p.m.
I hate waving my arms around unless i'm in the mood. I can't really see it being too practical. It's the same reason I don't like playing motion assisted games. 
I like to be lazy, the less movement the better (not being lazy, just, it feels better?). Clicking a mouse feels better I guess.
 
also, Steven Spielberg.
norman staff on June 2, 2010 at 5:15 p.m.
@Rehehelly said:
 Steven Spielberg. "
the film geek in me is shamed.
GozerTCon June 2, 2010 at 5:57 p.m.

Just like I mentioned in the previous discussions on motion based interfaces I really don't see this working any time soon.  Mainly because I don't see a TRUE and practical intuitive system working until we have proper voice 2 way communication with the computer system.  Until then a purely gesture based system will always be sub-standard to a physical interface system.  Even one as old as a mouse and keyboard.  
 
Basically until you can talk and motion to a system it's not going to be as "simple" and "efficient" as our current systems. 
norman staff on June 2, 2010 at 6:04 p.m.
@GozerTC:  very good point about voice interaction. but an intuitive voice recognition system is still years away--the complexities of linguistics makes it a programming nightmare.
nschwermannon June 2, 2010 at 7:43 p.m.
pfft the Minority Report interface is soooo 2002.  Give me the Iron Man interface baby!!
HypoXenophobiaon June 2, 2010 at 9:25 p.m.
I wrote a giant diatribe about why this frustrated me, but deleted it out of fear of appearing crazy. So, to simplify my grievances, gesture based movements elongates an already simple process and is equally way more imprecise. [five paragraphs down to two lines.]
GozerTCon June 2, 2010 at 10:48 p.m.
@norman said:
" @GozerTC:  very good point about voice interaction. but an intuitive voice recognition system is still years away--the complexities of linguistics makes it a programming nightmare. "

Oh no doubt, but I still think that's what will need to happen before any of these systems really "works" or is at least worth the effort.  HypoXenophobia has a point since pure gestures does nothing more than make you do MORE to do what you already do. :)  
Binman88on June 3, 2010 at 5:46 a.m.
@HypoXenophobia said:
" I wrote a giant diatribe about why this frustrated me, but deleted it out of fear of appearing crazy. So, to simplify my grievances, gesture based movements elongates an already simple process and is equally way more imprecise. [five paragraphs down to two lines.] "
Exactly. I can see this interface being incredibly useful for large group projects or presentations (really anything that requires you to move around a lot or where using a keyboard and mouse would be awkward), but for small scale computing, the keyboard and mouse is still the best way to interact with a machine.
wonko33on June 3, 2010 at 4:38 p.m.
Two reasons why it would suck
 
1- I'm not waving my hands in the air like a moron to open an email. look at the movie, in half the time he does anything we would have five tasks completed with mouse and keyboard, it just looks impractical.
2- Since the named has already been coined and it's a sucky Tom Cruise movie, i want nothing to do with it.
greks224on June 4, 2010 at 7:14 a.m.
I was disappointed in that TED presentation, since I feel it didn't well present the problem being solved (but then again, they do have very limited time to speak). As a result, he came off sounding kind of pretentious, alluding to problems that I don't really recognize exist, as well as using a vocabulary that is very insular to UI academia. However, I really don't like the reasons you gave for "why it won't work," Will.  You have to recognize that the problems that you address, like the impracticality of using the interface at a cubicle, is contingent upon how this "new" interface will clash with workspaces designed for traditional interfaces.  The cubicle exists the way it does because it works best for a computer. If a workplace adopts gesture based interfaces, the style of work spaces would innovate and adapt accordingly.  As the presenter was saying, work is becoming a more collaborative process--while many workers today do use cubicles, office spaces come in many forms which suit the work being done. 
 
So, if a cubicle doesn't work with an interface which is superior for ...some other reason (I'm not sure what that reason would be), then the cubicle will get bigger, or the cubicle just won't exist. There's more than one variable in the UI equation.
Viderianon June 4, 2010 at 9:55 a.m.
I could see this happening as a neat-o billboard at a mall or store or something.  There is too much movement to make this work in an actual work environment.    

If anything, I think new interfaces should reduce movement instead is better then increasing movement.
Tebbiton June 4, 2010 at 11:01 a.m.
You know what would be a cool idea? 
 
Step 1: Turn on webcam. 
Step 2: Webcam looks at reflection in your eyes, pinpointing what you are looking at on monitor. 
Step 3: computer program selects that thing. 
 Step 4: Profit 
 
Thats copyright to me, by the way.
DarkZoomon June 4, 2010 at 11:04 a.m.
I would like the iron man interface program that they used in the movie.he was able to do complex things and it didd'nt take to much effort.anyway i see what you guys are saying about thow much motion you have to put in it. it seems tireing to me but its smething i think would be cool to have and enjoy
 
DZ
hguytaylor1on July 3, 2010 at 6:37 p.m.
I use a gestural system on my smart phone . I have to touch the screen but I see no reason I have to. Somehow I imagine a TED demo on my Netbook could not be seen from the front row though -- my gestures would need to be confined to the screen space.  
 
I get the impression that all these big demo systems are influencing the perception and expectations of the product. very few PC operators work before audiences. As a designer I am used to the drawing board or drawing table metaphor and I think that would be the easiest adaptation for close-in personal work. Minority Report was a movie Reality implementation will necessarily look different.
  
GT
How To Rip DVDs to Play on Any Device—For Free!

Our ultimate DVD copying guide shows you how to use Handbrake to rip discs for playback on your laptop, Xbox, PS3, iPhone, Zune, netbook, iPad, PSP, iPod, and pretty much anything else

Tested: Apple Mac OS X Lion

The $30 upgrade from Snow Leopard seems like a no-brainer, but that's not the full story. Read the official Tested review.

Apple iPad 2 Review

Our definitive review of the iPad 2.

Tested: Nvidia GeForce GTX 580 Video Card

The green machine comes back swinging.

Apple iPad Wi-Fi Review

Apple's device isn't magical, but it is going to make the tablet a real category--for better or for worse

Video: How To Build the Best $1500 Gaming PC, Step-by-Step

Do you like saving money? What about playing PC games? Have you ever considered building your own PC? We show you exactly what you need to know to build an awesome $1500 gaming PC.

How To Build an Awesome $500 Windows Home Server

We pick the best parts to build a lean media backup and streaming machine.

The Best Android Phone for Your Network (February 2011)

Don't get locked in with the wrong phone.

Why I’ve Switched from Browser-Based Apps Back to Native Software on the Desktop

Moving from powerful, native Desktop clients to clunky web-apps isn’t always a good thing. In fact, it’s often a significant step backwards.

Qualcomm's Snapdragon S4 Integrates LTE, Tops Benchmarks

Qualcomm's latest Snapdragon SoC outperforms 2011's fastest chips by a wide margin and integrates LTE processing in the name of battery efficiency.

Ivy Bridge Chip Delay to Summer Gives Apple a Shot at Early Access

As PC makers hold onto new models in the summer months before Windows 8's launch, Apple will have plenty of time to launch new laptops with new processors.

Nokia Teases "Pure View" Camera, Which Hopefully Isn't Just More Megapixels [Video]

Nokia's next phone could mark the return of the Xenon flash, a large image sensor, or high quality 1080p video recording.

Future SSDs May Suffer from Speed and Reliability Issues

Researchers at the University of California, San Diego project doubled latency and tripled bit errors in SSDs as storage increases.

Purported 2048x1536 iPad 3 Display Spotted--Now Who's Making It?

A quad-pixel iPad display in the flesh, but it is unlikely that Sharp will manufacture it.

How Windows 8 Will Integrate SkyDrive for Cloud Storage and Remote PC Access

SkyDrive's omnipresence in Windows 8 extends from the profile to Metro apps to the legacy desktop.

Major Changes to Flickr to Begin Rolling Out February 28th

The Photo View and Upload pages are getting much-needed redesigns, which Flickr needs to stop the user bleed.

What Kind of Computer User Needs a High-End 17-inch Laptop?

According to Loyd, there exist three key uses that necessitate a high-end 17-inch laptop.

Why I’ve Switched from Browser-Based Apps Back to Native Software on the Desktop

Moving from powerful, native Desktop clients to clunky web-apps isn’t always a good thing. In fact, it’s often a significant step backwards.

Future SSDs May Suffer from Speed and Reliability Issues

Researchers at the University of California, San Diego project doubled latency and tripled bit errors in SSDs as storage increases.

Comcast Announces Its Netflix Rival: Xfinity Streampix

The streaming video service is launching this Thursday for existing Comcast subscribers.

Tested: The New and Improved Swype Beta on Android 4.0

After acquiring Swype, Nuance had updated the app for Android 4.0 and added some new features.

Everything You Need to Know About Ubuntu for Android

Linux gets more Linux-y when Android gets mashed up with Ubuntu.

Purported 2048x1536 iPad 3 Display Spotted--Now Who's Making It?

A quad-pixel iPad display in the flesh, but it is unlikely that Sharp will manufacture it.

Siri On OS X: Will Most Users Even Care?

One notable feature missing from the upcoming Mountain Lion release of OS X is Apple's Siri.

Submissions can take several hours to be approved.

Save ChangesCancel