Watson's Jeopardy Win A Victory For Mankind - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Infrastructure // PC & Servers
Commentary
2/23/2011
07:09 PM
Charles Babcock
Charles Babcock
Commentary
Connect Directly
Twitter
RSS
E-Mail
50%
50%

Watson's Jeopardy Win A Victory For Mankind

The IBM supercomputer not only beat the quiz show's most successful human contestants, but also showed what skilled researchers will be able to do as they tackle big problems in the future.

Watson, the quiz show winning computer, doesn't think like his namesake, IBM founder Thomas J. Watson, or his two vanquished Jeopardy competitors.

Rather, Watson sorts and separates relevant information by parallel processing. It is a most un-human proceeding, unless you, too, know how to address a question using 2,880 brains.

Since his victory, Watson's been described as a computer that thinks like a human. But I think Watson's middle name should be "Hadoop," the underlying piece of cloud-based open source code that enabled Watson to frequently come up with answers to complex, nuanced questions in 10 milliseconds, or one hundredth of a second. Brad Rutter and Ken Jennings barely had time to put a finger to the forehead before Watson had sounded the buzzer.

That's because Watson had about 200 million pages of information loaded into 15 TB of random access memory, where it could be accessed at the speed of light. Hadoop normally draws information off of distributed server disks in 64-MB chunks. Watson drew it out of RAM in 500-GB gulps. It then could feed it into nearby CPUs, either the nearest server or at least a server in the same rack, avoiding backbone traffic.

Hadoop is a powerful finder, sorter, and organizer of masses of information because it functions both as a distributed file system, and a mapper of known data to the nearest processor in the cluster. Behind its non-emotive face, Watson was just a common ordinary server cluster, occupying a 10-rack unit that looked, one observer said, like a stack of library shelves. Actually, I thought it looked more like an Adirondack mountain hut with no door.

Inside the cluster were 90 IBM 750 Power servers, each with four CPUs; each CPU contained eight Power 7 chip cores, for a total of 32 per server, or a total of 2,880 in the cluster. Once the question was typed into Watson's memory, it was decomposed, analyzed, and processed by those 2,880 cores in parallel.

But the number of threads -- for example individual search processes, each tapping a 500-GB section of memory for references to come up with, say, Toronto is a U.S. city (heh, heh) -- was much higher than 2,880. A little-known fact is that Watson's cores were virtualized under Red Hat's kernel virtual machine (KVM).

So exactly how much parallel processing was going on inside Watson? Each core of the IBM Power 7 chip is capable of running multiple threads; further, it's been engineered for ease of virtualized operation. IBM's Sept. 21, 2009, Clipper white paper says one Power 7 "system" is capable of running up to 1,000 virtual machines. Since each server is a four-socket host, with eight cores per socket, that probably means each Power 7 core hosts 32 VMs, or 256 per socket and 1,004 per server. Those are extremely high numbers, even in intensely virtualized data center settings.

We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Previous
1 of 2
Next
Comment  | 
Print  | 
More Insights
News
The State of Chatbots: Pandemic Edition
Jessica Davis, Senior Editor, Enterprise Apps,  9/10/2020
Commentary
Deloitte on Cloud, the Edge, and Enterprise Expectations
Joao-Pierre S. Ruth, Senior Writer,  9/14/2020
Slideshows
Data Science: How the Pandemic Has Affected 10 Popular Jobs
Cynthia Harvey, Freelance Journalist, InformationWeek,  9/9/2020
White Papers
Register for InformationWeek Newsletters
Video
Current Issue
IT Automation Transforms Network Management
In this special report we will examine the layers of automation and orchestration in IT operations, and how they can provide high availability and greater scale for modern applications and business demands.
Slideshows
Flash Poll