Will Microsoft's Hadoop Bring Big Data To Masses? - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Software
News
10/30/2012
12:31 PM
Connect Directly
Twitter
LinkedIn
RSS
E-Mail
50%
50%

Will Microsoft's Hadoop Bring Big Data To Masses?

HDInsight Server "dramatically" lowers the cost and complexity of deploying Hadoop, Microsoft exec says.

Microsoft announced last week HDInsight Server, a version of the Hadoop big data analytic framework designed to run in simpler, less expensive environments than Hadoop usually requires.

Microsoft characterized the product, now in beta, as an effort to simplify big data implementations and enable IT managers to run Hadoop on Windows machines, without having to jump through the usual hoops.

Apache.org asserts that Hadoop can run on Win32-based machines but recommends doing so only for development, not as a production-quality system.

The on-premises version of HDInsight is fully supported and certified by Microsoft and partners as a production platform. It includes an automated install-and-configuration process and links that enable business users to download subsets of Hadoop data sets so they can run their own scenarios using Excel, PowerPivot for Excel and PowerView.

Microsoft says the HDInsight preview runs on its Windows Server and Azure platform-as-a-service cloud offering. Branded Windows Azure HDInsight Service and Microsoft HDInsight Server for Windows, the Windows versions "dramatically" lower the cost and complexity of deploying Hadoop, according to Microsoft technical fellow David Campbell.

The Hadoop framework is designed to run on implementations as small as one server, but is typically configured to run on a whole server cluster running the open source Apache Web server.

Even installed on just one node, Hadoop must be configured as a cluster within which Hadoop's own name servers and resource managers coordinate the integration of a series of Hadoop modules that manage, schedule, process, query, analyze and publish both data and analytics.

Integration with Microsoft's System Center 2012 is designed to simplify management of both the Windows Server and the Azure versions of HDInsight by allowing IT managers to tweak or control the applications using Microsoft's familiar management tools.

Though HDInsight is fully compatible with Apache Hadoop, according to Microsoft, it is designed to be more adaptable because it can be run either on-premises or in the cloud -- or in both places with connections secured through Active Directory that allow on-premises and cloud versions to exchange data and/or queries.

In addition, having HDInsight running on Windows Azure provides the same dynamic resource configuration as every other cloud service, so admins can install it as if it is running on a single server, then increase RAM, storage, CPU cycles and other resources to cover peaks in demand. They can also expand a virtual single-node version of HDInsight into a multi-node, clustered version without having to reinstall or migrate the installation to a different set of physical servers.

Both also ship with links to the U.S. Census Bureau, United Nations, Dun & Bradstreet and other data sources via the online Windows Azure Marketplace. They also allow data-crunching business users to download subsets of Hadoop data or the results of previous queries to refine, reprint or republish the results using Excel.

Both versions can also be linked with installations of SQL Server to trade data or run cross-queries on both systems, using connectors from Hortonworks, the Hadoop specialist that handled most of the porting and integration of Hadoop onto Windows.

We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Previous
1 of 2
Next
Comment  | 
Print  | 
More Insights
Commentary
Gartner Forecast Sees 7.3% Shrinkage in IT Spending for 2020
Joao-Pierre S. Ruth, Senior Writer,  7/15/2020
Slideshows
10 Ways AI Is Transforming Enterprise Software
Cynthia Harvey, Freelance Journalist, InformationWeek,  7/13/2020
Commentary
IT Career Paths You May Not Have Considered
Lisa Morgan, Freelance Writer,  6/30/2020
White Papers
Register for InformationWeek Newsletters
The State of IT & Cybersecurity Operations 2020
The State of IT & Cybersecurity Operations 2020
Download this report from InformationWeek, in partnership with Dark Reading, to learn more about how today's IT operations teams work with cybersecurity operations, what technologies they are using, and how they communicate and share responsibility--or create risk by failing to do so. Get it now!
Video
Current Issue
Key to Cloud Success: The Right Management
This IT Trend highlights some of the steps IT teams can take to keep their cloud environments running in a safe, efficient manner.
Slideshows
Flash Poll