Will Microsoft's Hadoop Bring Big Data To Masses? - InformationWeek


10/30/2012
12:31 PM

Will Microsoft's Hadoop Bring Big Data To Masses?

HDInsight Server "dramatically" lowers the cost and complexity of deploying Hadoop, Microsoft exec says.

Last week Microsoft announced HDInsight Server, a version of the Hadoop big data analytic framework designed to run in simpler, less expensive environments than Hadoop usually requires.

Microsoft characterized the product, now in beta, as an effort to simplify big data implementations and enable IT managers to run Hadoop on Windows machines, without having to jump through the usual hoops.

Apache.org asserts that Hadoop can run on Win32-based machines but recommends doing so only for development, not as a production-quality system.

The on-premises version of HDInsight is fully supported and certified by Microsoft and partners as a production platform. It includes an automated install-and-configuration process and links that enable business users to download subsets of Hadoop data sets so they can run their own scenarios using Excel, PowerPivot for Excel and PowerView.

Microsoft says the HDInsight preview runs on both Windows Server and its Azure platform-as-a-service cloud offering. Branded Windows Azure HDInsight Service and Microsoft HDInsight Server for Windows, the two versions "dramatically" lower the cost and complexity of deploying Hadoop, according to Microsoft technical fellow David Campbell.

The open source Apache Hadoop framework is designed to run on implementations as small as one server, but is typically configured to run across a whole cluster of servers.

Even installed on just one node, Hadoop must be configured as a cluster within which Hadoop's own name servers and resource managers coordinate the integration of a series of Hadoop modules that manage, schedule, process, query, analyze and publish both data and analytics.
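As a rough illustration of what "configured as a cluster" means on one machine: stock Apache Hadoop reads its settings from XML files such as core-site.xml and hdfs-site.xml, and a single-node setup still points the file system at a name server address and sets block replication to 1. The Python sketch below generates those two files. The helper name, host and port are assumptions made for illustration, and the sketch says nothing about how HDInsight's automated installer handles this step.

```python
import xml.etree.ElementTree as ET


def hadoop_site_xml(properties):
    """Render a dict of Hadoop properties as a *-site.xml document string.

    Hypothetical helper for illustration; not part of Hadoop or HDInsight.
    """
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


# Assumed single-node values: the host and port are illustrative only.
# fs.defaultFS is the current property name; older 1.x releases of
# Hadoop call the same setting fs.default.name.
core_site = hadoop_site_xml({"fs.defaultFS": "hdfs://localhost:9000"})

# With one node there is nowhere to put extra copies of a data block,
# so replication is set to 1.
hdfs_site = hadoop_site_xml({"dfs.replication": 1})

print(core_site)
print(hdfs_site)
```

Even in this smallest case the node is addressed through a name server URL rather than a local path, which is why a one-machine install is still set up as a cluster.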

Integration with Microsoft's System Center 2012 is designed to simplify management of both the Windows Server and the Azure versions of HDInsight by allowing IT managers to tweak or control the applications using Microsoft's familiar management tools.

Though HDInsight is fully compatible with Apache Hadoop, according to Microsoft, it is designed to be more adaptable because it can be run either on-premises or in the cloud -- or in both places with connections secured through Active Directory that allow on-premises and cloud versions to exchange data and/or queries.

In addition, running HDInsight on Windows Azure provides the same dynamic resource configuration as every other cloud service, so admins can install it as if it were running on a single server, then increase RAM, storage, CPU cycles and other resources to cover peaks in demand. They can also expand a virtual single-node version of HDInsight into a multi-node, clustered version without having to reinstall or migrate the installation to a different set of physical servers.

Both versions ship with links to the U.S. Census Bureau, United Nations, Dun & Bradstreet and other data sources via the online Windows Azure Marketplace. They also allow data-crunching business users to download subsets of Hadoop data or the results of previous queries to refine, reprint or republish the results using Excel.

Both versions can also be linked with installations of SQL Server to trade data or run cross-queries on both systems, using connectors from Hortonworks, the Hadoop specialist that handled most of the porting and integration of Hadoop onto Windows.
