MapR Brings Search To Hadoop - InformationWeek

MapR brings new power to HBase, taps LucidWorks to integrate Apache Lucene/Solr search into M7 Hadoop distribution.

Last fall MapR set out to improve on HBase, Hadoop's built-in NoSQL database. On Wednesday it delivered on that promise and announced its next move: integrating search capabilities into its M7 Hadoop distribution in partnership with LucidWorks.

With the latest MapR M7 release, available immediately, the company says it has delivered higher performance and easier administration for both Hadoop and HBase by forging its own path on certain aspects of Hadoop infrastructure and administration. Specifically, M7 does away with region servers, table splits and merges, and data compaction steps tied to standard Apache software. Instead it implements an architecture exclusive to MapR for snapshotting, high availability and system recovery.

"We've eliminated the tradeoffs that organizations face in terms of getting scale, consistency, reliability and continuous low-latency performance in one solution, but M7 works across all these dimensions," MapR VP of marketing Jack Norris told InformationWeek.

MapR points to advantages including instant recovery from hardware or software errors, the ability to do online schema modifications for HBase applications, and performance specs exceeding 1 million operations per second on a 10-node Hadoop cluster.

To support search, MapR introduced a beta of LucidWorks Search software integrated with the M7 platform. The search technology will be optional, and plans call for general release next quarter. LucidWorks offers a supported software distribution, consulting and training for open source Apache Lucene/Solr search, and it adds commercial development platforms designed to simplify and accelerate the building of search applications.

With search integrated directly with Hadoop, customers will have an easier time building recommendation engines for retail scenarios, fraud detection for financial transactions and predictive applications for any number of industries, according to Norris.

"You could do some of these applications in a MapReduce framework, but if you need online performance, MapReduce latency is a problem and having a search platform is extremely useful," Norris explained. MapR can stream data from Hadoop clusters into the search engine from NFS, the file system used in M7 in place of HDFS.

LucidWorks offers an enterprise-hardened and secured version of Apache Lucene/Solr. The software provides a REST-based API, ODBC connectivity, provisions for LDAP and NIS security, and connections to HDFS and NFS among other features.
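
For readers unfamiliar with what a REST-style search request looks like, the following is a minimal sketch only. It builds a query URL for the standard `/select` search handler of open source Apache Solr, not for the LucidWorks Search API itself, which the article does not describe. The host, the `transactions` collection, and the `merchant` and `amount` field names are hypothetical, chosen to echo the fraud-detection scenario above.

```python
from urllib.parse import urlencode


def build_solr_select_url(base_url, collection, query, rows=10):
    """Build a URL for Apache Solr's standard /select search handler.

    base_url and collection are placeholders for illustration; they are
    not taken from the article or from any MapR or LucidWorks product.
    """
    params = {
        "q": query,    # the Lucene/Solr query string
        "rows": rows,  # maximum number of documents to return
        "wt": "json",  # ask Solr for a JSON response
    }
    return f"{base_url.rstrip('/')}/solr/{collection}/select?{urlencode(params)}"


# Hypothetical example: large transactions for one merchant.
url = build_solr_select_url(
    "http://localhost:8983",
    "transactions",
    "merchant:acme AND amount:[1000 TO *]",
)
print(url)
```

The sketch only constructs the URL and sends nothing over the network. An application would issue an HTTP GET against a running Solr instance and parse the JSON response.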

MapR will provide first-level support for the new search option, with LucidWorks available to handle tougher problems, according to Norris. The cost of the LucidWorks Search option was not disclosed.
