Pentaho Preps Data On Hadoop, Analyzes On MongoDB - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Comments
Pentaho Preps Data On Hadoop, Analyzes On MongoDB
Newest First  |  Oldest First  |  Threaded View
Lorna Garey
50%
50%
Lorna Garey,
User Rank: Author
7/8/2014 | 3:48:24 PM
Re: Data integration vendors are hot to get in on big data
How will Pentaho monetize this? The number of customers paying for enterprise support doesn't seem all that high.
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
7/8/2014 | 3:29:39 PM
Re: Pentaho system, ungainly or powerful?
Sorry, but I guess the headline is potentially misleading. Data-prep on Hadoop is in service of predictive analysis (done with tools such as Pentaho Weka, R, or, soon according to Pentaho, SAS or Metlab). The support for MongoDB is a separate thing, only for BI/data-visualization style analysis (not predictive work) on the data managed by MongoDB. The two are not connected other than the fact that they are both capabilities introduced in Pentaho 5.1.
Charlie Babcock
50%
50%
Charlie Babcock,
User Rank: Author
7/8/2014 | 3:24:06 PM
Pentaho system, ungainly or powerful?
To "befriend the data scientist" is no easy task. It's all too easy to be a friend to few, stranger to many. The combinatin of Hadoop with YARN on top for data prep, with the rsults plugged into MongoDB sounds like a powerful system -- as long as the movement between the two of them is smooth.  
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
7/8/2014 | 1:13:09 PM
Data integration vendors are hot to get in on big data
When Hadoop first emerged, we all heard it would displace ETL. That's at least partially true, for some transformation processing, but now data-integration vendors -- like Informatica, Paxata, and, now Pentaho -- are saying their stuff is needed for all sorts of data prep and processing ahead of big-data analysis. It's another case of offering an alternative to clunky MapReduce processing, but I haven't talked to enough customers who have validated how useful these tools can be in big-data-analysis scenarios.

The "80% of the work" line above seems like a relic of relational data warehousing approches, but I need to hear from more practitioners -- yes, this is a naked plea for comments from practitioners -- before passing this off as an overstatement or marketing ploy.


The State of Cloud Computing - Fall 2020
The State of Cloud Computing - Fall 2020
Download this report to compare how cloud usage and spending patterns have changed in 2020, and how respondents think they'll evolve over the next two years.
InformationWeek Is Getting an Upgrade!

Find out more about our plans to improve the look, functionality, and performance of the InformationWeek site in the coming months.

Slideshows
11 Things IT Professionals Wish They Knew Earlier in Their Careers
Lisa Morgan, Freelance Writer,  4/6/2021
News
Time to Shift Your Job Search Out of Neutral
Jessica Davis, Senior Editor, Enterprise Apps,  3/31/2021
Commentary
Does Identity Hinder Hybrid-Cloud and Multi-Cloud Adoption?
Joao-Pierre S. Ruth, Senior Writer,  4/1/2021
Register for InformationWeek Newsletters
Video
Current Issue
Successful Strategies for Digital Transformation
Download this report to learn about the latest technologies and best practices or ensuring a successful transition from outdated business transformation tactics.
White Papers
Slideshows
Twitter Feed
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.
Sponsored Video
Flash Poll