Yahoo Releases Massive Data Set To Academic Institutions - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Comments
Yahoo Releases Massive Data Set To Academic Institutions
Newest First  |  Oldest First  |  Threaded View
Technocrati
50%
50%
Technocrati,
User Rank: Ninja
1/18/2016 | 8:32:49 PM
Re: Yahoo!

@jastroff   I agree with your skepticism regarding large datasets.   I suspect you are correct, by the time anything useful can be gleaned it will be outdated and useless.

I am sure the data is encrypted but there is always the possibility it can be un-encrypted and then what do we have ?

 

Sounds to me like a data breech.

jastroff
50%
50%
jastroff,
User Rank: Ninja
1/16/2016 | 5:37:18 PM
Re: Yahoo!
Any academic would like to see Amazon's business plan and tax statements. We can do much more with that than large datasets. But I'm just jaded...sorry
jastroff
50%
50%
jastroff,
User Rank: Ninja
1/16/2016 | 5:35:44 PM
Re: Yahoo!
Well, ok. But as one of those reseachers who had access to large datasets from various places in a past life, it took a long time for any results and mostly, they were grant funded and not many people cared. They could be conference papers or dissertations, and they were out of date before they hit the page.

This may be different with Google, etc. but I'm not seeing anything that says "hey, this is why more men than women or more kids than adults do x and why" or "people in Georgia don't access online news..."

And if I did see it, the world is changing so fast, and the data changes with it, I'm not sure it would make a difference. Do you, Brian? 

Now I would be interested in hos governments and other organizations are using the data  -- they might get something out of it when added to everything else they know. But maybe not.

>> Google is great in this regard. Any individual can utilize Google's real-time and stored data to research the keywords that users are typing into their search engine. The research can be split into geographic and demographic data and, Google does not mind with the data is being utilized for commercial or academic concerns.
danielcawrey
50%
50%
danielcawrey,
User Rank: Ninja
1/16/2016 | 4:29:42 PM
Re: Yahoo!
I had never previously considered the challenges researchers have accessing large datasets. I'm sure this release of data from Yahoo is going to make a number of academics really happy. Most big data is understandably kept locked up by the owner, but hopefully we're going to see more massive datasets released for research. It might lead to a better understanding of what all of this data we generate really means. 
Brian.Dean
50%
50%
Brian.Dean,
User Rank: Ninja
1/16/2016 | 3:35:37 PM
Re: Yahoo!
Google is great in this regard. Any individual can utilize Google's real-time and stored data to research the keywords that users are typing into their search engine. The research can be split into geographic and demographic data and, Google does not mind with the data is being utilized for commercial or academic concerns.
jastroff
50%
50%
jastroff,
User Rank: Ninja
1/16/2016 | 3:08:08 PM
Re: Yahoo!
By the time the neew theories are created, Yahoo will be gone, and the users a memory. 

I suspect you can't get a dataset out of Amazon, or FaceBook.
Brian.Dean
50%
50%
Brian.Dean,
User Rank: Ninja
1/15/2016 | 1:21:57 PM
Yahoo!
This is a great move by Yahoo as the data could be utilized to explain and/or build new theories in social sciences, etc. I wonder if limiting the data set to a quarter of a year's data will prohibit academic institutions to research seasonal changes in user interaction or whether the data is already large enough that academic institutions will spend years processing it. 


2020 State of DevOps Report
2020 State of DevOps Report
Download this report today to learn more about the key tools and technologies being utilized, and how organizations deal with the cultural and process changes that DevOps brings. The report also examines the barriers organizations face, as well as the rewards from DevOps including faster application delivery, higher quality products, and quicker recovery from errors in production.
Slideshows
10 Top Cloud Computing Startups
Cynthia Harvey, Freelance Journalist, InformationWeek,  8/3/2020
Commentary
Adding Fuel to the MSP vs. In-house IT Debate
Andrew Froehlich, President & Lead Network Architect, West Gate Networks,  8/6/2020
Commentary
How Enterprises Can Adopt Video Game Cloud Strategy
Joao-Pierre S. Ruth, Senior Writer,  7/28/2020
Register for InformationWeek Newsletters
Video
Current Issue
Enterprise Automation: Do More with Less
In this IT Trend Report, we highlight the benefits of automation and the various tools as enterprises navigate turbulent times, try to do more with less, keep their operations running, and stay on track with digital modernizations.
White Papers
Slideshows
Twitter Feed
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.
Sponsored Video
Flash Poll