Yahoo Releases Massive Data Set To Academic Institutions - InformationWeek

Comments
Yahoo Releases Massive Data Set To Academic Institutions
Technocrati,
User Rank: Ninja
1/18/2016 | 8:32:49 PM
Re: Yahoo!

@jastroff I agree with your skepticism regarding large datasets. I suspect you are correct: by the time anything useful can be gleaned, it will be outdated and useless.

I am sure the data is encrypted, but there is always the possibility it can be decrypted, and then what do we have?

Sounds to me like a data breach.

jastroff,
User Rank: Ninja
1/16/2016 | 5:37:18 PM
Re: Yahoo!
Any academic would like to see Amazon's business plan and tax statements. We could do much more with those than with large datasets. But I'm just jaded... sorry.
jastroff,
User Rank: Ninja
1/16/2016 | 5:35:44 PM
Re: Yahoo!
Well, ok. But as one of those researchers who had access to large datasets from various places in a past life, I found it took a long time to get any results. Mostly, the projects were grant-funded and not many people cared. The results might become conference papers or dissertations, and they were out of date before they hit the page.

This may be different with Google, etc., but I'm not seeing anything that says "hey, this is why more men than women or more kids than adults do x" or "people in Georgia don't access online news..."

And even if I did see it, the world is changing so fast, and the data with it, that I'm not sure it would make a difference. Do you, Brian?

Now, I would be interested in how governments and other organizations are using the data. They might get something out of it when it is added to everything else they know. But maybe not.

>> Google is great in this regard. Any individual can utilize Google's real-time and stored data to research the keywords that users are typing into their search engine. The research can be split into geographic and demographic data, and Google does not mind whether the data is being utilized for commercial or academic purposes.
danielcawrey,
User Rank: Ninja
1/16/2016 | 4:29:42 PM
Re: Yahoo!
I had never previously considered the challenges researchers have accessing large datasets. I'm sure this release of data from Yahoo is going to make a number of academics really happy. Most big data is understandably kept locked up by the owner, but hopefully we're going to see more massive datasets released for research. It might lead to a better understanding of what all of this data we generate really means. 
Brian.Dean,
User Rank: Ninja
1/16/2016 | 3:35:37 PM
Re: Yahoo!
Google is great in this regard. Any individual can utilize Google's real-time and stored data to research the keywords that users are typing into their search engine. The research can be split into geographic and demographic data, and Google does not mind whether the data is being utilized for commercial or academic purposes.
jastroff,
User Rank: Ninja
1/16/2016 | 3:08:08 PM
Re: Yahoo!
By the time the new theories are created, Yahoo will be gone, and the users a memory.

I suspect you can't get a dataset out of Amazon or Facebook.
Brian.Dean,
User Rank: Ninja
1/15/2016 | 1:21:57 PM
Yahoo!
This is a great move by Yahoo, as the data could be utilized to explain and/or build new theories in the social sciences, etc. I wonder whether limiting the data set to a quarter of a year's data will prevent academic institutions from researching seasonal changes in user interaction, or whether the data is already large enough that academic institutions will spend years processing it.

