Data Scientist: Human Today, Software Tomorrow - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
IT Leadership
Commentary
1/21/2015
09:21 AM
Jeff Bertolucci
Jeff Bertolucci
Commentary
Connect Directly
Google+
RSS
50%
50%

Data Scientist: Human Today, Software Tomorrow

Automation will lessen the need for the elusive, talented, and expensive human data scientist -- and that's a good thing, says Narrative Science cofounder.

Data scientists, you may look sexy today, but automation will win our hearts in the end.

9 CIO Tech Priorities For 2015
9 CIO Tech Priorities For 2015
(Click image for larger view and slideshow.)

So says Narrative Science cofounder and chief scientist Kris Hammond, who predicts that 2015 will bring less investment in "human-powered data science" and more in automated software tools that mine big data to unlock value insights.

In an interview with InformationWeek, Hammond, who doubles as a computer science professor at Northwestern University, expounded on a variety of data-focused topics, including what he sees as the end of the "data-hoarding era" in the enterprise and the emergence of artificial intelligence (sans the killer robots) in mainstream life.

Narrative Science provides one type of this kind of automation. The company makes natural language-generation software, most notably Quill, which examines big data feeds, extracts information relevant to the user, and generates -- or writes -- reports in human language, typically English. Its competitors include Automated Insights, whose Wordsmith platform also generates written reports -- sometimes thousands per second – including online sports and finance stories you may have read.

[Not sold on automation? See How To Build A Data-Driven Dream Team.]

Hammond has nothing against data scientists -- he's one himself. But he believes the explosive growth of big data technologies will require a more automated approach to information analysis. Besides, data scientists are expensive to employ, and there simply aren't enough of them to go around, he claims.

"They're never going to be able to scale into the kind of reporting that is absolutely essential for organizations now," said Hammond.

(Source: Geralt/Pixabay)
(Source: Geralt/Pixabay)

Data scientists are often called upon to do relatively mundane tasks that don't put their data analysis skills to good use. One example is "being asked to spend my day looking at sales figures for 10,000 stores and write reports based upon those sales figures," said Hammond.

He added: "If I were asked to do that, I could do it. It actually requires some of my skills, but it would kill me. It would drive me mad because, in fact, that's not me using my skills at the high end of my skill set."

This dichotomy between the mundane and magnificent is commonplace in today's data science teams. And as we pull in increasing volumes of data, such as the data streaming in from billions of Internet-connected devices, the need for automation becomes more apparent.

"Data scientists, because they're so few of them and they're so expensive, and [because] they want to work on hard and interesting problems, are not going to help us get to the nuts and bolts of understanding data, or even communicate the basics of what's going on in the world," Hammond said.

A greater reliance on automated analysis might help enterprises extract value from their swelling big data stockpiles, which increasingly measure in the petabytes. "We don't all have to become data scientists in order to work with the machine," Hammond said. "The machine needs to become more human and work with us."

Attend Interop Las Vegas, the leading independent technology conference and expo series designed to inspire, inform, and connect the world's IT community. In 2015, look for all new programs, networking opportunities, and classes that will help you set your organization’s IT action plan. It happens April 27 to May 1. Register with Discount Code MPOIWK for $200 off Total Access & Conference Passes.

Jeff Bertolucci is a technology journalist in Los Angeles who writes mostly for Kiplinger's Personal Finance, The Saturday Evening Post, and InformationWeek. View Full Bio
We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
1/22/2015 | 8:55:09 AM
Data analysts using the least of their skills
I've heard the complaint about data analysts wasting time on mundane reporting tasks for years. As I report in this slide show on CIO priorities for 2015, GameStop had the same problem. "We were getting more questions than we had time to respond to, so our analytics team was turning into a reporting team," said Jason Kappel, GameStop's director, CRM, at the recent NRF (National Retail Federation) Big Show in New York.

Using the combination of Tableau Software for dashboard-based reporting and Alteryx for deeper analytics, GameStop was able to set up more of a self-service environment for business users. "We took Monday-morning reporting routines that were taking half a day and brought them down to about 5 minutes," said Kappel.

This common problem has been the impetus of the trend toward self-service reporting, which has led to fast growth for business-user-friendly BI tools such as Tableau, Qlik, and many imitators who have since added data-exploration and data-visualization products.
pfretty
100%
0%
pfretty,
User Rank: Ninja
1/21/2015 | 10:07:20 AM
Maybe
While automation will undoubtedly help organizations with some of the tedious tasks currently performed by data scientists, the evolution will only afford more time for specialists to utilize their talents to continuously come up with creative new ways to leverage data.  We need to keep in perspective that the big data analysis space is still maturing. Most organizations still have quite a ways to go before they are truly effective at utilizing data. It is something that comes with time, experience and honestly learning from failures/mishaps. 

Peter Fretty, IDG blogger working on behalf of SAS

 
Slideshows
What Digital Transformation Is (And Isn't)
Cynthia Harvey, Freelance Journalist, InformationWeek,  12/4/2019
Commentary
Watch Out for New Barriers to Faster Software Development
Lisa Morgan, Freelance Writer,  12/3/2019
Commentary
If DevOps Is So Awesome, Why Is Your Initiative Failing?
Guest Commentary, Guest Commentary,  12/2/2019
White Papers
Register for InformationWeek Newsletters
State of the Cloud
State of the Cloud
Cloud has drastically changed how IT organizations consume and deploy services in the digital age. This research report will delve into public, private and hybrid cloud adoption trends, with a special focus on infrastructure as a service and its role in the enterprise. Find out the challenges organizations are experiencing, and the technologies and strategies they are using to manage and mitigate those challenges today.
Video
Current Issue
The Cloud Gets Ready for the 20's
This IT Trend Report explores how cloud computing is being shaped for the next phase in its maturation. It will help enterprise IT decision makers and business leaders understand some of the key trends reflected emerging cloud concepts and technologies, and in enterprise cloud usage patterns. Get it today!
Slideshows
Flash Poll