12 December, 2012
New Digital Universe Study Reveals Big Data Gap: Less than 1% of World’s Data is Analysed; Less than 20% is Protected
Opportunities Abound for Companies Capable of Protecting and Extracting Value from an Expanding Universe of Data; By
2020, Emerging Markets Will Supplant the Developed World as the Main Producer of the World’s Data.
HOPKINTON, MA – December 11, 2012
• New IDC Digital Universe study, “Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East”¹
(sponsored by EMC) finds that only a tiny fraction of the world’s Big Data potential is being realised, though the
amount of useful data is expanding.
• IDC projects that the digital universe will reach 40 zettabytes (ZB) by 2020, an amount that exceeds previous
forecasts by 5 ZBs, resulting in 50-fold growth from the beginning of 2010
• This year’s study marks the first time IDC was able to capture where the information in the digital universe
either originated or was first captured or consumed, revealing some dramatic shifts currently underway.
• The amount of data that requires protection is growing faster than the digital universe itself, yet levels of
protection are not keeping pace.
• According to the study, 2.8 ZB of data will have been created and replicated in 2012.
• Machine-generated data is a key driver in the growth of the world’s data – which is projected to increase 15x by
• By 2020, emerging markets will supplant the developed world as the main producer of the world’s data.
• The investment in spending on IT hardware, software, services, telecommunications and staff that could be
considered the “infrastructure” of the digital universe will grow by 40% between 2012 and 2020. Investment in targeted
areas like storage management, security, Big Data and cloud computing will grow considerably faster.
• Join the #digitaluniverse and #emc conversations on Twitter
Corporation (NYSE:EMC) today announced results of the EMC-sponsored IDC Digital Universe
study, “Big Data
, Bigger Digital Shadows, and Biggest Growth in the Far East” – which found that despite the unprecedented expansion of
the digital universe due to the massive amounts of data being generated daily by people and machines, IDC estimates that
only 0.5% of the world’s data is being analysed.
To view the multimedia version of this news release click here
The proliferation of devices such as PCs and smartphones worldwide, increased Internet access within emerging markets
and the boost in data from machines such as surveillance cameras or smart metres has contributed to the doubling of the
digital universe within the past two years alone – to a mammoth 2.8 ZB. IDC projects that the digital universe will
reach 40 ZB by 2020, an amount that exceeds previous forecasts by 14%.
In terms of sheer volume, 40 ZB of data is equivalent to:
• There are 700,500,000,000,000,000,000 grains of sand on all the beaches on earth (or seven hundred quintillion
five hundred quadrillion). That means 40 ZB is equal to 57 times the amount of all the grains of sand on all the beaches
• If we could save all 40 ZB onto today’s Blu-ray discs, the weight of those discs (without any sleeves or cases)
would be the same as 424 Nimitz-class aircraft carriers.
• In 2020, 40 ZB will be 5,247 GB per person worldwide.
This year’s study marks the first time IDC was able to capture where the information in the digital universe either
originated or was first captured or consumed, revealing some dramatic shifts currently underway. Now in its sixth year,
the study – measuring and forecasting the amount of digital information created and copied annually – includes findings
around the “Big Data Gap”, which is the gap between the amount of data with hidden value and the amount of value that is
actually being extracted; the level of data protection required versus what is being delivered; and the geographic
implications of the world’s data.
•Rapid expansion of the digital universe: IDC projects that the digital universe will reach 40 ZB by 2020, an amount that exceeds previous forecasts.
o The digital universe will double every two years between now and 2020.
o There will be approximately 5,247 GB of data for every man, woman and child on earth in 2020.
o A major factor behind the expansion of the digital universe is the growth of machine generated data, increasing
from 11% of the digital universe in 2005 to over 40% in 2020.
•Large quantities of useful data are getting lost: The promise of Big Data
lies within the extraction of value from large, untapped pools of data. However, the majority of new data is largely
untagged file-based and unstructured data
, which means little is known about it.
o In 2012, 23% (643 exabytes) of the digital universe would be useful for Big Data if tagged and analysed.
However, currently only 3% of the potentially useful data is tagged, and even less is analysed.
o The amount of useful data is expanding with the growth of the digital universe. By 2020, 33% of the digital
universe (13,000+ exabytes) will have Big Data value if it is tagged and analysed.
•Much of the digital universe is unprotected: The amount of data that requires protection is growing faster than the digital universe itself.
o Less than a third of the digital universe required data protection
in 2010, but that proportion is expected to exceed 40% by 2020.
o In 2012, while about 35% of the information in the digital universe required some type of data protection, less
than 20% of the digital universe actually has these protections.
o The level of protection varies by region, with much less protection in the emerging markets.
o Challenges such as advanced threats
, the security skills gap and lack of adherence to security best practices among consumers and corporations will
continue to compound the issue.
•A geographic role-reversal is around the corner: Although the digital universe was a developed-world phenomenon in the early days, that is about to change as the
population of the emerging markets begins to cast a longer shadow.
o While emerging markets accounted for 23% of the digital universe as recently as 2010, their share is already up
to 36% in 2012.
o By 2020, IDC predicts that 62% of the digital universe will be attributable to emerging markets.
o The current global breakdown of the digital universe is: U.S. – 32%, Western Europe – 19%, China – 13%, India –
4%, rest of the world – 32%.
o By 2020, China alone is expected to generate 22% of the world’s data.
Other Key Findings:
• As cloud computing
plays an even more important role in the management of Big Data, the number of servers worldwide is expected to grow
tenfold and the amount of information managed directly by enterprise data centres will grow by a factor of 14.
• The type of data stored in the cloud will also experience a radical transformation over the next few years. By
2020, IDC predicts that 46.7% of data stored in the cloud will be related to entertainment – not enterprise data.
Surveillance data, embedded and medical data, and information created by computers, phones and consumer electronics will
make up the remainder.
• The amount of information stored in the digital universe about individual users exceeds the amount of data that
they themselves create.
• Western Europe is currently investing the most to manage the digital universe, spending $2.49 USD per GB. The
U.S. comes in second, investing $1.77 per GB, followed by China at $1.31 per GB and India at $0.87 per GB.
• As the infrastructure of the digital universe becomes ever more connected, information won’t reside within the
region where it is consumed, nor will it need to. By 2020, IDC estimates that nearly 40% of data will be “touched” by
cloud computing (private and public), meaning that somewhere between a byte’s origination and consumption, it will be
stored or processed in a cloud.
Jeremy Burton, Executive Vice President, Product Operations and Marketing, EMC Corporation
“As the volume and complexity of data barraging businesses from all angles increases, IT organisations have a choice:
they can either succumb to information-overload paralysis, or they can take steps to harness the tremendous potential
teeming within all of those data streams. This year’s study underscores the massive opportunity that exists for
businesses that not only identify the potential benefits of the digital universe, but recognise the importance of
navigating that universe with the right balance of technology, data security
practices and IT skills. At EMC, we’re uniquely positioned to help customers manage, protect and unlock game-changing
value from data that translates directly into competitive advantage.”
• Learn more about the Digital Universe
• Connect with EMC via Twitter