To ensure that more organizations and people can use the vast amounts of data being generated, collected and stored everyday – also known as “big data” – Intel Corporation announced the availability of Intel
Distribution for Apache Hadoop* software (Intel
Distribution). The offering, which includes Intel
Manager for Apache Hadoop* software, is built from the silicon up to deliver industry-leading performance and improved security features.
The ability to analyze and make sense of big data has profound potential to transform society by enabling new scientific discoveries, business models and consumer experiences. Yet, only a small fraction of the world is able to extract meaning from all of this information because the technologies, techniques and skills available today are either too rigid for the data types or too expensive to deploy.
Hadoop* is an open source framework for storing and processing large volumes of diverse data on a scalable cluster of servers that has emerged as the preferred platform for managing big data. With even more information coming from billions of sensors and intelligent systems also on the horizon, the framework must remain open and scalable as well as deliver on the demanding requirements of enterprise-grade performance, security and manageability.
“People and machines are producing valuable information that could enrich our lives in so many ways, from pinpoint accuracy in predicting severe weather to developing customized treatments for terminal diseases,” said Boyd Davis, vice president and general manager of Intel’s Datacenter Software Division. “Intel is committed to contributing its enhancements made to use all of the computing horsepower available to the open source community to provide the industry with a better foundation from which it can push the limits of innovation and realize the transformational opportunity of big data.”
Performance and Security: The Intel Difference
Intel is delivering an innovative open platform built on Apache Hadoop* that can keep pace with the rapid evolution of big data analytics. The Intel Distribution is the first to provide complete encryption with support of Intel
AES New Instructions (Intel
AES-NI) in the Intel
processor. By incorporating silicon-based encryption support of the Hadoop Distributed File System*, organizations can now more securely analyze their data sets without compromising performance.
The optimizations made for the networking and IO technologies in the Intel Xeon
processor platform also enable new levels of analytic performance. Analyzing one terabyte of data, which would previously take more than 4 hours to fully process, can now be done in 7 minutes
thanks to the data-crunching combination of Intel’s hardware and
the Intel Distribution. Considering Intel estimates that the world generates 1 petabyte (1,000 terabytes) of data every 11 seconds or the equivalent of 13 years of HD video, the power of Intel technology opens up the world to even greater possibilities.
For example, in a hospital setting, the intelligence derived from this data could help improve patient care by helping caregivers make quicker and more accurate diagnoses, determine effectiveness of drugs, drug interactions, dosage recommendations and potential side effects through the analysis of millions of electronic medical records, public health data and claims records. Strict guidelines also exist globally for protecting health and payment information, making it imperative to maintain security and privacy while performing analytics.