Cloudera And Informatica Team To Optimize The Data Warehouse

Redwood City, Calif., and New York, Oct. 29, 2013 (GLOBE NEWSWIRE) -- At the Strata Conference + Hadoop World 2013, InformaticaCorporation (Nasdaq:INFA), the world's number one independentprovider of dataintegration software, and Cloudera, the leader in enterpriseanalytic data management powered by Apache Hadoop™,  support services and training, todayannounced a jointly designed reference architecture foroptimizing data warehouses for today's data-driven businessworld.

The new Data Warehouse Optimization (DWO) reference architecturespecifically for Enterprise Data Hub deployments addresses thechallenges facing traditional data warehouse infrastructures, wherecapacity is too quickly consumed by increasing data volumes,leading to performance bottlenecks and costly upgrades. The DWOarchitecture empowers companies to optimally deploy an EnterpriseData Hub, a central system to land and work with all data in avariety of ways, together with the tools, security and governancecustomers require. An Enterprise Data Hub is a complementarytechnology to data warehouse implementations, enabling them tostore and process data at any scale, to dramatically reduce datawarehouse costs, and to boost developer productivity by up to afactor of five.

The proven core building blocks for implementing the DWOarchitecture are Cloudera Enterprise, a subscription offering thatcombines  CDH, Cloudera's 100 percent open sourcedistribution of Apache Hadoop, Cloudera Manager and Cloudera Navigator and Informatica PowerCenterBig Data Edition powered by Informatica Vibe. InformaticaVibe is the world's first and only embeddable virtual data machine(VDM), with "map once, deploy anywhere" data integration.

"Legacy environments are not going away, but they need to beaugmented by Hadoop-based solutions to meet the demands of bigdata," said Todd Goldman, vice president and general manager,Enterprise Data Integration, Informatica. "The Cloudera andInformatica Data Warehouse Optimization reference architecturehelps companies leverage their existing environment with emergingtechnologies using readily available skills, so organizations canmore affordably and efficiently unlock the massive potential of bigdata." 

Fast-growing data volumes and new types of data sources, rangingfrom cloud and mobile apps to social media and machine data, areplacing substantial demands on current data warehouseinfrastructures. To optimize their data warehouse environments,organizations are seeking ways to support unlimited data volumeswhile leveraging industry-standard hardware and software to reduceinfrastructure costs and existing skills to minimize operationalcosts. They are also seeking ways to support all types of data, andeasily integrate new and existing types ofinfrastructure. 

"One of the best ways to introduce Cloudera into anorganization's data management infrastructure is to start byoptimizing the data warehouse environment," said Charles Zedlewski,vice president, Products, Cloudera.  "The Cloudera andInformatica DWO reference architecture has the dual benefit ofdramatically lowering costs and providing an enterprise-ready dataplatform that cost-effectively scales to meet the data storage andprocessing requirements for big data projects."

The DWO reference architecture addresses all these requirementsthrough the combination of Informatica and Cloudera technologies.Informatica delivers a broad and mature set of data integration anddata management capabilities around Hadoop. Cloudera Enterpriseenables cost-effective, scalable storage and processing oncommodity infrastructure, along with enterprise-grade security,high availability, cluster management, and low-latencyquerying.  The joint reference architecture includestechnologies and solutions that:

·         Lowerinfrastructure and operational costs - Delivers the killer appon Cloudera, so organizations can cost-effectively scale datastorage and processing on industry-standard hardware andopen-source software using readily available resource skills.

·         Useexisting resource skills to staff projects - Many datawarehouse organizations already have ETL developers and consultantson staff trained on Informatica.  With the InformaticaPowerCenter Big Data Edition, every Informatica developer is now aHadoop developer without having to become a Hadoop expert. WithInformatica's and Cloudera's world-class support and trainingorganizations, users can staff the development and administrationof data warehouse projects on Cloudera with readily availableresource skills.

·         Futureproof the data warehouse and drive productivity - InformaticaVibe enables data integration and ETL processes to be written justonce and deployed anywhere. This means that existing ETL processescreated using Informatica's codeless visual development paradigmcan be redeployed on Cloudera Enterprise with minimal effort,resulting in a more resilient data warehouse infrastructure and anup-to-5x productivity gain for developers.  Rapid developmentis further enhanced with Informatica's Vibe for rapid ETLprototyping and Cloudera's Impala for real-time interactive queriesto discover insights faster.

·         Optimizedata warehouse performance - Informatica PowerCenter Big DataEdition deploys on Cloudera Enterprise to load, profile, parse andtransform for analysis of data in a high performance andcost-effective fashion. Optimal processing flows can be definedquickly using Informatica's visual design interface and extensivelibrary of pre-built transforms.

·         Handlevirtually all types of data and sources - With Informatica,nearly all types of data - including legacy, ERP, CRM, social andmachine - can be accessed and integrated through a variety ofmethods ranging from batch to replication, change data capture(CDC) and real-time streaming. Newly released Informatica Vibe DataStream for Machine Data technology, for example, collects andstreams high-volume, real-time machine data into Hadoop to drivenew levels of operational intelligence.

·         Ensuredata quality - Informatica Data Quality Big Data Editionexecutes data quality and matching rules on Cloudera Enterprise toensure trust in the data.

·         Ensureenterprise-ready deployments that meet business SLAs - WithInformatica's Vibe, "Map Once, Deploy Anywhere", virtual datamachine technology, users can immediately deploy ETL jobs fromdevelopment into production. The combination of Informatica'sunified administration and Cloudera Manager makes it easy to manageETL workloads on Cloudera for data warehouse projects.

The Data Warehouse Optimization reference architecture fromCloudera and Informatica is available now for implementation. Toview a Solution Brief, "Cloudera & Informatica Unleash thePower of Hadoop," click here

Visit Informatica at Kiosk 63 and Cloudera at Booth 403at the Strata Conference + Hadoop World 2013, Oct. 28-30 at the NewYork Hilton Midtown.

Tweet this: News: @Cloudera and@InformaticaCorp Team to Optimize the #DataWarehouse http://bit.ly/1bqFv3c

About Informatica

Informatica Corporation (Nasdaq:INFA) is the world's number oneindependent provider of dataintegration software. Organizations around the world rely onInformatica to realize their informationpotential and drive top business imperatives. Informatica Vibe, theindustry's first and only embeddable virtual data machine (VDM),powers the unique "Map Once. Deploy Anywhere." capabilities of theInformatica Platform. Worldwide, over 5,000 enterprises depend onInformatica to fully leverage their information assets from devicesto mobile to social to big data residing on-premise, in the Cloud and across social networks. For moreinformation, call +1 650-385-5000 (1-800-653-3871 in the U.S.), orvisit www.informatica.com.Connect with Informatica at http://www.facebook.com/InformaticaCorporation, http://www.linkedin.com/company/informaticaand http://twitter.com/InformaticaCorp.

About Cloudera

Founded in 2008, Cloudera pioneered the business case for Hadoopwith CDH, the world's most comprehensive, thoroughly tested andwidely deployed 100% open source distribution of Apache Hadoop inboth commercial and non-commercial environments. Now, the companyis redefining data management with its Platform for Big Data,Cloudera Enterprise, empowering enterprises to Ask BiggerQuestions™ and gain rich, actionable insights from all their data,to quickly and easily derive real business value that translatesinto competitive advantage. As the top contributor to the Apacheopen source community and leading educator of data professionalswith the broadest array of Hadoop training and certificationprograms, Cloudera also offers comprehensive consulting services.Over 700 partners across hardware, software and services haveteamed with Cloudera to help meet organizations' big data goals.With tens of thousands of nodes under management and hundreds ofcustomers across diverse markets, Cloudera is the category leaderthat has set the standard for Hadoop in the enterprise. www.cloudera.com.

###

Note: Informatica, PowerCenter, Informatica DataQuality, the Informatica Platform and InformaticaVibe are trademarks or registered trademarks of InformaticaCorporation in the United States and in jurisdictions throughoutthe world. All other company and product names may be trade namesor trademarks of their respective owners.

CONTACT: Steve Bauer         Informatica Corporation         +1 650 385 4159         +1 650 670 7135         stbauer@informatica.com                  Deborah Wiltshire         Cloudera         +1 650 644 3900 x5907         +1 650 862 8186         dwiltshire@cloudera.com

Informatica Corp.