Home > Blogs > VMware vFabric Blog


Cloudera Gets More Cloudy: Partners and Certifies CDH4 on vSphere

Today, we are excited to welcome Cloudera officially to the VMware family. VMware and Cloudera have entered into a partnership agreement that is meant to help users of Cloudera’s Hadoop distribution, CDH4, to run in the cloud. As part of this announcement, VMware has tested and certified Cloudera’s Enterprise Big Data software to run on vSphere 5.1 and that Cloudera is now part of the VMware Ready and Technical Alliances Partner (TAP) program.

This month at EMC World, VMware CEO Pat Gelsinger stated that over 500,000 Hadoop installations exist today on bare metal servers, with compute and data tied to the same physical server. By breaking compute and data apart, and putting it on fast-to-deploy vSphere virtual machines, big data becomes inherently more accessible, compute times can improve by up to 13%, and datacenters can optimize to provide more types of data services without adding more hardware.

It comes at a time where both the volume of data is exploding and, according to PwC’s 5th Annual Digital IQ Survey,  83% of their top performing companies believe that harnessing Big Data will give their firms a competitive advantage. As such, many CIOs are formally aligning their agenda to invest in big data this year.

Download vSphere

Click Here

Download CDH4

Click Here

In fact, a study released earlier this year by NewVantage Partners with found that investment is happening faster than we think already. 85% of their 50 respondents (most with more than 30,000 employees) said they were already investing in Big Data.  Many of these are still in Phase I, where they are experimenting with a pilot project. At this point in the adoption cycle, they are finding value and planning on how to expand investment and increase ROI of data mining.

This is the point that both VMware and Cloudera hope to begin to working with these data teams. As they gain experience and focus in servicing their own data needs, they will be looking for ways to do this more efficiently in the datacenter as well as expand Big Data’s reach within the organization—making it more accessible for employees. This is when they should consider moving their Hadoop data loads to the cloud for the following reasons:

  • Setup new compute processes in minutes, not hours or days
  • Better hardware utilization and consolidation by running mixed workloads
  • Performance improvements through pooling of resources
  • High Availability/Fault Tolerance through vSphere Enterprise & Enterprise+
  • Make Big Data projects more accessible by offering Hadoop-as-a-Service

Cloudera’s CDH4 is available now to run on vSphere 5.1 and above. Additionally, both companies have agreed to collaboratively work together to support calls from customers running the two technologies together, although each company will only provide patches or technical support directly for their respective products.

For more information on running Cloudera CDH4 on vSphere, contact your VMware or Cloudera Sales Rep.

This entry was posted in Serengeti on by .
Stacey Schneider

About Stacey Schneider

Stacey Schneider has over 15 years of working with technology, with a focus on working with sales and marketing automation as well as internationalization. Schneider has held roles in services, engineering, products and was the former head of marketing and community for Hyperic before it was acquired by SpringSource and VMware. She is now working as a product marketing manager across the vFabric products at VMware, including supporting Hyperic. Prior to Hyperic, Schneider held various positions at CRM software pioneer Siebel Systems, including Group Director of Technology Product Marketing, a role for which her contributions awarded her a patent. Schneider received her BS in Economics with a focus in International Business from the Pennsylvania State University.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>