Recent Posts

Big Data Video – Benefits of Virtualizing Big Data on vSphere

posted

We often get questions from people who are new to big data about the reasons for virtualizing these newer infrastructures and applications. People are also interested in knowing the benefits you can gain from doing so. This video provides a set of answers to those questions. Summarizing the main points discussed in the video, the Read more...

Why the Data Scientist and Data Engineer Need to Understand Virtualization in the Cloud

posted

More and more application workloads are moving to the different cloud platforms, whether they be public, private or hybrid clouds. Big data and analytics application workloads are on the move too. It is important that the data science/data engineering users of big data platforms and analytics applications gain a good understanding of the infrastructure in these clouds Read more...

Big Data Performance and Best Practices: New Spark Application Measurements

posted

The Performance Engineering team at VMware has produced another highly useful report and blog on best practice/performance work they have done in the Big Data area. This new report contains test result data from benchmark tests conducted using Spark-based as well as MapReduce applications. The report also gives you specific advice on best practice implementations. Spark was originally Read more...

Eight Myths about Virtualizing Hadoop Dispelled

posted

This article takes eight common misperceptions about virtualizing Hadoop and explains why they are errors in people’s understanding. The short explanations given should serve to clear up the understanding about these important topics. Myth #1: Virtualization may add significant performance overhead to a Hadoop cluster. This is a common question from users who are in the Read more...