Posts Tagged ‘roce’

Josh Simons

Paravirtual RDMA for Low Latency and Flexibility

December 27, 2012

The Office of the CTO has been exploring how best to enable application access to RDMA for applications that require the ultimate in high-bandwidth, low-latency communication, including many HPC MPI applications as well as many scale-out databases and Big Data approaches. Passthrough mode is the most straightforward way to enable guest-level RDMA. With passthrough (which we call VM DirectPath I/O), a physical PCI device can be made directly visible to the guest operating system running within the virtual machine. We published a research note showing that this approach delivers very good InfiniBand latencies (under 2 µs) and excellent bandwidths over a wide range of message sizes. There is a downside, however: punching through the virtual machine abstraction in this way disables several...

Josh Simons

Summer of RDMA

July 8, 2011

Those involved in HPC understand the need for low-latency communication for many parallel distributed applications and for applications whose storage requirements generate lots of small-message traffic. But it would be a mistake to assume low latency is an HPC-only issue. Why? Well, for two reasons. First, as enterprise software architectures become more horizontally scaled (also referred to as “scale-out”), the performance of the link connecting communicating endpoints becomes much more of a first-class determinant of overall application performance, much like in HPC. Middleware examples include memcached, vFabric GemFire, and Hadoop. Oracle’s Exadata and Exalogic products are another example in that these appliances use an InfiniBand interconnect (the most popular high-speed interconnect technology...

Josh Simons

Our Joint VMware / AMAX HPC Collaboration

April 27, 2011

I am excited we have now started our joint HPC exploration with our partner, AMAX. Based on an initial meeting on the show floor at VMworld in San Francisco last year, we decided to work together to examine several aspects of virtualized HPC of mutual interest. Areas where we see converging requirements between HPC and Enterprise customers are of particular interest to VMware as an Enterprise software company looking at broader markets and to AMAX as a dynamic computing solutions provider to HPC and Enterprise, and now Cloud customers. We are starting with Hadoop since scale-out data analytics is rapidly becoming an important workload in the Enterprise while Data Intensive Computing is simultaneously rising...

Josh Simons

OpenFabrics Workshop 2011

April 8, 2011

I attended the OpenFabrics Workshop in Monterey earlier this week and delivered a talk on converging requirements between HPC, Enterprise, and Cloud with an emphasis on interconnect-specific issues, specifically RDMA. With its lower latencies and lower CPU utilization along with the potential for higher bandwidths, RDMA is of increasing interest for addressing the requirements of important new Enterprise application classes (discussed more below). For those not familiar, the OpenFabrics Alliance is an organization wrapped around the OpenFabrics Enterprise Distribution (OFED) open source community that creates and tests binary releases of the OFED stack for Linux and Windows. They also coordinate patches, offer training courses, and do marketing activities to promote the OFED brand. OFED is the software...
