Tashi: Your Faithful Cluster Manager

Tashi: Cloud Computing on Big Data

Tashi is a research project at Intel Labs Pittsburgh designed to investigate the implications of cloud computing on big data. As part of this effort, researchers at Intel Labs Pittsburgh are also contributing to the development of an open source cluster management system from the Apache Software Foundation's (ASF) Incubator, also called Tashi (the initial proposal). Key initial contributors include Intel Labs Pittsburgh and the Parallel Data Laboratory at Carnegie Mellon University.

The Tashi cluster management system leverages virtual machine technology to enable deployment of many virtual clusters with differing software requirements on a single physical cluster, and a major research goal of the project is the development of mechanisms that will enable efficient access to cluster resources, such as power and storage, from these independent virtual clusters.

Tashi is deployed on the Open Cirrus cluster at Intel Labs Pittsburgh. This 200-node+ cluster comprises more than 1500 cores and 750 disks (providing more than 0.5 PB of raw storage). Tashi is a key software component that enables the Big Data cluster to participate in the OpenCirrus cluster testbed.

Intel Labs Pittsburgh Team



  • "Cloud Computing on Rich Data," Michael Kozuch, Jason Campbell, Madeleine Glick, Padmanabhan Pillai, Intel Technology Journal, Volume 14, Issue 1, 2010, Pages 114-127. [link]
  • "Optimality analysis of energy-performance trade-off for server farm management," Anshul Gandhi, Varun Gupta, Mor Harchol-Balter, Michael A. Kozuch, Performance Evaluation, Volume 67, Issue 11, Performance 2010, November 2010, Pages 1155-1171.
  • "Robust and Flexible Power-Proportional Storage," Hrishikesh Amur, James Cipar, Varun Gupta, Michael Kozuch, Gregory Ganger, Karsten Schwan, Symposium on Cloud Computing (SOCC), June 2010.
  • "Cluster Fault-Tolerance: An Experimental Evaluation of Checkpointing and MapReduce through Simulation," Thomas C. Bressoud and Michael A. Kozuch, 2009 IEEE International Conference on Cluster Computing (Cluster'09), September 2009.
  • "Tashi: Location-aware Cluster Management", Michael A. Kozuch, Michael P. Ryan, Richard Gass, Steven W. Schlosser, David O'Hallaron, James Cipar, Elie Krevat, Julio L√≥pez, Michael Stroucken, Gregory R. Ganger, First Workshop on Automated Control for Datacenters and Clouds (ACDC'09), June 2009.
  • "Migration without Virtualization", Michael A. Kozuch, Michael Kaminsky, Michael P. Ryan, Workshop on Hot Topics in Operating Systems (HotOS '09), May 2009.
  • "Tashi: Cloud Computing on Big Data", PDL Packet: Newsletter on PDL Activities and Events, (Adapted from ACDC'09), [pdf].