Technology Review - Published By MIT
Advertisement

March/April 2009

TR10: HashCache

Vivek Pai's new method for storing Web content could make Internet access more affordable around the world.

By David Talbot

smaller text tool iconmedium text tool iconlarger text tool icon

Throughout the developing world, scarce Internet access is a more conspicuous and stubborn aspect of the digital divide than a dearth of computers. "In most places, networking is more expensive--not only in relative terms but even in absolute terms--than it is in United States," says Vivek Pai, a computer scientist at Princeton University. Often, even universities in poor countries can afford only low-bandwidth connections; individual users receive the equivalent of a fraction of a dial-up connection. To boost the utility of these connections, Pai and his group created HashCache, a highly efficient method of caching--that is, storing frequently accessed Web content on a local hard drive instead of using precious bandwidth to retrieve the same information repeatedly.

Closing the divide: Students surf the Web at Ghana’s Kokrobitey Institute, a conference center with an Internet connection only about four times as fast as dial-up. The link is enhanced by Princeton’s low-cost, low-power HashCache technology, which stores frequently accessed Web content.
Credit: Olivier Asselin/WPN
Multimedia
video  Watch Vivek Pai describe a new technology that could slash the cost of caching Web content.

Despite the Web's protean nature, a surprising amount of its content doesn't change often or by very much. But current caching technologies require not only large hard disks to hold data but also lots of random-access memory (RAM) to store an index that contains the "address" of each piece of content on the disk. RAM is expensive relative to hard-disk capacity, and it works only when supplied with electricity--which, like bandwidth, is often both expensive and scarce in the developing world.

HashCache abolishes the index, slashing RAM and electricity requirements by roughly a factor of 10. It starts by transforming the URL of each stored Web "object"--an image, graphic, or block of text on a Web page--into a shorter number, using a bit of math called a hash function. While most other caching systems do this, they also store each hash number in a RAM-hogging table that correlates it with a hard-disk memory address. Pai's technology can skip this step because it uses a novel hash function: the number that the function produces defines the spot on the disk where the corresponding Web object can be found. "By using the hash to directly compute the location, we can get rid of the index entirely," Pai says.

Story continues below


To be sure, some RAM is still needed, but only enough to run the hash function and to actually retrieve a specific Web object, Pai says. Though still at a very early stage of development, HashCache is being field-tested at the Kokrobitey Institute in Ghana and Obafemi Awolowo University in Nigeria.

Comments

Technology Review Magazine

The 10 Emerging Technologies of 2009
Technology Review presents its annual list of 10 technologies that could change the way we live.

FEATURES

A Zero-Emissions City in the Desert
Oil-rich Abu Dhabi is building a green metropolis. Should the rest of the world care?
By Kevin Bullis
But Who's Counting?
No one really knows how many people visit websites. A San Francisco startup and Google are both working to change that.
By Jason Pontin

Read more articles from this Issue

NOTEBOOKS FORWARD ESSAY TO MARKET Q&A PHOTO ESSAY REVIEWS HACK DEMO
Archives MIT News Subscribe Contact

Log In

Forgot your password?     Register »
Advertisement

Videos

Prescription: Networking
Technology Review November/December 2009

Current Issue

Natural Gas Changes the Energy Map
The United States has vast supplies of this cleaner fossil fuel. But how should we use it?
Advertisement
Advertisement
Subscribe to Technology Review's daily e-mail update. Enter your e-mail address

TECHNOLOGY RESOURCES

More Technology News from Forbes

Advertisement
MIT Massachusetts Institute of Technology © 2009 Technology Review. All Rights Reserved.