
 How to Build a 100-Million-Image Database

This story is from the category Libraries and Components



Date posted: 03/06/2009

We take some 80 billion photographs each year, which would require around 400 petabytes to store if they were all saved. Finding your cherished shot of Aunt Marjory's 80th birthday party among that lot is going to take a special kind of search algorithm. And of course, various groups are working on just how to solve this problem.

But if you want to build the next generation of image search algorithms, you need a database on which to test them, say Andrea Esuli and pals at the Institute of Information Science and Technologies in Pisa, Italy. And they have one: a database of 100 million high-quality digital images taken from Flickr. For each image they have extracted five descriptive features, such as colour, shape, and texture, as defined by the MPEG-7 image standard.

That's no mean feat. Esuli and co point out that such an image database would normally require the download and processing of up to 50 TB of data, something that would take about 12 years on a standard PC and about 2 years using a high-end multi-core PC. Instead, they simply decided to crawl the Flickr site, where the pictures are already stored, taking what data they need as descriptors. Their paper describes the trials and tribulations of building such a database.
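To put the quoted figures in perspective, here is a rough back-of-envelope sketch of what they imply per image. It is not taken from the paper: it assumes decimal units (1 TB = 10^12 bytes, 1 PB = 10^15 bytes), a 365-day year, and treats "up to 50 TB" as the full total, so the results are upper-bound estimates. The function and variable names are illustrative only.

```python
# Back-of-envelope check of the figures quoted in the article.
# Assumptions (not from the source): decimal units and a 365-day year.
TB = 10**12
PB = 10**15
SECONDS_PER_YEAR = 365 * 24 * 3600


def bytes_per_item(total_bytes, items):
    """Average size of one item when total_bytes is spread over items."""
    return total_bytes / items


def seconds_per_item(years, items):
    """Average processing time per item if the whole job takes `years`."""
    return years * SECONDS_PER_YEAR / items


images = 100_000_000            # size of the Flickr-derived collection
photos_per_year = 80_000_000_000  # photographs taken worldwide each year

# "up to 50 TB" for 100 million images
avg_image_bytes = bytes_per_item(50 * TB, images)
# "about 12 years on a standard PC", "about 2 years" on a high-end PC
standard_pc_seconds = seconds_per_item(12, images)
high_end_pc_seconds = seconds_per_item(2, images)
# "around 400 petabytes" for 80 billion photographs
avg_photo_bytes = bytes_per_item(400 * PB, photos_per_year)

print(f"Average image size in the collection: {avg_image_bytes / 1e3:.0f} kB")
print(f"Standard PC: {standard_pc_seconds:.2f} s per image")
print(f"High-end PC: {high_end_pc_seconds:.2f} s per image")
print(f"Average size of a photo taken each year: {avg_photo_bytes / 1e6:.0f} MB")
```

Under these assumptions the collection works out to at most about 500 kB per image and a few seconds of download and processing per image on a standard PC, which is why the total adds up to years once it is multiplied by 100 million.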

See the full story via external site: www.technologyreview.com
