|Google, Kosmix and the Exploration of the Deep Web
Welcome to the Deep Web, the portion of the Internet not accessible to traditional search engines. Social networks, media-sharing sites for photos and videos, library catalogs, airline reservation systems, phone books, and all kinds of scientific databases lurk inside the Web, practically invisible today’s search tools. The volume of this hidden content is enormous: some estimates have pegged the size of the Deep Web at up to 500 times larger than the slice of the Web we see on search engines today.Ironically, the Deep Web hides some of the richest content on the Internet. The Web 2.0 revolution has enabled an explosion of sites dedicated to user-generated content, including Wikipedia, YouTube, Flickr, TripAdvisor and Yelp. Content on these sites grows so rapidly that it’s nearly impossible for Web crawlers to keep up. Social media sites like Facebook, MySpace, and Twitter pose the same challenge, and often add privacy protections that permit only “friends” to view certain information.
In this session, Kosmix Co-Founder Dr. Anand Rajaraman and Dr. Alon Halevy of Google Labs will explore the mysteries of the Deep Web. Prominent experts in this field, Rajaraman and Halevy will examine how the Deep Web will change the business of Search, offer insight into what content creators should expect as the Deep Web becomes more accessible, and comment on what this shift might mean for the end user. They will also address the question of whether the Deep Web will someday be fully exposed, and if not, why not.
Date: November 12, 2009
The now well established Search SIG Agenda format will be used: