Back to Results
First PageMeta Content
Web crawler / Searching / Invisible Web / Search engine indexing / Bing / Web search engine / Qi / Distributed web crawling / Google Search / Information science / Information retrieval / Internet search engines


Downloading Textual Hidden Web Content Through Keyword Queries Alexandros Ntoulas Petros Zerfos
Add to Reading List

Document Date: 2013-02-13 17:55:48


Open Document

File Size: 409,20 KB

Share Result on Facebook

City

Denver / Query / Cambridge / /

Company

Internet-Based Information Systems / MIT Press / Amazon Inc. / W. H. Freeman & Co. / New York Times / Google / /

Country

United States / /

Currency

USD / /

/

Facility

Public Medical Library / PubMed Medical Library / /

IndustryTerm

Web site Cumulative fraction / Web today / Web corpus / Web content / genericfrequency algorithms / hidden web databases / Web users / Web site returns / keyword-search interface / hidden-web data / dmoz Web site / search box / web collections / highlevel algorithm / approximation algorithm / online bookstore / Web sources / adaptive algorithm / generic-frequency algorithms / web query interfaces / Web Content Through Keyword Queries Alexandros Ntoulas Petros Zerfos Junghoo Cho UCLA Computer Science UCLA Computer Science UCLA Computer Science ntoulas@cs.ucla.edu pzerfos@cs.ucla.edu cho@cs.ucla.edu ABSTRACT An / generic-frequency algorithm / search form / public web search engines / generic algorithm / multi-attribute search interface / dmoz site / multi-attribute search interfaces / hidden web / average Web user / learning search interfaces / actual Web page / Web interface / search-engine coverage / search engine perspective / search forms / hidden web content / selection algorithm / Web query interface / query selection algorithm / search interface / Page similarity detection algorithms / adaptive algorithms / search engines / Web Search Engines / near-optimal solution / hidden web database selection / Web crawling / Web search / online database / search interfaces / real Web sites / typical Web user / deep web / particular Web page / generic-dmoz site / Web site Procedure / search engine / power law distribution / good candidate algorithm / Web crawler / Web crawlers / particular Web site / /

NaturalFeature

Press/McGraw Hill / /

OperatingSystem

Petros / /

Organization

UCLA / MIT / US Patent Office / DP9-An OAI Gateway Service for Web Crawlers / /

Person

There / Lawrence / Giles / C. Richard C. Luo / /

Position

Hidden-Web database model / author / Software Developer / /

Product

Equation 2 / /

ProgrammingLanguage

XML / /

ProvinceOrState

Massachusetts / Colorado / /

PublishedMedium

New York Times / /

Technology

Page similarity detection algorithms / XML / 3.2 Query selection algorithm / generic algorithm / proposed algorithms / search engine / two algorithms / random-based algorithm / highlevel algorithm / generic-frequency algorithms / aforementioned algorithms / approximation algorithm / random-based algorithms / Database technologies / http / query selection algorithm / generic-frequency algorithm / adaptive algorithm / same algorithm / frequency-based algorithm / genericfrequency algorithms / good candidate algorithm / /

URL

www.press.umich.edu/jep/07-01/bergman.html / http /

SocialTag