Query Selection in Deep Web Crawling

Query Selection in Deep Web Crawling

Yan Wang

80,42 €
IVA incluido
Disponible
Editorial:
KS OmniScriptum Publishing
Año de edición:
2014
Materia
Internet: obras generales
ISBN:
9783639712452
80,42 €
IVA incluido
Disponible
Añadir a favoritos

The deep web is the content that is dynamically generated from data sources such as databases or file system. Unlike surface web where web pages are collected by following the hyperlinks embedded inside collected pages, data from a deep web data source is guarded by a search interface and only can be retrieved by queries. The amount of data in deep web exceeds by far that of the surface web. This calls for deep web crawlers to excavate the data so that they can be used, indexed, and searched upon in an integrated environment. Crawling deep web is the process of collecting data from search interfaces by issuing queries. One of the major challenges in crawling deep web is the selection of the queries so that most of the data can be retrieved at a low cost. This work first comprehensively introduces the state-of-art work in query selection techniques for crawling, then in-depth analyzes the remaining problems, such as cold start problem and return limit problem, and finally presents a novel technique to address them.

Artículos relacionados

  • Web Portals
    Arthur Tatnall
    ...
    Disponible

    118,78 €

  • Organizational Communication and Sustainable Development
    Although social, economical, and environmental sustainability has become increasingly important in this era of globalization, little effort has been put forth to investigate the social and cultural impact. Organizational Communication and Sustainable Development: ICTs for Mobility explores how mobility meets sustainability in contemporary organizational communication. A compend...
    Disponible

    236,32 €

  • Cases on Global E-Learning Practices
    Remesh C. Sharma / Remesh CSharma
    ...
    Disponible

    118,77 €

  • Etransformation in Governance
    ...
    Disponible

    105,58 €

  • Web Services Security Development and Architecture
    CARLOS GUTIÉRREZ
    Despite solid advances, numerous challenges have yet to be resolved by Web services-enabled service-oriented architecture systems. Web Services Security Development and Architecture: Theoretical and Practical Issues explores a global approach to methodical development in constructing safety architectures for online systems. Addressing security concerns during the full developme...
    Disponible

    236,42 €

  • Internet Strategy
    Matthew W. Guah / Matthew WGuah / Wendy L. Currie / Wendy LCurrie
    ...
    Disponible

    118,66 €

Otros libros del autor