Advanced metasearch engine technology /

Among the search tools currently on the Web, search engines are the most well known thanks to the popularity of major search engines such as Google and Yahoo! While extremely successful, these major search engines do have serious limitations. This book introduces large-scale metasearch engine techno...

Full description

Saved in:
Bibliographic Details
Author / Creator:Meng, Weiyi.
Imprint:San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool, c2011.
Description:1 electronic text (x, 117 p.) : ill., digital file.
Language:English
Series:Synthesis lectures on data management, 2153-5426 ; # 11
Synthesis digital library of engineering and computer science.
Synthesis lectures on data management, # 11.
Subject:Web search engines -- Mathematical models.
Federated searching -- Mathematical models.
Format: E-Resource Book
URL for this record:http://pi.lib.uchicago.edu/1001/cat/bib/10510963
Hidden Bibliographic Details
Other authors / contributors:Yu, Clement T.
ISBN:9781608451937 (electronic bk.)
9781608451920 (pbk.)
Notes:Part of: Synthesis digital library of engineering and computer science.
Series from website.
Includes bibliographical references (p. 107-115).
Abstract freely available; full-text restricted to subscribers or individual document purchasers.
Compendex
INSPEC
Google scholar
Google book search
Also available in print.
Mode of access: World Wide Web.
System requirements: Adobe Acrobat Reader.
Summary:Among the search tools currently on the Web, search engines are the most well known thanks to the popularity of major search engines such as Google and Yahoo! While extremely successful, these major search engines do have serious limitations. This book introduces large-scale metasearch engine technology, which has the potential to overcome the limitations of the major search engines. Essentially, a metasearch engine is a search system that supports unified access to multiple existing search engines by passing the queries it receives to its component search engines and aggregating the returned results into a single ranked list. A large-scale metasearch engine has thousands or more component search engines. While metasearch engines were initially motivated by their ability to combine the search coverage of multiple search engines, there are also other benefits such as the potential to obtain better and fresher results and to reach the DeepWeb. The following major components of large-scale metasearch engines will be discussed in detail in this book: search engine selection, search engine incorporation,and result merging. Highly scalable and automated solutions for these components are emphasized. The authors make a strong case for the viability of the large-scale metasearch engine technology as a competitive technology for Web search.
Standard no.:10.2200/S00307ED1V01Y201011DTM011
LEADER 06031nam a2200745 a 4500
001 10510963
005 20101216133747.0
006 m e d
007 cr cn |||m|||a
008 101209s2011 caua foab 000 0 eng d
003 ICU
020 |a 9781608451937 (electronic bk.) 
020 |z 9781608451920 (pbk.) 
024 7 |a 10.2200/S00307ED1V01Y201011DTM011  |2 doi 
035 |a MC201011DTM011 
035 |a (CaBNvSL)gtp00545393 
040 |a CaBNvSL  |c CaBNvSL  |d CaBNvSL 
050 4 |a TK5105.884  |b .M452 2011 
082 0 4 |a 025.04  |2 22 
100 1 |a Meng, Weiyi.  |0 http://id.loc.gov/authorities/names/n97087489  |1 http://viaf.org/viaf/28818597 
245 1 0 |a Advanced metasearch engine technology /  |c Weiyi Meng, Clement T. Yu. 
260 |a San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) :  |b Morgan & Claypool,  |c c2011. 
300 |a 1 electronic text (x, 117 p.) :  |b ill., digital file. 
336 |a text  |b txt  |2 rdacontent  |0 http://id.loc.gov/vocabulary/contentTypes/txt 
337 |a computer  |b c  |2 rdamedia  |0 http://id.loc.gov/vocabulary/mediaTypes/c 
338 |a online resource  |b cr  |2 rdacarrier  |0 http://id.loc.gov/vocabulary/carriers/cr 
490 1 |a Synthesis lectures on data management,  |x 2153-5426 ;  |v # 11 
500 |a Part of: Synthesis digital library of engineering and computer science. 
500 |a Series from website. 
504 |a Includes bibliographical references (p. 107-115). 
505 0 |a 1. Introduction -- Finding information on the web -- Browsing -- Searching -- A brief overview of text retrieval -- System architecture -- Document representation -- Document-query matching -- Query evaluation -- Retrieval effectiveness measures -- A brief overview of search engine technology -- Special characteristics of the web -- Web crawler -- Utilizing tag information -- Utilizing link information -- Result organization -- Book overview --  
505 8 |a 2. Metasearch engine architecture -- System architecture -- Why metasearch engine technology -- Challenging environment -- Heterogeneities and their impact -- Standardization efforts --  
505 8 |a 3. Search engine selection -- Rough representative approaches -- Learning-based approaches -- Sample document-based approaches -- Statistical representative approaches -- D-WISE -- CORI Net -- gGLOSS -- Number of potentially useful documents -- Similarity of the most similar document -- Search engine representative generation --  
505 8 |a 4. Search engine incorporation -- Search engine connection -- HTML form tag for search engines -- Automatic search engine connection -- Search result extraction -- Semiautomatic wrapper generation -- Automatic wrapper generation --  
505 8 |a 5. Result merging -- Merging based on full document content -- Merging based on search result records -- Merging based on local ranks of results -- Round-robin based methods -- Similarity conversion based methods -- Voting based methods -- Machine learning based method --  
505 8 |a 6. Summary and future research -- Bibliography -- Authors' biographies. 
506 |a Abstract freely available; full-text restricted to subscribers or individual document purchasers. 
510 0 |a Compendex 
510 0 |a INSPEC 
510 0 |a Google scholar 
510 0 |a Google book search 
520 3 |a Among the search tools currently on the Web, search engines are the most well known thanks to the popularity of major search engines such as Google and Yahoo! While extremely successful, these major search engines do have serious limitations. This book introduces large-scale metasearch engine technology, which has the potential to overcome the limitations of the major search engines. Essentially, a metasearch engine is a search system that supports unified access to multiple existing search engines by passing the queries it receives to its component search engines and aggregating the returned results into a single ranked list. A large-scale metasearch engine has thousands or more component search engines. While metasearch engines were initially motivated by their ability to combine the search coverage of multiple search engines, there are also other benefits such as the potential to obtain better and fresher results and to reach the DeepWeb. The following major components of large-scale metasearch engines will be discussed in detail in this book: search engine selection, search engine incorporation,and result merging. Highly scalable and automated solutions for these components are emphasized. The authors make a strong case for the viability of the large-scale metasearch engine technology as a competitive technology for Web search. 
530 |a Also available in print. 
538 |a Mode of access: World Wide Web. 
538 |a System requirements: Adobe Acrobat Reader. 
650 0 |a Web search engines  |x Mathematical models. 
650 0 |a Federated searching  |x Mathematical models. 
653 |a Metasearch engine 
653 |a Large-scale metasearch engine 
653 |a Search broker 
653 |a Distributed information retrieval 
653 |a Federated search system 
653 |a Search engine selection 
653 |a Database selection 
653 |a Search result extraction 
653 |a Wrapper generation 
653 |a Result merging 
653 |a Collection fusion 
700 1 |a Yu, Clement T.  |1 http://viaf.org/viaf/77873409 
830 0 |a Synthesis digital library of engineering and computer science.  |0 http://id.loc.gov/authorities/names/n2016188085 
830 0 |a Synthesis lectures on data management,  |x 2153-5426 ;  |v # 11.  |0 http://id.loc.gov/authorities/names/no2010037814 
856 4 0 |u http://dx.doi.org/10.2200/S00307ED1V01Y201011DTM011  |y Morgan & Claypool 
903 |a HeVa 
999 f f |i b57f8393-8449-5537-9ec0-312383abd3c0  |s 80f6dbe8-1cce-5fee-98b6-d4094a9505df 
928 |t Library of Congress classification  |a TK5105.884.M452 2011  |l Online  |c UC-FullText  |u http://dx.doi.org/10.2200/S00307ED1V01Y201011DTM011  |z Morgan & Claypool  |g ebooks  |i 8689957