
Clustering and Ranking for Web Information Retrieval
Methodologies for Searching the Web
Antonio Gulli (Author)
AV Akademikerverlag
Published on 27 June 2012
Book
Paperback/Softback
144 pages
978-3-639-43296-1 (ISBN)
Description
Revision with unchanged content. This book investigates several research problems that arise in modern Web Information Retrieval. First, we consider the many situations in which a flat list of ten search results is not enough, and users might want a larger number of results grouped on the fly into folders of similar topics. We describe Snaket, a hierarchical clustering meta-search engine that personalizes searches according to the clusters users select on the fly. Second, we consider situations in which users want access to fresh information such as news articles, and we present a new ranking algorithm suited to this type of fresh information. Third, we discuss numerical methods for accelerating the ranking algorithms used in Web search. An important achievement of this book is to show how these prominent issues in Web Information Retrieval can be addressed with clustering and ranking methodologies. We demonstrate that clustering and ranking have a mutual reinforcement property that has not yet been studied intensively.
More details
Language
English
Product notice
Paperback (trade)
Unsewn / adhesive bound
Dimensions
Height: 220 mm
Width: 150 mm
Thickness: 10 mm
Weight
233 g
ISBN-13
978-3-639-43296-1 (9783639432961)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Person
Antonio Gulli is the Director of Advanced Search Projects at Ask.com. He holds a degree in Computer Science, a degree in Engineering, and a Ph.D. in Computer Science. His research focuses mainly on Web search, ranking, and clustering. He has served as a program committee (PC) member for many international conferences, including WWW2008, WWW07, WSDM08, and SIGIR07.