Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.13087/212
Title: | ON THE USEFULNESS OF HTML META ELEMENTS FOR WEB RETRIEVAL | Authors: | Arslan, Ahmet | Issue Date: | 2020 | Abstract: | Web retrieval studies have mostly used URL, title, body, and anchor text fields to represent Web documents. On the other hand, HTML standards provide a rich set of elements to define different parts of a Web page. For example, meta elements are used to provide structured metadata about a Web page not to end users, but instead to browsers or crawlers. However, it is unclear whether meta tags are or are not useful for Web retrieval, as most of the previous studies leveraged URL, title, body, and anchor text fields. In this work, we examine the usefulness of two meta tags, namely keywords and description, based on ad-hoc tasks of previous TREC studies. Through experiments on the standard TREC Web datasets and several query sets, our results using the state-of-the-art term-weighting models show that the utilization of description field systematically increases the retrieval effectiveness, to a statistically significant degree most of the time. By contrast, the employment of keywords field may cause a significant deterioration in retrieval effectiveness for certain term-weighting models. | URI: | https://doi.org/10.18038/estubtda.615103 https://app.trdizin.gov.tr/makale/TkRNME1ERTFOUT09 https://hdl.handle.net/20.500.13087/212 |
ISSN: | 1302-3160 2667-4211 |
Appears in Collections: | Matematik Bölümü Koleksiyonu TR-Dizin İndeksli Yayınlar Koleksiyonu |
Show full item record
CORE Recommender
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.