Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.13087/212
Title: ON THE USEFULNESS OF HTML META ELEMENTS FOR WEB RETRIEVAL
Authors: Arslan, Ahmet
Issue Date: 2020
Abstract: Web retrieval studies have mostly used URL, title, body, and anchor text fields to represent Web documents. On the other hand, HTML standards provide a rich set of elements to define different parts of a Web page. For example, meta elements are used to provide structured metadata about a Web page not to end users, but instead to browsers or crawlers. However, it is unclear whether meta tags are or are not useful for Web retrieval, as most of the previous studies leveraged URL, title, body, and anchor text fields. In this work, we examine the usefulness of two meta tags, namely keywords and description, based on ad-hoc tasks of previous TREC studies. Through experiments on the standard TREC Web datasets and several query sets, our results using the state-of-the-art term-weighting models show that the utilization of description field systematically increases the retrieval effectiveness, to a statistically significant degree most of the time. By contrast, the employment of keywords field may cause a significant deterioration in retrieval effectiveness for certain term-weighting models.
URI: https://doi.org/10.18038/estubtda.615103
https://app.trdizin.gov.tr/makale/TkRNME1ERTFOUT09
https://hdl.handle.net/20.500.13087/212
ISSN: 1302-3160
2667-4211
Appears in Collections:Matematik Bölümü Koleksiyonu
TR-Dizin İndeksli Yayınlar Koleksiyonu

Show full item record

CORE Recommender

Page view(s)

108
checked on Oct 3, 2022

Google ScholarTM

Check

Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.