Web Scraping: Collecting and Retrieving Data from the Web

Roman Egger, Markus Kroner, Andreas Stöckl

Publikation: Beitrag in Buch/Bericht/TagungsbandKapitel

19 Zitate (Scopus)

Abstract

In this chapter, a number of tools for crawling websites are presented, and an example using hotel ratings has been adopted in order to specifically show how these can be extracted from a rating platform. For this purpose, Python with the library ``BeautifulSoup'' is used. Other program packages include Scrapy and Selenium, with which more complex applications can be realized. In addition to the technical aspects of web scraping, the legal framework of this process will also be discussed.
OriginalspracheEnglisch
TitelApplied Data Science in Tourism: Interdisciplinary Approaches, Methodologies, and Applications
Redakteure/-innenRoman Egger
ErscheinungsortCham
Herausgeber (Verlag)Springer
Seiten67-82
Seitenumfang16
ISBN (Print)978-3-030-88389-8
DOIs
PublikationsstatusVeröffentlicht - Jän. 2022

Publikationsreihe

NameTourism on the Verge
BandPart F1051
ISSN (Print)2366-2611
ISSN (elektronisch)2366-262X

Schlagwörter

  • Web crawling
  • Website parsing
  • Web scraping
  • Open Data
  • BeautifulSoup

Zitieren