Knowledge Agora



Scientific Article details

Title A Semi-Automatic Data-Scraping Method for the Public Transport Domain
ID_Doc 44738
Authors Vela, B; Cavero, JM; Cáceres, P; Cuesta, CE
Title A Semi-Automatic Data-Scraping Method for the Public Transport Domain
Year 2019
Published
DOI 10.1109/ACCESS.2019.2932197
Abstract The growing amount of data on the Internet has led to a situation in which it is essential to process these data to generate new services with the specific aim of improving people's daily living conditions. Transport data is of the utmost importance, since everyday people have to move around to perform some daily tasks, such as going to work, studying and shopping, and this means that the number of journeys by public transport grows daily. People with special needs make a large number of these trips, but they do not have sufficient information about the accessibility of the routes they want to take. Although there are numerous websites and applications that provide information on public transport services, most do not provide detailed information on the accessibility of the routes. We are, therefore, developing a technological framework for the processing, management, and exploitation of open data to promote accessibility to urban public transport. This is taking place within the framework of the Access@City project. This paper specifically focuses on the data extraction and processing of the existing information on the web concerning public transport and its accessibility for the generation of an open data repository in which to store this information. We, therefore, propose a method for the semi-automatic generation of a data scraper for the public transport domain. This method allows the extraction of public transport data and the existing accessibility information from a selected website. We have additionally developed a web tool that employs the aforementioned method to generate a data scraper for the public transport domain.
Author Keywords Accessibility; code generation; open data; public transport; smart city; web scraping
Index Keywords Index Keywords
Document Type Other
Open Access Open Access
Source Science Citation Index Expanded (SCI-EXPANDED)
EID WOS:000481972100020
WoS Category Computer Science, Information Systems; Engineering, Electrical & Electronic; Telecommunications
Research Area Computer Science; Engineering; Telecommunications
PDF https://ieeexplore.ieee.org/ielx7/6287639/8600701/08782469.pdf
Similar atricles
Scroll