Proceedings of the IV International research conference "Information technologies in Science, Management, Social sphere and Medicine" (ITSMSSM 2017)

Parsing of Data on Real Estate Objects from Network Resource

Authors
Vyacheslav Cherkesov, Vitaliy Malikov, Alexey Golubev, Danila Parygin, Tatiana Smykovskaya
Corresponding Author
Vyacheslav Cherkesov
Available Online December 2017.
DOI
https://doi.org/10.2991/itsmssm-17.2017.80How to use a DOI?
Keywords
real estate object, network resource, parsing, data collection, BeautifulSoup, Scrapy
Abstract
Existing approaches for collecting data from sites on the Internet were considered. A comparative analysis of the solution based on the BeautifulSoup library and the Scrapy framework for parsing the content of network resources was made. Sources of information about real estate objects were analyzed. The method for parsing data on real estate objects was developed based on the results of the conducted studies. In addition, the main problems with the use of parsing technology were identified.
Open Access
This is an open access article distributed under the CC BY-NC license.

Download article (PDF)

Cite this article

TY  - CONF
AU  - Vyacheslav Cherkesov
AU  - Vitaliy Malikov
AU  - Alexey Golubev
AU  - Danila Parygin
AU  - Tatiana Smykovskaya
PY  - 2017/12
DA  - 2017/12
TI  - Parsing of Data on Real Estate Objects from Network Resource
BT  - IV International research conference "Information technologies in Science, Management, Social sphere and Medicine" (ITSMSSM 2017)
PB  - Atlantis Press
SN  - 2352-538X
UR  - https://doi.org/10.2991/itsmssm-17.2017.80
DO  - https://doi.org/10.2991/itsmssm-17.2017.80
ID  - Cherkesov2017/12
ER  -