The Research of Automation of the Process of Indexing Tax Returns

Abstract

The article is devoted to the study of the automated search for information on tax declarations of different countries in public sources of various structures and the collection of information received in a single information storage. The first part of the paper describes methods of automated data collection and tasks that can be solved by these methods. The second part of the work describes the development of an algorithm for finding data on tax declarations from various sources and creating a prototype system that implements the data of the algorithm and provides access to the collected data.

 

Keywords: search systems, indexation of tax declarations, information retrieval system.

References
[1] Patil, Yugandhara и Patil, Sonal. www.ijarcce.com. [Internet] 16 January 2016 http: //www.ijarcce.com/upload/2016/january-16/IJARCCE%2052.pdf


[2] Cambridge University Press. [Internet] 1 April 2009 https://nlp.stanford.edu/IRbook/pdf/20crawl.pdf


[3] Sushitha, S, etc. Patents and Publications Web Scraping. International Journal of Computer Science and Network. 2016, V. 5, 2.


[4] Scala Scraper. GitHub.com. [Internet] 2016 https://github.com/ruippeixotog/scalascraper.