Version | |
Download | 12 |
Total Views | 65 |
Stock | ∞ |
File Size | 837.90 KB |
File Type | |
Create Date | September 17, 2018 |
Last Updated | September 17, 2018 |
Towards Data Extraction of Dynamic Content from JavaScript Web Applications
Abstract—: An enormous data in World Wide Web and social media has open opportunities for business and organization to get the significant value that leads to efficient operations. As a result, Web Data Extraction has become an important tool for gathering and translating semi-structured documents into valuable information. However, one of the major challenges is dealing with changes from Web documents, especially emerging of JavaScript Web development technology that has significantly affected the way to embed and rendering data of Web pages.
In this paper, we propose a design and implementation of a new Web Data Extraction system that aims for extracting data from JavaScript Web applications. The proposed system enables users to select valuable data from online Web documents by defining data extraction rules and data transformation patterns. The extraction engine automatically scrapes and transforms semi-structure data into relational data. The preliminary evaluation results showed
that our proposed system has successfully extract data from modern JavaScript Web applications.
Get IEEE 2018 Project Topics List