banner webdataminer

Knowlesys Web Data Miner Studio

2020-08-01 Knowlesys Web Data Miner Studio 8.0 Release, added dozens of features and runs much faster. Support extracting 5 millions of articles a day! please contact us to get the new version.

wdm extract

The Web is the largest database of public resources in the world. At present, there are at least 100 million websites with over 80 billion webpages. The number of webpages increases dramatically every single second. You can explore lots of valuable information in these webpages, including the list and contact information of potential customers, price list of competing products, real-time financial news, public opinions information, word-out-mouth information, supply and demand, scientific periodicals, forum posts, blogs and articles, and latest news. The key information, however, exists in the massive HTML webpages of websites in the form of semi-structures. As a result, the information can hardly be gathered and directly utilized.

The Web Data Miner Studio easily addresses this problem. Its major function is to accurately extract the semi-structured data on the target Internet webpages as structured records in batches, and save them to the local database for further usage purposes. The console in the following figure shows the usage procedure of the system.

The system features are as follows:

Websites: support the mining of any data on any webpages of any websites.

Text formats: support the mining from local files, including, HTML, JSON, XML, text, CSV, RTF, Word, and PDF files.

Databases: support all mainstream databases, including Oracle, DB2, MS SQL Server, Sybase, MySQL, PostgreSQL, Interbase, and MS Access.


The Web Data Miner Studio is applied to the fields of public opinions monitoring, network word-of-mouth monitoring, price monitoring and comparison, news mining on portal websites, industry news mining, extraction of competitive intelligence for companies, internal and external news systems, database marketing, periodical mining at digital libraries, scientific research data mining, and integration of remaining information systems.

The Web Data Miner Studio assists you to easily integrate the world's mass information and brings you with huge business values.

Functions of Different Editions

Function

Standard edition

Professional edition

Enterprise edition

Microblog website extraction

icn ok red
icn ok red
icn ok red

BBS extraction

icn ok red
icn ok red
icn ok red

Blog website extraction

icn ok red
icn ok red
icn ok red

News website extraction

icn ok red
icn ok red
icn ok red

Text file extraction

icn ok red
icn ok red
icn ok red

RSS/XML extraction

icn ok red
icn ok red
icn ok red

Image website extraction

icn ok red
icn ok red
icn ok red

Video website extraction

icn ok red
icn ok red
icn ok red

Image website extraction

icn ok red
icn ok red
icn ok red

scheduled execution

icn ok red
icn ok red
icn ok red

static URL list extraction

icn ok red
icn ok red
icn ok red

dynamic URL list extraction

icn ok red
icn ok red
icn ok red

web page screenshot

 
icn ok red
icn ok red

direct POST search and extraction

 
icn ok red
icn ok red

online database website extraction

icn ok red

ordinary Windows window program extraction

   
icn ok red

simulate form completion for query and extraction

 
 
icn ok red

advanced data processing

   
icn ok red

Multi-language information extraction

   
icn ok red

maximum number of tables

10
10
infiniteicn ok red

maximum number of fields

60
100
infiniteicn ok red

maximum lines of data transform script

100
200
infiniteicn ok red

maximum records extracted successively

100,000
500,000
infiniteicn ok red

times of use

infiniteicn ok red
infiniteicn ok red
infiniteicn ok red

number of websites

infiniteicn ok red
infiniteicn ok red
infiniteicn ok red