Knowlesys Web Data Miner Studio
2020-08-01 Knowlesys Web Data Miner Studio 8.0 Release, added dozens of features and runs much faster. Support extracting 5 millions of articles a day! please contact us to get the new version.
The Web is the largest database of public resources in the world. At present, there are at least 100 million websites with over 80 billion webpages. The number of webpages increases dramatically every single second. You can explore lots of valuable information in these webpages, including the list and contact information of potential customers, price list of competing products, real-time financial news, public opinions information, word-out-mouth information, supply and demand, scientific periodicals, forum posts, blogs and articles, and latest news. The key information, however, exists in the massive HTML webpages of websites in the form of semi-structures. As a result, the information can hardly be gathered and directly utilized.
The Web Data Miner Studio easily addresses this problem. Its major function is to accurately extract the semi-structured data on the target Internet webpages as structured records in batches, and save them to the local database for further usage purposes. The console in the following figure shows the usage procedure of the system.
The system features are as follows:
Websites: support the mining of any data on any webpages of any websites.
Text formats: support the mining from local files, including, HTML, JSON, XML, text, CSV, RTF, Word, and PDF files.
Databases: support all mainstream databases, including Oracle, DB2, MS SQL Server, Sybase, MySQL, PostgreSQL, Interbase, and MS Access.
The Web Data Miner Studio is applied to the fields of public opinions monitoring, network word-of-mouth monitoring, price monitoring and comparison, news mining on portal websites, industry news mining, extraction of competitive intelligence for companies, internal and external news systems, database marketing, periodical mining at digital libraries, scientific research data mining, and integration of remaining information systems.
The Web Data Miner Studio assists you to easily integrate the world's mass information and brings you with huge business values.
Functions of Different Editions
Function |
Standard edition |
Professional edition |
Enterprise edition |
Microblog website extraction |
|||
BBS extraction |
|||
Blog website extraction |
|||
News website extraction |
|||
Text file extraction |
|||
RSS/XML extraction |
|||
Image website extraction |
|||
Video website extraction |
|||
Image website extraction |
|||
scheduled execution |
|||
static URL list extraction |
|||
dynamic URL list extraction |
|||
web page screenshot |
|||
direct POST search and extraction |
|||
online database website extraction |
|||
ordinary Windows window program extraction |
|||
simulate form completion for query and extraction |
|||
advanced data processing |
|||
Multi-language information extraction |
|||
maximum number of tables |
10 |
10 |
infinite |
maximum number of fields |
60 |
100 |
infinite |
maximum lines of data transform script |
100 |
200 |
infinite |
maximum records extracted successively |
100,000 |
500,000 |
infinite |
times of use |
infinite |
infinite |
infinite |
number of websites |
infinite |
infinite |
infinite |