Data Collection

Knowlesys Web Data Mining

Web Data Extraction
Web data extraction is a process that digests target Web databases that are visible only as HTML pages and creates a local, identical replica of those databases. This requires much more than a Web crawler and a set of Web site wrappers. A comprehensive data collection process must deal with roadblocks such as session identifiers, HTML forms, and client-side JavaScript, as well as data integration problems such as incompatible datasets and vocabularies and missing or conflicting data.

Web2DB is a web data extraction service that makes this easy. It comes in two forms: the Web2DB data service and the Web2DB custom extractor service. You tell us where you want to search, what you want to get, and how you want it formatted. We do all the work and send the results directly to you. The database format can be Excel, Access, CSV, text, MS SQL, or MySQL. The extractor can also be customized for your targeted website so that you can run i