数据采集

knowlesys web data mining

Web2DB Services
Brief Introduction Submit Request The World Wide Web is a vast and rapidly growing source of information. Most of this information is in the form of unstructured text, making the information hard to query. Web Data Extraction is a process that able to digest target Web databases that are visible only as HTML pages, and create a local, identical replica of those databases as a result. What is needed in this process is much more than a Web crawler and set of Web site wrappers. A comprehensive data extraction process needs to deal with such roadblocks such as session identifiers, HTML forms, and client-side JavaScript, and data integration problems such as incompatible datasets and vocabularies, and missing and conflicting data. Web2DB is a web data extraction service. It make thing easy. It includes two types: Web2DB data service Web2DB custom extractor service. You just tell us where you want to search, what you want to get, and how you want it formatted. We