We have significantly updated the user interface of WebHarvy in the latest version, available on our website. The following video explains how the features and options are laid out in the new UI. Existing users of older versions will find this video useful, since it shows where to look for specific features and
Changes in 5.2 are mainly related to the user interface and user experience. The most visible change is the introduction of the ribbon menu system, which provides easy access to most software features. In addition to the main interface, other windows such as Scheduler and Export have also been updated. The export functionality (to file or database) has
When extracting data from details pages (pages reached by following a link from the start page), we recommend using the ‘Capture Following Text’ option whenever possible to scrape data correctly and consistently. This is because the layout and the amount of data displayed in details pages may not be consistent. For example,
The ‘category scraping’ feature of WebHarvy allows you to scrape a list of links that lead to similarly formatted pages within a website, using a single configuration. This helps you scrape data from sections and subsections listed under the main page of a website. Please follow this link to learn more about Category Scraping.
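The idea behind category scraping can be sketched in a few lines of Python. This is only an illustration of applying one extraction rule to several similarly formatted pages, not how WebHarvy implements the feature; the in-memory `PAGES` site, the `extract_items` rule, and the `scrape_categories` helper are all hypothetical names invented for this sketch.

```python
import re
from typing import Callable, Dict, List

# Hypothetical in-memory "website": category link -> page HTML.
# A real scraper would download these pages instead.
PAGES: Dict[str, str] = {
    "/books": "<h2 class='item'>Book A</h2><h2 class='item'>Book B</h2>",
    "/music": "<h2 class='item'>Album X</h2>",
}


def extract_items(html: str) -> List[str]:
    """The single 'configuration': one rule reused for every category page."""
    return re.findall(r"<h2 class='item'>(.*?)</h2>", html)


def scrape_categories(
    links: List[str], fetch: Callable[[str], str]
) -> Dict[str, List[str]]:
    """Apply the same extraction rule to each category link in turn."""
    return {link: extract_items(fetch(link)) for link in links}


# Assumed stand-in for a page download: look the link up in PAGES.
results = scrape_categories(["/books", "/music"], PAGES.__getitem__)
```

Because the pages share a layout, the extraction rule is written once and the list of category links is the only thing that varies.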
The latest update of WebHarvy (version 220.127.116.11) has gone live and is available for download at www.webharvy.com/download.html. Changes: [New Feature] Keyword-based Scraping: Allows you to run the same configuration for a set of input keywords (Read more: http://www.webharvy.com/tour71.html). Edit Configuration: Allows you to edit an already saved WebHarvy configuration XML file
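As a rough illustration of keyword-based scraping, the sketch below expands one URL template into a start URL per input keyword, so the same scraping logic could then run against each. This is an assumption about the general technique, not WebHarvy's implementation; the `keyword_urls` helper, the `{keyword}` placeholder, and the `example.com` search URL are hypothetical.

```python
from typing import List
from urllib.parse import quote_plus


def keyword_urls(template: str, keywords: List[str]) -> List[str]:
    """Build one start URL per keyword from a shared template.

    The template is assumed to contain a '{keyword}' placeholder; each
    keyword is URL-encoded before substitution.
    """
    return [template.format(keyword=quote_plus(k)) for k in keywords]


# Hypothetical search URL and keyword list, for illustration only.
urls = keyword_urls(
    "https://example.com/search?q={keyword}",
    ["web scraper", "data mining"],
)
```

Each resulting URL would be handed to the same extraction configuration, which is what lets one setup serve a whole list of keywords.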
We have released a new version of WebHarvy Web Scraper (version 18.104.22.168). The new features in this release are: Support for exporting scraped data to a database. Support for web scraping via proxy servers. Multi-level page scraping. Scrape sections, subsections or categories within websites. Pause / Resume mining operation. Status updates while mining. Automatically