WebHarvy 4.1.5.141 released

The main changes in this release are: Pagination via JavaScript (see https://www.webharvy.com/tour3.html#JS). This powerful feature is the main highlight of this release. When all other pagination methods fail, you can use this method to directly provide JavaScript code which, when run, loads the next page. Increased size of

Read More

Scraping high resolution images from pinterest.com

In this blog post, we will take a look at how to scrape images from www.pinterest.com in their full sizes. We follow a two-stage extraction process to capture the high-res images from pinterest.com. In the first extraction stage, we capture the image URLs which are present in the listings page. These URLs actually point to smaller sized

Read More

WebHarvy 4.0.3.129 (Installer Update Only)

This update addresses problems installing .NET 4.5 on Windows 7 (and earlier Windows versions where .NET 4.5 is not present) during the installation process. Only the installer has been updated in this release; the WebHarvy application files are unchanged from the immediately preceding version. So in case you are already running 4.0.3.128 you can

Read More

Windows SmartScreen warning while installing WebHarvy

All WebHarvy application files and the installation package are digitally signed (Comodo RSA Code Signing CA) and secured. However, if you get the following SmartScreen warning while trying to install the latest version of WebHarvy, please click the ‘More info’ link and then click the ‘Run anyway’ button to proceed with the installation. The above

Read More

WebHarvy 4.0.3.128 (Minor Update)

From this release onwards, WebHarvy targets (depends on) .NET 4.5, which comes pre-installed on the latest Windows editions. This results in a smoother installation process, doing away with the .NET 3.5 download and install that was previously required. Targeting .NET 4.5 also helps WebHarvy improve performance and resource usage, and helps solve issues related to crashes while

Read More

WebHarvy crashes after installing the latest Windows update for Adobe Flash

Microsoft released a new security update for Adobe Flash Player for Internet Explorer (IE) a few days ago (Dec 29, 2015). This update has caused many software titles (including Skype, see Skype Crash) to crash. See http://borncity.com/win/2015/12/30/windows-10-flash-update-kb3132372-issues/ for a list of other software titles affected by this update. InfoWorld article: Win10 Flash patch KB 3132372

Read More

WebHarvy version 3.4 released!

We’ve just released a new WebHarvy update. The following are the changes in this version. Major: Support for pagination where a link/button has to be clicked to load the next set of pages (More Info). URL-based pagination: automatically increment a numeral in the start page URL to load subsequent pages (More Info). One-click multiple image extraction

Read More
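
To illustrate the idea behind URL-based pagination, here is a minimal Python sketch of incrementing a numeral in a start page URL. This is not WebHarvy's own code: the function name `paginate_urls`, its parameters, and the choice to increment the last number found in the URL are all assumptions made for this example.

```python
import re


def paginate_urls(start_url: str, pages: int, step: int = 1) -> list[str]:
    """Build page URLs by incrementing the last numeral in start_url.

    Hypothetical helper for illustration only. Picking the LAST number
    in the URL is an assumption; a real tool may let you choose which one.
    """
    matches = list(re.finditer(r"\d+", start_url))
    if not matches:
        raise ValueError("start URL contains no numeral to increment")

    last = matches[-1]
    first_value = int(last.group())
    width = len(last.group())  # keep zero padding, e.g. 09 -> 10

    urls = []
    for i in range(pages):
        number = str(first_value + i * step).zfill(width)
        urls.append(start_url[:last.start()] + number + start_url[last.end():])
    return urls
```

For example, calling `paginate_urls("https://example.com/list?page=1", 3)` yields the start URL followed by the same URL ending in `page=2` and `page=3`. The `step` parameter covers sites that use an offset (such as 0, 20, 40) instead of a page number.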

Scraping hidden details using WebHarvy

WebHarvy allows you to scrape hidden fields on websites that are displayed only when you click a link or button. The ‘Click’ option in the Capture window can be used to display such ‘click to display’ fields. The following video shows the process. The video below shows how contact details from Craigslist listing pages can

Read More