WebHarvy 6.2 (Enhanced Proxy Support, Chromium v86, New Browser Setting options)

The following are the changes in this version. Enhanced proxy support In this version we have added support for various types of proxies. Earlier, WebHarvy supported only HTTP proxies. Starting from this version the following proxy types are supported. HTTP HTTPS SOCKS4 SOCKS4a SOCKS5 In the proxy settings window you can select the type of … Read more

Sequentially Scrape Websites : Automation

Often you require to scrape data from multiple websites and might also need to automate the entire process. The following would be your desired workflow. Configure WebHarvy to scrape data from each website. Then start scraping data from each website, one after the other, without any manual intervention. In short, a one-click method to start … Read more

How to scrape Google Jobs? | Scraping job details

WebHarvy can be used to scrape job details from jobs listing websites like Indeed, Google Jobs etc. WebHarvy can automatically pull job details from multiple pages of listings and save them to a file or database. The following video shows how WebHarvy can be configured to scrape data from Google Jobs listings. Details like job … Read more

How to scrape business contact details from Google Maps ?

WebHarvy is a visual web scraper which can be easily configured to scrape data from any website. In this article we will see how WebHarvy can easily scrape business contact details from Google Maps. WebHarvy can scrape contact details (name, address, website, phone etc.) as well as reviews of businesses displayed on Google Maps. The … Read more

How to build a simple web scraper using Puppeteer?

Table of Contents What is Puppeteer? Uses of Puppeteer How to install? How to start a browser instance? How to load a URL? How to navigate/interact with the page? How to take screenshots, save page as PDF? How to select data from page? Headless browser as a service What is Puppeteer? Puppeteer (https://developers.google.com/web/tools/puppeteer) is a … Read more

Announcing an upcoming product : GrabContacts

We are happy to announce news regarding our upcoming product launch, on which we were working during the past one year. GrabContacts is an online service which helps you easily extract contact details (email addresses, phone numbers, social media handles) from websites (URLs) or search queries. Unlike WebHarvy there is no configuration involved, you just … Read more

AliExpress Scraper – Scraping product data including images from AliExpress

WebHarvy is a visual web scraper which can be easily used to scrape data from any website including eCommerce websites like Amazon, eBay, AliExpress etc. Scraping AliExpress The following video shows how WebHarvy can be configured to scrape data from AliExpress product listings. Details of the products like product name, price, minimum orders, shipping details, seller … Read more

How to use User Agent strings to prevent blocking while web scraping ?

What is a user agent string ? The User-Agent string of a web browser helps servers (websites) to identify the browser (Chrome, Edge, FireFox, IE etc.), its version and also the operating system (Windows, Mac, Android, iOS etc.) on which it is running. This mainly helps the websites to serve different pages for various platforms … Read more