WebHarvy Blog

WebHarvy 6.2 (Enhanced Proxy Support, Chromium v86, New Browser Setting options)

March 23, 2021 by admin

The following are the changes in this version. Enhanced proxy support In this version we have added support for various types of proxies. Earlier, WebHarvy supported only HTTP proxies. Starting from this version the following proxy types are supported. HTTP HTTPS SOCKS4 SOCKS4a SOCKS5 In the proxy settings window you can select the type of … Read more

Sequentially Scrape Websites : Automation

March 8, 2021 by admin

Often you require to scrape data from multiple websites and might also need to automate the entire process. The following would be your desired workflow. Configure WebHarvy to scrape data from each website. Then start scraping data from each website, one after the other, without any manual intervention. In short, a one-click method to start … Read more

How to Scrape Google Maps Location Coordinates?

April 26, 2021February 8, 2021 by admin

This article explains how the Keyword Scraping feature of WebHarvy can be used to scrape geo location coordinates (latitude and longitude) of a list of addresses from Google Maps. Given below is a sample list of addresses for which we will scrape geo location coordinates from Google Maps using WebHarvy as shown in the above … Read more

How to scrape Google Jobs? | Scraping job details

June 20, 2023January 13, 2021 by admin

WebHarvy can be used to scrape job details from jobs listing websites like Indeed, Google Jobs etc. WebHarvy can automatically pull job details from multiple pages of listings and save them to a file or database. The following video shows how WebHarvy can be configured to scrape data from Google Jobs listings. Details like job … Read more

How to scrape business contact details from Google Maps ?

August 5, 2022December 18, 2020 by admin

WebHarvy is a visual web scraper which can be easily configured to scrape data from any website. In this article we will see how WebHarvy can easily scrape business contact details from Google Maps. WebHarvy can scrape contact details (name, address, website, phone etc.) as well as reviews of businesses displayed on Google Maps. The … Read more

How to build a simple web scraper using Puppeteer?

May 25, 2021November 17, 2020 by admin

Table of Contents What is Puppeteer? Uses of Puppeteer How to install? How to start a browser instance? How to load a URL? How to navigate/interact with the page? How to take screenshots, save page as PDF? How to select data from page? Headless browser as a service What is Puppeteer? Puppeteer (https://developers.google.com/web/tools/puppeteer) is a … Read more

Announcing an upcoming product : GrabContacts

October 30, 2020 by admin

We are happy to announce news regarding our upcoming product launch, on which we were working during the past one year. GrabContacts is an online service which helps you easily extract contact details (email addresses, phone numbers, social media handles) from websites (URLs) or search queries. Unlike WebHarvy there is no configuration involved, you just … Read more

AliExpress Scraper – Scraping product data including images from AliExpress

September 10, 2020 by admin

WebHarvy is a visual web scraper which can be easily used to scrape data from any website including eCommerce websites like Amazon, eBay, AliExpress etc. Scraping AliExpress The following video shows how WebHarvy can be configured to scrape data from AliExpress product listings. Details of the products like product name, price, minimum orders, shipping details, seller … Read more

How to use User Agent strings to prevent blocking while web scraping ?

November 1, 2024August 28, 2020 by admin

What is a user agent string ? The User-Agent string of a web browser helps servers (websites) to identify the browser (Chrome, Edge, FireFox, IE etc.), its version and also the operating system (Windows, Mac, Android, iOS etc.) on which it is running. This mainly helps the websites to serve different pages for various platforms … Read more

Scraping Instagram Images using WebHarvy

April 27, 2021August 25, 2020 by admin

WebHarvy can be used to scrape text as well as images from websites. In this article we will see how WebHarvy can be used to scrape Instagram Images. How to scrape images from Instagram? The following video shows how WebHarvy can be configured to scrape Instagram images (download images) by searching Instagram for a tag … Read more