Scraping Yellow Pages Australia (yellowpages.com.au) – phone, email, website

WebHarvy is a visual web scraping software which can be easily configured to scrape data from any website. In this article we will see how WebHarvy can be configured to extract data from www.yellowpages.com.au listings.

Scraping yellowpages.com.au

A special technique is employed to extract data correctly and consistently from yellowpages.com.au listings. This is mainly because the layout of boxes of listings vary from one listing to another – some has header with their logo/image, some does not etc.

The regular expression strings used in the video to extract phone, email and website are given below.

tel:([^”]*)

data-email=”([^”]*)

title=”([^\s]*)\s*\(opens

Know More

We highly recommend that you download and try using the free evaluation version of WebHarvy available in our website. To get started, please follow the link below.

Getting started with web scraping using WebHarvy

Leave a Reply

Your email address will not be published. Required fields are marked *