Scraping News Articles and Press Releases using WebHarvy

In this article we will see how WebHarvy can be configured to scrape news articles, publications and press releases. Because it is a generic web scraping tool, WebHarvy can be configured to extract data from any website to suit your requirements.

WebHarvy can be used to scrape articles from article directories, news from news websites and press releases from PR websites.

How to easily scrape data from websites using WebHarvy?

WebHarvy can save the content of each article as a text file; see "Scrape text as file" for details. The Capture More Content option is also useful when scraping articles. The following demo shows how WebHarvy can be used to scrape articles from www.ezinearticles.com. Details such as the article title, author name, date, article body and keywords can be extracted using WebHarvy.
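WebHarvy is configured by pointing and clicking, so no code is required. For readers who want to picture what "extracting the title, author, date and body" amounts to, here is a minimal Python sketch using only the standard library. It is an illustration under stated assumptions, not WebHarvy's own method: the sample HTML, the class names (`article-title`, `author`, `date`, `article-body`) and the helper `extract_article` are all hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical sample page; the markup and class names are assumptions.
SAMPLE_HTML = """
<html><body>
<h1 class="article-title">Sample Headline</h1>
<span class="author">Jane Doe</span>
<span class="date">2020-01-15</span>
<div class="article-body"><p>First paragraph.</p><p>Second paragraph.</p></div>
</body></html>
"""

# Maps an (assumed) CSS class to the output field it fills.
FIELDS = {
    "article-title": "title",
    "author": "author",
    "date": "date",
    "article-body": "body",
}

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


class ArticleParser(HTMLParser):
    def __init__(self):
        super().__init__()
        self.record = {name: "" for name in FIELDS.values()}
        self._field = None  # field currently being captured, if any
        self._depth = 0     # nesting depth inside the captured element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self._field is not None:
            self._depth += 1
            return
        css_class = dict(attrs).get("class")
        if css_class in FIELDS:
            self._field = FIELDS[css_class]
            self._depth = 1

    def handle_endtag(self, tag):
        if self._field is None or tag in VOID_TAGS:
            return
        if tag == "p":
            # Keep paragraph breaks in the captured text.
            self.record[self._field] += "\n"
        self._depth -= 1
        if self._depth == 0:
            self._field = None

    def handle_data(self, data):
        if self._field is not None:
            self.record[self._field] += data


def extract_article(html):
    """Return a dict with the title, author, date and body text."""
    parser = ArticleParser()
    parser.feed(html)
    parser.close()
    return {key: value.strip() for key, value in parser.record.items()}


if __name__ == "__main__":
    for key, value in extract_article(SAMPLE_HTML).items():
        print(f"{key}: {value!r}")
```

A real news site will use different markup, so the class names would need to be adapted for each site. That per-site adjustment is the step WebHarvy handles through its visual configuration.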

The video below shows how WebHarvy can be used to scrape news articles from the Wall Street Journal (wsj.com).

WebHarvy can also extract the entire article content in HTML format, so that text formatting and embedded images are not lost. To do this, use the Capture HTML feature.

We recommend that you download and try the evaluation version and also view the video demonstrations.

Download the FREE evaluation version of WebHarvy

If you need assistance in configuring WebHarvy, please do not hesitate to contact our support team (support@webharvy.com) with the details (URL of the webpage and a description of the data to be scraped). We are happy to help you get started with your first data extraction project using WebHarvy!

Keywords: Article Scraper, Scrape Articles, PR Scraper, Scrape Press Releases