How to Use the Yellow Pages Scraper Data Extraction Software
How to Use the Yellow Pages Scraper Data Extraction Software
Yellow Pages is a good source of local B2B leads such as restaurants, vape shops, cbd shops, petrol stations, beauty salons, hair dressers, auto garages and so on. A clear advantage of scraping Yellow Pages business directory is the fact that all the business contact details are presented in a template format and are relatively complete. USA version of the Yellow Pages is fairly easy to scrape and has all the business contact details, including emails. UK version of Yellow Pages is more secure and so requires more proxies to scrape. UK Yellow Pages (Yell.com) do not display email addresses but our Yellow Pages Scraper automatically extracts email addresses from the business websites and their Facebook Business pages.
How to Extract Information from Yellow Page Websites
First of all, it is important to note that scraping the Yellow Pages business directories is different to how you would normally scrape the search engines. This is due to the fact that Yellow Pages business directory is focused on local searches: it helps people to locate local businesses near them. For this reason, you will need to select one root keyword that describes the target business niche for which you would like to scrape business contact details. For example, this could be a beauty salon, dentist, real estate agents, lawyer, vape shop, Hemp and CBD shop, gas station, grocery store and so on. Then, you will need to produce many geo-targeted keywords for the area/region/country you want to scrape. In the end, you should have your root keyword with many geographic variations (different cities and post codes).
To achieve this, you would need to use our Footprint generator tool. You can locate it on the main GUI of the website scraper and email extractor (just under the keywords input field). Open up the footprints generator and in the "Keywords" field, enter your root keyword. This should describe the business niche you would like to scrape; i.e. vape shop, hemp and cbd shop, beauty salon, nails, hair and beauty, cafe, gas station, grocery store, yoga, gym, etc. Inside the "Footprint 1" text field, simply upload your cities that you would like to scrape. We have done all the heavy lifting for you by generating our lists with the main cities and post codes of popular countries. You can of course, use your own cities. Now click on "Merge" and the keywords generator will generate your new set of keywords and automatically transfer them to the keywords field.
How to Configure your Yellow Pages Data Extractor
Go to settings and open the search engines/dictionaries tab.
Inside, the tab, select either USA Yellow Pages (https://www.yellowpages.com) or UK Yellow Pages (https://www.yell.com). If you are planning to use your own keywords that you generated using the footprint generator, simply select either UK or USA yellow pages. However, if you would like to use just your root keyword, then double click on the plus sign to expand the options. Then, select the cities or states. The Yellow Pages Scraper will automatically scrape your root keyword for every single city or state that you selected. This search is much broader and is great for some types of local businesses that have a lesser presence. This could include vape shops and hemp and cbd shops. You will not find nearly as many vape shops in a city or state as you will find say beauty salons and grocery stores. Therefore, it is fine to use a broader search. On the contrary, if you want to scrape more popular businesses such as cafes, beauty salons, convenience stores and restaurants, you would be better off using your own targeted keywords.
Configuring your Proxies for Yellow Pages
If you are planning of scraping USA Yellow Pages, we recommend that you use either USA proxies or a VPN using USA IPs with a timed out IP change. To scrape the UK Yellow Pages business directory, you will need more proxies than for USA Yellow Pages. Yell.com is more sensitive to scraping. We recommend that you use lower thread numbers and many proxies. You can use either shared proxies or better still, backconnect rotating proxies where each IP address will change either after each request or after a given period of time (3 minutes, 10 minutes and so on).