Search Knowledge Base articles

    Settings for google search data extractor

    Check these settings in google search data extractor

    Extract settings

    There is a tick mark in email address and Phone numbers. This means that the software will extract both email addresses and phone numbers when we make the searches.  if you do not need email, then you can untick email addresses. or if you dont need phone numebrs, then you can untick phone numbers.


    Maximum time for email search settings

    settings Maximum time for email search would be the time that the software spends on every single website. The software will search a website for email addresses and phone numbers for a maximum of 20 seconds. and then it will visit the next website. the software will extract as many email addresses or phone numbers available on public pages. You can reduce the time or increase it depending on your requirement. 


    Email regular expression

    You can just leave it untouched. because this is a standard regular expression used by developers to extract email addresses. 


    Phone regular expression

    You can just leave it untouched. This regular expression is developed for indian numbers, however if you wish to extract phone number of other countries, then you will have to change the regular expression.


    Domain blacklist

    If you dont wish to extract data from particular websites, then you can add them in blacklist. this way you will always be able to extract quality database. for example .. you are trying to extract data of website designing companies in mumbai, google.com will always display few list of business directories, blog websites, or classified websites etc. the data extracted from these websites might not be useful. so the best thing you can do is block them.


    Email blacklist

    You can just leave it untouched. unless you are a software programmer.


    Related Articles