Jump to content
UBot Underground

Recommended Posts

search page for the mailto:?

 

 

But how do I find the page with the contact information on it?  Not all pages are labeled "contact us"

That is the bit you'll have to code. Try finding a common ground among your sites list pages to find the contact page, or maybe a little un-efficient way is to scrape all the pages and then go with a regex gun on it.

 

Regex to scrape emails: 

[a-zA-Z0-9\._\-]{3,}(@|AT|\s(at|AT)\s|\s*[\[\(\{]\s*(at|AT)\s*[\]\}\)]\s*)[a-zA-Z]{3,}(\.|DOT|\s(dot|DOT)\s|\s*[\[\(\{]\s*(dot|DOT)\s*[\]\}\)]\s^*)[a-zA-Z]{2,}((\.|DOT|\s(dot|DOT)\s|\s*[\[\(\{]\s*(dot|DOT)\s*[\]\}\)]\s*)[a-zA-Z]{2,})?$

Hope it helps,

 

Cheers!

Praney

Link to post
Share on other sites

im thinking i have to 

 

1. search keywords on google

2. scrape links in results to a list

3. take that same list go back to goole and do a site:xyz.com to get all the pages from that domain and save to another list

4. take that other list and search for the emails.

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...