UBot Underground

[Solved] Scraping text that has no pattern?



Alexa lists 100 backlinks to a web site, for example for bodybuilding.com:

 

http://www.alexa.com/site/linksin;0/bodybuilding.com

http://www.alexa.com/site/linksin;4/bodybuilding.com

 

I want to grab the pages listed below the domain, but there does not seem to be a text or HTML pattern that I can use to pull out the URLs (example listing).

Does anyone have an idea how to grab the text of the URLs? They are shown as plain text, not as actual links.

 

Thanks,

Troy


Hey John,

 

Thanks. I had started trying it a different way, but looking at your code: clean, smooth, and perfect.

 

I am adding this to my "knowledge-base". :)

 

Thanks again.

  • 1 month later...

Sorry for jumping in, this thread answered half of my question! :-)

 

How can I access %urls after it has been scraped?

 

 

They are in the list %urls. Save the list to a text file, or just read the items straight from the list.
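For anyone who wants to see the "save to a text file or read from the list" idea spelled out, here is a rough sketch in Python rather than UBot script. It is not the code from this thread: the function names `save_urls` and `load_urls`, the file name `urls.txt`, and the sample URLs are all made up for illustration, and the Python list only stands in for a scraped list such as %urls.

```python
from pathlib import Path


def save_urls(urls, path):
    """Write one URL per line, skipping blank entries; return the count written."""
    cleaned = [u.strip() for u in urls if u.strip()]
    Path(path).write_text("\n".join(cleaned) + "\n", encoding="utf-8")
    return len(cleaned)


def load_urls(path):
    """Read the file back into a list, one URL per non-empty line."""
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


# Placeholder data standing in for the scraped list (hypothetical values).
urls = ["http://example.com/page-a", "http://example.com/page-b"]

# Option 1: persist the list to a text file for later use.
save_urls(urls, "urls.txt")

# Option 2: just walk the list (here reloaded from the file) item by item.
for url in load_urls("urls.txt"):
    print(url)
```

Writing one URL per line keeps the file easy to load back into a list later, which is probably the simplest way to reuse the scraped results in another run.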

 

 

Pftg4
