Jump to content
UBot Underground

*page scrape using a wildcard


Recommended Posts

Am trying to scrape a page and I need to add a wildcard to the left side

 

 

<A class=l onmousedown="return clk(this.href,'','','res', * href="

 

 

where the * is the wildcard, but it isn't working. How do I need to specify the wildcard for it to skip a certain amount of characters?

Link to post
Share on other sites

Thanks for the reply.

 

Scraping URLS from Google. Have got all URLS but includes some local google links, was going to clean the list but thought if could use wild cards would be a cleaner solution.

 

Looking it now via source in FF, it should be easy

 

Left side

<h3 class="r"><a href="

right side

"

 

But when I view it in Ubot page scrape view thingy, it has the left side

<H3 class=r><A class=l onmousedown="return clk(this.href,'','','res','3','','0CBIQFjAC')" href="

 

where the '3' changes and the '0CBIQFjAC' changes per URL.

 

 

Am just going to have to clean it, shouldn't be too hard with some additions to the string lib. Hopefully get it finished soon and uploaded

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...