Rob Howard (posted August 7, 2013): I need to scrape one URL out of a series of URLs in Amazon's product details. Here is what I'm talking about: The very last link is the one I need to scrape. I get the feeling I'm going to need some sort of regex, since the product details change from one product to the next, so this is going to be a pain in the ass, methinks. Rob
LoWrIdErTJ - BotGuru (posted August 7, 2013): You can scrape the section's outerhtml to a variable, then regex out the href="" values to a list or variable as needed.
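For anyone following along outside UBot, the approach described above (grab the section's outer HTML, then regex out the href values) might look roughly like this in Python. This is only a sketch under assumptions: the sample HTML and the example.com URLs are made up for illustration, the helper name extract_hrefs is hypothetical, and the regex only handles double-quoted href attributes.

```python
import re

# Matches href="..." with double quotes only; single-quoted or unquoted
# attributes are not handled in this sketch.
HREF_RE = re.compile(r'href\s*=\s*"([^"]*)"', re.IGNORECASE)


def extract_hrefs(outer_html):
    """Return every double-quoted href value found in the HTML, in order."""
    return HREF_RE.findall(outer_html)


# Made-up stand-in for the scraped outer HTML of the category section.
section_html = (
    '<li><a href="http://example.com/category/a">A</a> &gt; '
    '<a href="http://example.com/category/b">B</a> &gt; '
    '<a href="http://example.com/category/c">C</a></li>'
)

links = extract_hrefs(section_html)
# The link of interest is the last one in the section, if any were found.
last_link = links[-1] if links else None
```

Taking the last element of the list means the code does not care how many category links a given product has, which is the part that varies from page to page.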
LoWrIdErTJ - BotGuru (posted August 7, 2013): Do you have a link to the page? I can help you real fast.
Rob Howard (posted August 9, 2013): http://www.amazon.com/14-Days-Well-Behaved-Dog-ebook/dp/B008YK51Y8/ref=sr_1_1?s=digital-text&ie=UTF8&qid=1376067494&sr=1-1&keywords=dog+training Sorry for the delay, was out yesterday. Thanks for your help! Rob
ds062692 (posted August 9, 2013): In this case you don't need regex. The last category URL is marked by a "_last" suffix in its href: <a href="http://www.amazon.com/gp/bestsellers/digital-text/156708011/ref=pd_zg_hrsr_kstore_1_6_last"> So a wildcard match on the href is enough. The code would be: add list to list(%last link, $scrape attribute(<href=w"http://www.amazon.com/gp/*_last*">, "href"), "Delete", "Global")
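The same wildcard idea can be sketched outside UBot as well. The Python below is an assumption-laden illustration, not a translation of the UBot command: it collects href values with the standard library's HTML parser and keeps the ones matching the shell-style pattern http://www.amazon.com/gp/*_last*. The names HrefCollector and find_matching_hrefs are hypothetical, and the first URL in the sample HTML is made up to stand in for a non-final category link; only the second URL comes from the thread.

```python
from fnmatch import fnmatchcase
from html.parser import HTMLParser


class HrefCollector(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.hrefs.append(value)


def find_matching_hrefs(html, pattern):
    """Return hrefs in the HTML that match a shell-style wildcard pattern."""
    collector = HrefCollector()
    collector.feed(html)
    collector.close()
    return [h for h in collector.hrefs if fnmatchcase(h, pattern)]


# Sample markup: the first link is a made-up non-final category link,
# the second is the "_last" link quoted in the thread.
sample_html = (
    '<a href="http://www.amazon.com/gp/bestsellers/digital-text/'
    'ref=pd_zg_hrsr_kstore_1_1">first category</a>'
    '<a href="http://www.amazon.com/gp/bestsellers/digital-text/156708011/'
    'ref=pd_zg_hrsr_kstore_1_6_last">last category</a>'
)

last_links = find_matching_hrefs(sample_html, "http://www.amazon.com/gp/*_last*")
```

This relies on the "_last" marker staying in the URL; if the page drops it, the positional approach (take the last href in the section) is the fallback.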