Learjet 27 Posted January 1, 2016 Report Share Posted January 1, 2016 (edited) Greetings, first post on the forum :-) I'm following Frank's scraping tutorial but Google has removed the class from the links on their pages making things a bit more tricky. I can do this easily with Scrapebox but I'm having trouble figuring out how to do it with Ubot. What I need to do is scrape the content between one point and another: <a href="https://www.allaboutbirds.org/guide/" onmousedown="return rwt(this,'','','','1','AFQjCNEtCVsjUVy4rwZhRkep3d599ciT0g','','0ahUKEwid9s640IXKAhWM6yYKHWfEAf0QFggcMAA','','',event)">Bird Guide - All About Birds</a> Basically I want to scrape the info between: <a href=" and " onmousedown="return which will give me just the link Thanks for your wisdom!Peace,EJ Lear Edited January 1, 2016 by Learjet Quote Link to post Share on other sites
pash 504 Posted January 1, 2016 Report Share Posted January 1, 2016 scrape html and use REGEX alert($find regular expression("<a href=\"https://www.allaboutbirds.org/guide/\" onmousedown=\"return rwt(this,\'\',\'\',\'\',\'1\',\'AFQjCNEtCVsjUVy4rwZhRkep3d599ciT0g\',\'\',\'0ahUKEwid9s640IXKAhWM6yYKHWfEAf0QFggcMAA\',\'\',\'\',event)\">Bird Guide - All About Birds</a>","(?<=href=\").*?(?=\" onmousedown=\")")) Quote Link to post Share on other sites
pftg4 102 Posted January 1, 2016 Report Share Posted January 1, 2016 you can use page scrape also little easier than regex Quote Link to post Share on other sites
Learjet 27 Posted January 1, 2016 Author Report Share Posted January 1, 2016 Thanks pftg4! Got it working with page scrape, great feature! I use this kind of function all the time in Scrapebox and it's very handy. Love being able to put it all together in uBot, saves me a lot of time! Peace,EJ Quote Link to post Share on other sites
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.