Scraping Between Markers

Learjet · January 1, 2016

Greetings, first post on the forum :-)

I'm following Frank's scraping tutorial but Google has removed the class from the links on their pages making things a bit more tricky.

I can do this easily with Scrapebox but I'm having trouble figuring out how to do it with Ubot. What I need to do is scrape the content between one point and another:

<a href="https://www.allaboutbirds.org/guide/" onmousedown="return rwt(this,'','','','1','AFQjCNEtCVsjUVy4rwZhRkep3d599ciT0g','','0ahUKEwid9s640IXKAhWM6yYKHWfEAf0QFggcMAA','','',event)">Bird Guide - All About Birds</a>

Basically I want to scrape the info between: <a href=" and " onmousedown="return which will give me just the link

Thanks for your wisdom!

Peace,
EJ Lear

Edited January 1, 2016 by Learjet

pash · January 1, 2016

scrape html and use REGEX

alert($find regular expression("<a href=\"https://www.allaboutbirds.org/guide/\" onmousedown=\"return rwt(this,\'\',\'\',\'\',\'1\',\'AFQjCNEtCVsjUVy4rwZhRkep3d599ciT0g\',\'\',\'0ahUKEwid9s640IXKAhWM6yYKHWfEAf0QFggcMAA\',\'\',\'\',event)\">Bird Guide - All About Birds</a>","(?<=href=\").*?(?=\" onmousedown=\")"))

pftg4 · January 1, 2016

you can use page scrape also little easier than regex

Learjet · January 1, 2016

Thanks pftg4!

Got it working with page scrape, great feature! I use this kind of function all the time in Scrapebox and it's very handy. Love being able to put it all together in uBot, saves me a lot of time!

Peace,

EJ

Sign In

Scraping Between Markers

Recommended Posts

Learjet 27

Link to post

Share on other sites

pash 504

Link to post

Share on other sites

pftg4 102

Link to post

Share on other sites

Learjet 27

Link to post

Share on other sites

Join the conversation

Browse

Activity