rafterman 0 Posted February 23, 2011 Report Share Posted February 23, 2011 Trying to scrape yp.com. I understand the wildcards concept and I wish it would work!!! Keep getting these http://www.yellowpages.com/boston-ma/mip/dental-arts-of-boston-13316488?lid=166478893http://www.yellowpages.com/boston-ma/mip/dental-arts-of-boston-13316488/map?lid=166478893http://www.yellowpages.com/boston-ma/mip/dental-arts-of-boston-13316488/reviews?lid=166478893http://www.yellowpages.com/boston-ma/mip/dental-arts-of-boston-13316488/email_form?lid=166478893http://www.yellowpages.com/boston-ma/mip/dental-arts-of-boston-13316488/send_mobile?lid=166478893http://www.yellowpages.com/boston-ma/mip/dental-arts-of-boston-13316488/send_twitter?lid=166478893 I only want the first one, but obviously when I place the asterisk like this: http://www.yellowpages.com/boston-ma/mip/*?lid=* I should get what I do above. What else can I do????????????? Quote Link to post Share on other sites
Kreatus (Ubot Ninja) 422 Posted February 23, 2011 Report Share Posted February 23, 2011 If you want to get the first link only. You can try this wild card on choose attributehttp://www.yellowpages.com/*/mip/*?lid=* Edit: If this doesnt work can you please post the page you want to scrape that link? Quote Link to post Share on other sites
Pete 121 Posted February 23, 2011 Report Share Posted February 23, 2011 If I remember correctly when you scrap and use the add to list that’s what you get a list of all the urls, however if you use a variable to scrap then add that to a list it will return only the first url scrapped Quote Link to post Share on other sites
rafterman 0 Posted February 23, 2011 Author Report Share Posted February 23, 2011 Zap, So that works, but i want to be able to scrape every first link out of that set on the page. Quote Link to post Share on other sites
rafterman 0 Posted February 23, 2011 Author Report Share Posted February 23, 2011 More or less, i'm trying to scrape this page http://www.yellowpages.com/boston-ma/dentist?g=Boston,+MA for the business title url Quote Link to post Share on other sites
Kreatus (Ubot Ninja) 422 Posted February 23, 2011 Report Share Posted February 23, 2011 Hi check this yp.ubot I choose different attibute so no need to use wildcard.. 1 Quote Link to post Share on other sites
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.