cervant41 2 Posted August 18, 2010 Report Share Posted August 18, 2010 So i've got a bot that will scrape a ton of wordpress installations. It outputs the sites to a text file and i'm good to go. However, I want the option to be able to clean that list by removing any list item that has some form of "wordpress.com" in the url. The problem is that there's no wildcard option to read a list item @ list position and figure out if that list item has some form of url originating from wordpress.com Besides having to navigate to each url in order to scrape the page using wildcards and cleaning the list that way, what other way can I tackle this? Quote Link to post Share on other sites
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.