Jump to content
UBot Underground

Cleaning up lists - how to evaluate using wildcards?


Recommended Posts

So i've got a bot that will scrape a ton of wordpress installations. It outputs the sites to a text file and i'm good to go. However, I want the option to be able to clean that list by removing any list item that has some form of "wordpress.com" in the url. The problem is that there's no wildcard option to read a list item @ list position and figure out if that list item has some form of url originating from wordpress.com

 

Besides having to navigate to each url in order to scrape the page using wildcards and cleaning the list that way, what other way can I tackle this?

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...