Jump to content
UBot Underground

[FREE] Footprint Website Finder


Recommended Posts

First of all I wanted to say thanks again to LoWrIdErTJ and willywonka for their hugs lists of footprints. You can find the list as a sticky at the top of the Tutorials, Tips and Trick forum or by clicking here.

 

I decided to create a bot to search and scrape footprint and keywords to find websites based on a list of footprints. Because doing it one at a time sucks. And I wanted to give it to the ubot community because so many people have helped me with so many questions. There is always someone on this forums eager to help, give ideas and even write some code for you when you're stuck.

 

Thanks to k1lv9h for helping me fix a scraping issue with the bot.

 

Here it is!

 

 

 

 

footprint-website-finder-005.ubot

  • Like 7
Link to post
Share on other sites

Happy to offer something of some value. Hope people can put it to use. I've used it quite a lot already in combination with the huge list of footprints.

Link to post
Share on other sites

Hey Josh great share. Im finding the URLs are all coming out with long strings at the end of them though. Like this for example.

 

http://katymarvelcatering.com/guestbook&sa=U&ei=3Il9UOTJBeqW0QGKz4CoAQ&ved=0CBUQFjAA&usg=AFQjCNFwHUl6NKQ5oK2Yuqk6cXURwY96_w

 

Any idea why thats happening. I didn't have time to check into it fully. Not sure how you're going about the scraping.

Link to post
Share on other sites

Hey Josh great share. Im finding the URLs are all coming out with long strings at the end of them though. Like this for example.

 

http://katymarvelcat...Yuqk6cXURwY96_w

 

Any idea why thats happening. I didn't have time to check into it fully. Not sure how you're going about the scraping.

 

I have not had that issue.

 

Is anyone else seeing that issue?

Link to post
Share on other sites

Hey Josh great share. Im finding the URLs are all coming out with long strings at the end of them though. Like this for example.

 

http://katymarvelcat...Yuqk6cXURwY96_w

 

Any idea why thats happening. I didn't have time to check into it fully. Not sure how you're going about the scraping.

 

I am seeing that now. Strange it wasn't doing that before. I'll try to see if I can scrape another way, and let you know.

Link to post
Share on other sites

Beat me to this one Josh,

 

I have been working on one as a side project for a while now. I will post it up for free when I'm done with it. Not to steal your thunder... Mine is pretty different.

Link to post
Share on other sites

Beat me to this one Josh,

 

I have been working on one as a side project for a while now. I will post it up for free when I'm done with it. Not to steal your thunder... Mine is pretty different.

 

I live in San Diego. Not too much thunder here. It's always sunny! :D

Link to post
Share on other sites

So I figured out that google is doing something to the url on the page and you can see it in the address bar right before you are redirected to the website from the SERP.

 

I found the piece of code in google SERP source code, but I don't know how to go about extracting the url.

 

<a class="l" onmousedown="return rwt(this,'','','','1','AFQjCNGs3NfmEYRSQfhQPeSOOjJSfHZbFg','','0CCEQFjAA',null,event)" href="http://as.wwu.edu/kugs/playlist/">

 

Any ideas?

Link to post
Share on other sites

Hey Josh,

 

This will scrape the url's very well:

add list to list(%urls, $scrape attribute(<class="l">, "href"), "Delete", "Global")

 

It's what I'm using in my scraper that will blow yours out of the water :P

 

J/K

Link to post
Share on other sites

Hey Josh,

 

This will scrape the url's very well:

add list to list(%urls, $scrape attribute(<class="l">, "href"), "Delete", "Global")

 

It's what I'm using in my scraper that will blow yours out of the water :P

 

J/K

 

lol

 

Well I tried it and it's not scraping any data. Are you setting a user agent? if so which one?

Link to post
Share on other sites

No thats without a useragent.

 

Just realized that your bot sets the useragent to ie6.

 

That worked. But then how are you changing the google search settings to show 100 results per page?

Link to post
Share on other sites

I'm not. Just a sacrifice you have to make or choose not to.

 

I don't think it's possible using default ua.

 

There was a thread about it a while back, but I couldn't find it...

Link to post
Share on other sites

I'm not. Just a sacrifice you have to make or choose not to.

 

I don't think it's possible using default ua.

 

There was a thread about it a while back, but I couldn't find it...

 

Found the solution. Look in the bot bank! Under search providers. There is a google scrape bot. That has the solution.

Link to post
Share on other sites
  • 3 weeks later...

I downloaded V4 and it worked great, except for the extra strings on the results urls. So I downloaded V5, but it doesn't run at all and actually crashed my computer a couple of times - couldn't shut it down or anything. I'm using V4.5 professional edition - could that be why? On windows 7

Link to post
Share on other sites

I downloaded V4 and it worked great, except for the extra strings on the results urls. So I downloaded V5, but it doesn't run at all and actually crashed my computer a couple of times - couldn't shut it down or anything. I'm using V4.5 professional edition - could that be why? On windows 7

 

It was created with DEV version. I can compile it and send it to you if you like.

Link to post
Share on other sites

It was created with DEV version. I can compile it and send it to you if you like.

 

Ah, so I need the DEV version to get it to work? I compiled it and tried it that way too, but it also won't work for me then. I didn't know it made a difference. If that's the deal, thanks for clearing it up for me. And yes, I would LOVE the compiled version, thank you!! Great bot!

 

Edit here - the first version worked for me, though. Wasn't that made in using the DEV version also?

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...