Jump to content
UBot Underground

sales

Fellow UBotter
  • Content Count

    22
  • Joined

  • Last visited

Posts posted by sales

  1. I don't usually share anything here on the forum because I find uBot to be very frustrating for my old senile mind :)

     

    But I found a Javascript code I like much better than the uBot source code viewer. This a Javascript Bookmarklet that works in my uBot program (and also in IE, Firefox and Chrome) that shows the source code in a layout where I can tell what elements are inside of other elements.

     

    Anyways, I have attached it to this post. All you have to do is create a bookmarklet for standard browsers and inside of your uBot project, create a "navigate" command and paste the mass of JS code inside like any other URL. When you run the navigate command a popup appears over top of the page in the uBot browser window with the HTML color coded layout.

     

    Hopefully other uBotter's will find it useful.

     

    salesView Source Code.txt

    • Like 1
  2. Dan,

     

    Just search on yellowpages.com for any business category (contractor type plus location). It displays 2-3 promoted ads on the top, 3-4 on the bottom and 3-4 on the right sidebar. The regular ads are in the center with numbers from 1-30 per page. The paid ad businesses are also contained in the 30 listings in the center. Some have street addresses, some don't. I can't seem to get only the 30 in the middle with the 1. 2. 3. etc using the xPath Pro for an expression. I don't know enough to use advanced xpath. I thought if I could see an example I could learn from it.

     

    Thanks for checking it out.

     

    sales

    ============

    cmmorris1

    I will try your expression when I am at the program. Thanks for trying to help.

     

    sales

  3. Hi Dan,

     

    Bought the program. I am trying to learn how to use it now. In my first attempt I was looking at a yellowpages.com search results page. Here is the URL:

    http://www.yellowpages.com/search?search_terms=Concrete+Contractors&geo_location_terms=Lancaster%2C+OH

     

    I wanted to scrape ONLY the listings on the page that are numbered (like 1. 2. 3. etc). Due to the overlapping structure of the HTML I cannot figure out what expression would find only the numbered listing results. Can you look at that page and tell me if it's possible and how to select them. Keep in mine I am a total newbie to xpath so really "dumb-down" your explanation for me.

     

    Thanks in advance for any help. If I should submit it to support let me know. I just thought maybe another newbie user would be stuck like me.

     

    sales

  4. I wanted to post information about a VERY nice freeware program that helps me every single day.

     

    Maybe you have it or have heard of it. It's called FileBox Extender and is at

    http://www.hyperionics.com/files/index.asp.

     

    The program adds multiple functions, but for me, it give me the ability to click a small icon on most windows file and program windows and they stay "always on top". For instance, in uBot I can keep the debugger out where I can see what is being scraped at all times without creating a stat viewer.

     

    sales

  5. Being new to uBot...EVERYTHING is a challenge! I'm sure the uBot "gurus" here on the forum don't have this problem...but I sure did.

     

    I scraped a list of URLs from Google. There were lots of duplicates in my list so I wanted to use the $replace regular expression feature to clean my list. I found several regex's online, both for sorted and unsorted lists. Anyways, here is the code I used in uBot v4. It may not be pretty...but it works! I hope it will help some other poor newbie soul out. This regex deletes duplicates in lists that are either sorted or unsorted.

     

    My code:

     

    clear list(%dupesremoved)

    add list to list(%dupesremoved, $list from file("C:\\GoogleURLs\\GoogleURLs.txt"), "Delete", "Global")

    save to file("C:\\GoogleURLs\\DupesRemoved.txt", $replace regular expression(%dupesremoved, "^(.*)(?:\\r?\\n|\\r)(?=[\\s\\S]*^\\1$)", ""))

     

    sales

×
×
  • Create New...