Jump to content
UBot Underground

New User Need Help When Scraping


Recommended Posts

Hi everyone,

I just purchased ubot studio I have basic question here, I have project to create bot for my self like these:

1. Scrape urls from specific website that contain articles match with my keywords

2. Store the urls to specified folder

4. Grab articles from urls I saved

5. Save article in specified folder become: article 1, article 2, article 3 and so on

Please help with sample script you made is very appreciated

Thanks so much and Please see image

 

http://oi58.tinypic.com/34p04mv.jpg

Link to post
Share on other sites

If you want to get a list of the links, first get the "Add list to list" command out, then in the "List to Add" spot you need to put in a "Scrape attribute" parameter to get the class of links or however they're classified.

Link to post
Share on other sites

Welcome to the forum jbsgroup.

 

If your looking for example scripts try searching google with a footprint. 

 

inurl:ubotstudio.com "[free]"

 

inurl:ubotstudio.com "example"

 

ect

 

It works better than really any forum search.

 

and here is something to start you with.

 

this will scrape google results via regex.

 

add list to list(%results, $find regular expression($scrape attribute(<class="r">, "innerhtml"), "(?<=href\\=\\\")http.*?(?=\\\")"), "Delete", "Global")

  • Like 1
Link to post
Share on other sites

Hi,

 

Happy new year!

 

Thanks for all reply,

 

But please check to target website: kompasiana.com then try to search something until you find search result then click the page number at the bottom (like my image #3 above) the url in the browser wont change so I can't scrape the next page (e.g page number 2, 3, 4 and so on)

 

Thanks so much

Link to post
Share on other sites

Hi,

 

Happy new year!

 

Thanks for all reply,

 

But please check to target website: kompasiana.com then try to search something until you find search result then click the page number at the bottom (like my image #3 above) the url in the browser wont change so I can't scrape the next page (e.g page number 2, 3, 4 and so on)

 

Thanks so much

Why does the URL has to change? you can just click those buttons and it will show the new results.

By the way.. the kompasiana.com site is presenting results from google so you could just work with google directly?

 

Dan

Link to post
Share on other sites

Havnt read your problem to be honest,but doing a scrape yourself really is a pivotal point in learning ubot,but WATCH this group of beginner tutorials from Automation League,this guy also has a lot of bot submissions in the bot bank,great guy and an even better teacher,project build a gogle scraper,8 tutorials from absolute beginner leading to the scraper

 

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...