Jump to content
UBot Underground

Anyway to load/scrape from unlimited loading web 2.0 type pages?


Recommended Posts

A lot of websites are now gearing towards the style of scrolling down to load extra content instead of just having a next page button which is hell for automation. I wasn't sure if there was some sneaky way of loading all this content to scrape it all. I've used javascript to scroll down, wait, scrape, repeat. However after doing this "x" amount of times too much is loaded on the page and eventually ubot freezes.

Link to post
Share on other sites

I don't think there is another way around that.

 

I know Twitter displays user feeds like that, and in that case I think you could get around by scraping regularly, so you don't have to go as deep as with a single scrape.

Link to post
Share on other sites

just loop while "more" or whatever it is exists, then throw a click in there or can also use java or jquery

 

run javascript("for (i = 0; i < 100; i++) \{
    document.write(\"\" + i + \"<br />\");
    window.scroll(0,document.body.offsetHeight);
\}

")
 

Could have swore this worked for me before but now its showing the numbers.. maybe a java pro can chime in here..

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...