Jump to content
UBot Underground

[SELL]Scrape And Save Websites


Recommended Posts

Introducing: Site Ninja

 

 

What it Does:

 

  • Scrapes All HTML Files
  • Downloads Images
  • Saves CSS and JS Files
  • Creates the File Paths on Your Local Machine
  • Saves the Entire Site on Your Computer

A few really cool uses for this:

 

  • Create killer UI's for your bots.  Like the UI of Site Ninja?  I used it to create it's own UI.
  • Improve your HTML5 and CSS3 skills by looking at what the professionals are doing.
  • Create awesome landing pages quickly and for FREE.

What it does not do:

  • It does not scrape and save wordpress themes or anything for any other CMS.
  • It does not save 100% flash websites.
  • It does not make you coffee or breakfast in the morning.

Frequently Asked Questions:

 

  • What about support?  Yes, I provide support on this.  If it's not working on a site you want to scrape, then I will take a look and see if it's possible to make a change to the software so it will work.
  • Are updates included?  Yes, updates are included.  What's nice about creating a bot that does this instead of the typical bot that does stuff on social networks is that the websites aren't updated all the time to make botting harder, so frequent updates shouldn't be necessary and it should always just work.
  • Money back policy?  Unfortunately because this scrapes and saves websites quick (only a few minutes per site) someone could easily just scrape all the sites they wanted and then ask for a refund.  Therefore, there's no money-back policy.  It works as advertised.

 

Only $27!

http://inninja.com/files/SiteNinjaSalesPageImages/download.jpg

Link to post
Share on other sites

does it work on wix sites?  thanks.

As in http://www.wix.com/?  I don't know.  I haven't tried, but I will go and try now.

 

EDIT: I just tried and the short answer is: no, not really.  Wix stores all the images, css and js at wix.com/longFilePath (rather than just img/mypicture.jpg).  This wouldn't be much of a problem, but they are using javascript to pull all the text.  So this bot will save an index.html file that reflects the exact wix site and you could change the images, but not the text because it's being called with their javascript (or at least that's what I can tell from my quick glance).

 

Edit2: I might update it later where it will save all js and other files even if it's path is domain.com/longpath.... and then on your local machine it creates new paths and changes the html in your index so the new path is there and everything magically works, but when I do that the price will definitely go up.

Link to post
Share on other sites

There is a large market for people wanting a tool to pull exact copies of wix sites, as their interface for designing is fast and easy to use, and very good for landing pages especially. Thats why they have their setup that way.

 

you would be better of creating a dedicated tool called wix scraper though.

 

Best of luck with sales anyway.

Link to post
Share on other sites

There is a large market for people wanting a tool to pull exact copies of wix sites, as their interface for designing is fast and easy to use, and very good for landing pages especially. Thats why they have their setup that way.

 

you would be better of creating a dedicated tool called wix scraper though.

 

Best of luck with sales anyway.

I'll keep this in mind.  Thanks!

Link to post
Share on other sites

Pretty nice bot.  When do you think the "wix update" would be ready? 

I don't know.  I hadn't even considered this until quite_interesting's post.  I will take a look at wix when I have time (possibly over the weekend) and see how feasible it is to incorporate this.  

 

What exactly would you want a tool to do with wix?  Would you need to be able to edit the html in a text editor, or would it simply be so you can upload an exact copy of a wix page you already created to your own server?

Link to post
Share on other sites

Well, i have never done anything using wix, BUT my little doughter does, and she loves it (more than 20 sites about her favorite things,best friends, music and more). Guess she only wants to download, and upload it, since editing is easy in the wix site.

I don't know.  I hadn't even considered this until quite_interesting's post.  I will take a look at wix when I have time (possibly over the weekend) and see how feasible it is to incorporate this.  

 

What exactly would you want a tool to do with wix?  Would you need to be able to edit the html in a text editor, or would it simply be so you can upload an exact copy of a wix page you already created to your own server?

Link to post
Share on other sites

Well, i have never done anything using wix, BUT my little doughter does, and she loves it (more than 20 sites about her favorite things,best friends, music and more). Guess she only wants to download, and upload it, since editing is easy in the wix site.

If that's the case, then you can do that now with the software.  Every time you make changes on the wix site, just scrape it again and re-upload to your server..  It will save an identical copy of your wix site as one file - index.html.  You just can't edit it, you'll need to do that inside Wix.

Link to post
Share on other sites

saves entire site.... should i try scraping 10.000s page sites as well?

It probably won't work on a site with thousands of pages, but most of those are content management systems so it wouldn't work anyway.  It can definitely get a lot of pages, though, especially if there are links to them on the index page and the href tags are structured something like this: href="stuff/morestuff/mypage.html".  This is how it looks for pages because this is the common "best practices" for building static websites.  Otherwise, it would save all pages that it's linking to, even if it's on another domain, and it just wouldn't work very well.  This might change in the future, but that would be a much more complicated process and I would have to raise the price at that point.

Link to post
Share on other sites

If that's the case, then you can do that now with the software.  Every time you make changes on the wix site, just scrape it again and re-upload to your server..  It will save an identical copy of your wix site as one file - index.html.  You just can't edit it, you'll need to do that inside Wix.

Actually that is possible from the wix site, but if wix deletes the site (just imagine!),  It would be great if I can save the whole page in a new host, or even in my pc.

Link to post
Share on other sites

Actually that is possible from the wix site, but if wix deletes the site (just imagine!),  It would be great if I can save the whole page in a new host, or even in my pc.

 

Does your bot scrape too .php files ?

Link to post
Share on other sites

Does your bot scrape too .php files ?

No.  That's not really possible, as far as I know, especially if the php file redirects visitors.  Maybe it's possible to download php files?  I don't know, I haven't looked into this.

Link to post
Share on other sites

Bought it :)

When scraping non English sites, the html-s are messed up. Characters shown for example “é” instead of “é” in the html file.

Any chance to correct that?

Cool, thanks for your purchase!  Please send me a PM with the site you're scraping and I will take a look at it.  

Link to post
Share on other sites

"Create killer UI's for your bots.  Like the UI of Site Ninja?  I used it to create it's own UI."

 

How do you do this?

Well, while I was working on Site Ninja inside the UBS editor I browsed http://themeforest.net/category/marketing/landing-pages and found the theme you see in this UI.  I ran the URL through the bot and saved the theme to my local computer.  Then I did a bit of editing (really, I just deleted all the other stuff below the header and moved their search text box into the header area) and bam, I had a nice lookin UI.

Link to post
Share on other sites

Bought it :)

When scraping non English sites, the html-s are messed up. Characters shown for example “é” instead of “é” in the html file.

Any chance to correct that?

Sorry for the delay.  This is still on my "to-do" list.  Just been busy lately with other projects and the holidays and all.

Link to post
Share on other sites

Can it save japanese's word correctly?

It seems like there is an issue right now in how ubot scrapes text. It doesn't seem to store/display foreign characters correctly.  I'm going to see if there's a work around for this.

Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...