Jump to content
UBot Underground

Scrape Text From A Pdf


Recommended Posts

Im trying to get some text off a pdf file.  The web page allows you to open the pdf in your browser if you have the adobe's plug-in but I cant get the ubot browser to open the pdf.  I can also save the pdf, but then I dont know how to get the text off of it.

Is there some plug-ins that do this? 

 

Or is there a pdf to text converter someone knows of I can run as a script?

 

All I want to do is get all the text off a pdf usually just 2-4 pages in length.  Once I have access to the text I can process it and get what I need out of it.

Link to post
Share on other sites

added a simple ubot script to load this 

 

https://mozilla.github.io/pdf.js/getting_started/

 

 

I have added the folder to dropbox,download the folder

 

https://www.dropbox.com/sh/2a67mtm1co02nib/AAA2_D4I0Abu44lVemWzrW5Ra?dl=0

 

open the web folder and select UbotFile.ubot

Link to post
Share on other sites
  • 8 months later...

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...