UBot Underground

Scraping Ezines...No paragraph issues!



Hi

 

Does anyone have any special tricks to auto-create paragraphs from EzineArticles scrapes? Like a regex trick or something?

I'm using $findstory.

 

If not, where do you guys scrape from? Somewhere with clean text like Ezine, but with paragraphs!


Take a look at this article at EZA:

 

http://ezinearticles.com/?Bathroom-Furniture---Mold-Problem-and-Its-Solutions&id=5835181

 

Scroll down until you see the "Share this Article" box. In there you will see a link to EzinePublisher. Click that and you will see two large textboxes: one for Plain Text and the other for HTML.

 

Depending on what I need, I will scrape from either of those two.
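
For anyone doing this outside UBot, here is a rough Python sketch of pulling the contents of those two boxes out of a saved copy of the page. It assumes the Plain Text and HTML boxes are ordinary <textarea> elements, which nothing in this thread confirms, and the names TextareaCollector and extract_textareas are made up for the example.

```python
from html.parser import HTMLParser


class TextareaCollector(HTMLParser):
    """Collects the text content of every <textarea> on a page."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.areas = []    # finished textarea contents, in page order
        self._buf = None   # chunks of the textarea being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "textarea":
            self._buf = []

    def handle_data(self, data):
        # Only keep text that sits inside a <textarea>.
        if self._buf is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "textarea" and self._buf is not None:
            self.areas.append("".join(self._buf))
            self._buf = None


def extract_textareas(page_html):
    """Return the contents of all textareas found in page_html."""
    parser = TextareaCollector()
    parser.feed(page_html)
    parser.close()
    return parser.areas
```

If the assumption holds, the first item returned would be the plain-text version (paragraphs already separated by blank lines) and the second the HTML version, but check the order on a real page before relying on it.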


I reconfigured my scraper to work.

 

But I just want the actual article and not the author, title and other junk data.

 

How do I clean this up? A regex statement would be appreciated. Thanks again!

Here is what the scraped HTML looks like:

<head><title>A Copy of the Dash Diet - Tips on Switching to the Dash Diet Meal Plans</title></head>
<body bgcolor="#FFFFFF" leftmargin="0" topmargin="0" marginwidth="0" marginheight="0">
<p>A Copy of the Dash Diet - Tips on Switching to the Dash Diet Meal Plans<br>
By <a href="http://ezinearticles.com/?expert=Steven_McIntosh" >Steven McIntosh</a></p>

BLAH BLAH BLAH

<p>
Article Source: <a href="http://ezinearticles.com/?expert=john_doe" target="_new">http://EzineArticles.com/?expert=Steven_McIntosh</a>
<br><a href="http://ezinearticles.com/?A-Copy-of-the-Dash-Diet---Tips-on-Switching-to-the-Dash-Diet-Meal-Plans&id=2381113" target="_new">http://EzineArticles.com/?A-Copy-of-the-Dash-Diet---Tips-on-Switching-to-the-Dash-Diet-Meal-Plans&id=2381113</a>
<br>
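
A possible regex cleanup for the HTML above, sketched in Python since the thread has no UBot version. It only assumes the layout shown in this one sample (a head block, a first paragraph with the title and a "By" author link, and an "Article Source:" footer), so other articles may need different patterns. The function name extract_article_body is made up for the example.

```python
import re


def extract_article_body(html):
    """Strip the head, title/author paragraph and source footer from a scrape."""
    flags = re.IGNORECASE | re.DOTALL

    # Drop the whole <head>...</head> block (it only holds the title).
    html = re.sub(r"<head\b.*?</head>", "", html, flags=flags)

    # Drop the opening and closing <body> tags but keep what is between them.
    html = re.sub(r"</?body\b[^>]*>", "", html, flags=flags)

    # Drop the first paragraph shaped like "<p>Title<br> By <a ...>Author</a></p>".
    byline = r"<p>(?:(?!</p>).)*?<br\s*/?>\s*By\s*<a\b[^>]*>[^<]*</a>\s*</p>"
    html = re.sub(byline, "", html, count=1, flags=flags)

    # Cut everything from "Article Source:" (and the <p> just before it) to the end.
    html = re.sub(r"(?:<p>\s*)?Article Source:.*\Z", "", html, flags=flags)

    return html.strip()
```

The result still contains whatever markup the article body itself uses, so any <p> tags inside the body survive and can be turned into blank-line paragraphs in a later step.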


Here is what I do.

 

I loaded the article into a list. Next, I set up a loop to cycle through each line and saved what I wanted to a new list. Then I saved the new list to a text file.
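
The same list-and-loop idea can be sketched outside UBot too. Below is a rough Python version, not the actual UBot script: the helper names and the is_article_line rule are hypothetical, and the rule is only tuned to the sample scrape posted earlier in this thread (it drops blank lines, lines that start with a tag, the "By" line and the "Article Source:" line).

```python
from pathlib import Path


def is_article_line(line):
    """Hypothetical rule: keep a line unless it looks like junk in the sample."""
    text = line.strip()
    if not text:
        return False                      # blank line
    if text.startswith("<"):
        return False                      # line that starts with an HTML tag
    if text.startswith(("By ", "Article Source:")):
        return False                      # author line or source footer
    return True


def keep_wanted_lines(lines, is_wanted=is_article_line):
    """Loop over every line and copy the wanted ones into a new list."""
    kept = []
    for line in lines:
        if is_wanted(line):
            kept.append(line)
    return kept


def filter_file(src, dst):
    """Load src into a list, filter it, and save the new list to dst."""
    lines = Path(src).read_text(encoding="utf-8").splitlines()
    kept = keep_wanted_lines(lines)
    Path(dst).write_text("\n".join(kept) + "\n", encoding="utf-8")
    return kept
```

Swap in your own is_wanted function for real articles, since a body that wraps every paragraph in <p> tags would be thrown away by the rule above.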

 

I am not a Regex person. It might be possible with Regex; I just don't know that piece.

 

For me, I am a UBot purist. I do not use Javascript or Regex unless I am backed into a corner. But that's me.

