UBot Underground

[BUY] Need to identify some regex for data recognition (pay hourly)



Hello guys..

I am using Aymen's HTTP Post plugin to GET data from a URL, and I need to parse the data into variables. I don't really have regex expertise, but I might learn from this.

I need to scrape 2 pages. An example of the data to get is:

<li>
<strong>Created:</strong>
January 19, 2012
</li>
<li>
<strong>Type:</strong>
Networking Group
</li>
<li>
<strong>Members:</strong>
2
</li>
</ul>

There are about 12 fields of data in total.

Please post how much time it might take you. I will pay via PayPal up front if you are not a new member.


I highly recommend you look into XPath for that kind of thing. For a lot of HTML/XML scraping, it works much better than regex.

 

Dan
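
For anyone following along outside UBot, here is a minimal sketch of the XPath idea in Python. It is only an illustration and not the plugin's API: `SAMPLE_HTML` and `parse_fields` are names made up for this example, and it assumes the list is well-formed enough to parse as XML (real pages usually need a proper HTML parser).

```python
import xml.etree.ElementTree as ET

# Sample list from the first post, wrapped in an opening <ul> so it parses as XML.
SAMPLE_HTML = """<ul>
<li>
<strong>Created:</strong>
January 19, 2012
</li>
<li>
<strong>Type:</strong>
Networking Group
</li>
<li>
<strong>Members:</strong>
2
</li>
</ul>"""


def parse_fields(html):
    """Return a dict mapping each <strong> label to the text that follows it."""
    root = ET.fromstring(html)
    fields = {}
    # ".//li" is the limited XPath subset that ElementTree supports.
    for li in root.findall(".//li"):
        label = li.find("strong")
        if label is None or label.text is None:
            continue  # skip list items that have no label
        key = label.text.strip().rstrip(":")
        # The value is the text that trails the closing </strong> tag.
        fields[key] = (label.tail or "").strip()
    return fields


if __name__ == "__main__":
    print(parse_fields(SAMPLE_HTML))
```

With the sample above, every field ends up in one dictionary keyed by its label, so all 12 fields could be read in a single pass instead of one pattern per field.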


There are many ways to do this, but all of them start with the XPath parser to isolate the unordered list. I used regex here, but I am sure there are other ways to do it, such as adding the items to a list or using XPath again.

comment("example using standard ul")
set(#html, "<ul>
<li>
<strong>Created:</strong>
January 19, 2012
</li>
<li>
<strong>Type:</strong>
Networking Group
</li>
<li>
<strong>Members:</strong>
2
</li>
</ul>", "Global")
set(#target_data, $trim($plugin function("HTTP post.dll", "$xpath parser", #html, "//ul/li", "InnerText")), "Global")
set(#created, $find regular expression(#target_data, "(?<=Created\\:\\n).*"), "Global")
comment("Now just replace \"Created\" with \"Type\" or anything else")
set(#type, $find regular expression(#target_data, "(?<=Type\\:\\n).*"), "Global")
set(#members, $find regular expression(#target_data, "(?<=Members\\:\\n).*"), "Global")
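
The lookbehind pattern above can also be tried outside UBot. Below is a rough Python sketch, assuming the XPath step returns text with each label on its own line followed by its value. The plugin's exact `InnerText` output is not shown in this thread, so `TARGET_DATA` is a guess, and `find_field` is a hypothetical helper.

```python
import re

# Assumed shape of the text returned by the XPath step (not confirmed by the thread).
TARGET_DATA = "Created:\nJanuary 19, 2012\nType:\nNetworking Group\nMembers:\n2"


def find_field(text, label):
    """Return the line that follows 'label:' in text, or None if it is absent."""
    # (?<=...) is a lookbehind: it requires 'label:' plus a newline just before
    # the match without including them in the result. '.*' stops at the next
    # newline, so only the value line is captured.
    pattern = r"(?<=" + re.escape(label) + r":\n).*"
    match = re.search(pattern, text)
    return match.group(0).strip() if match else None


if __name__ == "__main__":
    for name in ("Created", "Type", "Members"):
        print(name, "=", find_field(TARGET_DATA, name))
```

Python only accepts fixed-width lookbehinds, which holds here because each label is a literal string. Swapping the label is the same trick as replacing "Created" with "Type" in the UBot version.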


Damn, I already paid north_star and he left me hanging with the job.

I wish I had been a little more patient.

Thanks a lot, guys, for this great help.

Thanks a lot, HelloInsomnia. I will file a dispute against this guy and send you a donation for your valuable help.
