UBot Underground


Need help scraping Twitch usernames from game directories.

As far as I can see, there's no single element I can scrape to get the Twitch usernames within a game directory.

The only way seems to be scraping the channel links ("/example") and adding them to a list.

The only problem is that scraping the href with a wildcard also picks up all the generic, unwanted links on the page.

I was hoping a regex expert could help me filter the href links on the page.

The page:

http://www.twitch.tv/directory

Any game category works.


You can use this:

add list to list(%usernames, $scrape attribute(<href=w"/*/profile">, "href"), "Delete", "Global")

Then use the $replace function to strip the parts you don't need, such as the / and /profile.
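For anyone who wants to see the filtering idea outside UBot, here is a minimal Python sketch. It assumes the channel links really have the form "/<name>/profile" suggested above and that names contain only letters, digits, and underscores. Neither assumption is confirmed by the page itself. The function name and the sample hrefs are made up for illustration.

```python
import re

# Assumption: channel links look like "/<name>/profile", where <name> is
# letters, digits, or underscores. Adjust the pattern if the markup differs.
CHANNEL_LINK = re.compile(r"^/([A-Za-z0-9_]+)/profile$")

def extract_usernames(hrefs):
    """Return usernames from matching hrefs, in order, without duplicates."""
    seen = set()
    names = []
    for href in hrefs:
        match = CHANNEL_LINK.match(href)
        if match and match.group(1) not in seen:
            seen.add(match.group(1))
            names.append(match.group(1))
    return names

# Hypothetical sample of scraped href values
sample = ["/directory", "/example/profile", "/jobs",
          "/some_streamer/profile", "/example/profile"]
print(extract_usernames(sample))
```

Capturing the name with a group filters out the generic links and removes the / and /profile parts in one step, so no separate replace pass is needed.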


That doesn't seem to work :l

Also, I don't think the replace function can remove wildcards, can it?

E.g. /profile/random-string
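A plain text replace cannot match a variable tail like that, but a regular-expression replace can. Below is a rough Python sketch of the idea. It assumes the unwanted part is "/profile" optionally followed by "/" and any string, which is a guess based on the example above. The function name is hypothetical.

```python
import re

# Assumption: the unwanted tail is "/profile", optionally followed by
# "/<anything>", at the end of the href.
PROFILE_TAIL = re.compile(r"/profile(?:/.*)?$")

def strip_profile_tail(href):
    """Drop a trailing "/profile" or "/profile/<anything>" and the leading slash."""
    return PROFILE_TAIL.sub("", href).lstrip("/")

print(strip_profile_tail("/example/profile/random-string"))
print(strip_profile_tail("/example/profile"))
```

The same pattern should carry over to any tool that offers a regex-based replace, although the exact function name and escaping rules will depend on the tool.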

 

Thanks for the help!

