Regex Stop Match Before Character

Ptrick125 · October 17, 2013

I am working on a url scraper, and the website tends to include the referall source, but I cannot include that when saving it to a file later on. I was successful in getting it to grab everything after the "/".

testwebsite.com/ubotstudio?ref=referral

This is what I have so far:

[^/]*$
testwebsite.com/ubotstudio?ref=referral

How should I have it match everything before the "?" sign?

HelloInsomnia · October 17, 2013

The simplest way to do it is this:

.*(?=\?)

But it would be nice to know if the format will always be like that so you can come up with something better. For example if the format is always like that (url without www or http) then you can also use something more specific:

[a-zA-Z0-9]+\.[a-zA-Z]{2,4}[a-zA-Z0-9\/\.-_+%!]+(?=\?)

Ptrick125 · October 17, 2013

The simplest way to do it is this:

.*(?=\?)

But it would be nice to know if the format will always be like that so you can come up with something better. For example if the format is always like that (url without www or http) then you can also use something more specific:

[a-zA-Z0-9]+\.[a-zA-Z]{2,4}[a-zA-Z0-9\/\.-_+%!]+(?=\?)

http://i.imgur.com/EeVAAOq.png

It works!

Sign In

Regex Stop Match Before Character

Recommended Posts

Ptrick125 45

Link to post

Share on other sites

HelloInsomnia 1103

Link to post

Share on other sites

Ptrick125 45

Link to post

Share on other sites

Join the conversation

Browse

Activity