Help with some regex

shabbysquire · 11.09.2015

Having some issues in crafting a regex, and hope for some advice.

I'm scraping domains with the lookahead and lookbehind regex.

Here is a sample domains to capture:

Код:

http://domain.com/
https://domain.com/

http://www.domain.com/
https://www.domain.com/

And my regex:

Код:

(?<=https?://|https?://www.).*?(?="|</a>|/)

I only want to capture the main domain without the www., like: domain.com. My regex captures both www and non-www. I know that the regex engine is always eager to match anything, but would appreciate some help.

Cheers!

LexxWork · 11.09.2015

just remove it after regex )

shabbysquire · 11.09.2015

I have done that, but it's just a challenge for me to improve my regex skills. ;-)

The solution is to ignore the: www. So I need to find out what it is!

shabbysquire · 13.09.2015

Done:

Код:

(?<=https?://(?:www\.)?)(?!www\.).*?(?=['/"]|</a>)

Поиск

Help with some regex

shabbysquire

Client

LexxWork

Client

shabbysquire

Client

shabbysquire

Client

Кто просматривает тему: (Всего: 0, Пользователи: 0, Гости: 0)