Regex works but not the scraping

peterparker

Client
Joined
30.05.2013
Posts
20
Thanks
0
Points
0
Hi,

I am trying to scrape search engine results. The regex works in the regex designer, but not when I do a test run.
What is the problem?

Regex for Bing: (?<=<div\ class="sb_tlst"><h3><a\ href=").*?(?=")
(--> Works like a charm in the regex designer.)

I use this regex in a "Tab data" action:
What to take: source
Which tab: active
Result: (regex above)
What to take: all
--> put the result into list xyz

When I test it (Bing with any search), I get the following error:
Text parsing by regular expression action, action group id: 4c166c1c-62dd-4784-81b0-d02912eac618, Result: nothing was found

What am I doing wrong? Any suggestions?
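
A pattern like the one above can pass in a designer and still return nothing at run time if the text being parsed is not byte-for-byte the text that was tested. The following is a minimal Python sketch of that effect, not ZennoPoster code. The two sample strings are hypothetical markup invented for illustration (not real Bing output), and `extract_links` is a made-up helper name.

```python
import re

# The pattern from the question, unchanged.
BING_PATTERN = r'(?<=<div\ class="sb_tlst"><h3><a\ href=").*?(?=")'

# Hypothetical markup (assumption, not real Bing output): the same result link
# as it might look in the text pasted into the designer and in the text the
# action actually receives. The only difference is a line break after the <div>.
as_tested = '<div class="sb_tlst"><h3><a href="http://example.com/a">A</a></h3></div>'
as_parsed = '<div class="sb_tlst">\n<h3><a href="http://example.com/a">A</a></h3></div>'


def extract_links(pattern, text):
    """Return every non-overlapping match of pattern in text."""
    return re.findall(pattern, text)


print(extract_links(BING_PATTERN, as_tested))  # ['http://example.com/a']
print(extract_links(BING_PATTERN, as_parsed))  # []
```

Because the lookbehind spells out the whole `<div ...><h3><a href="` prefix literally, any extra whitespace or attribute in that prefix is enough to produce "nothing was found".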
 

rostonix

Well-known member
Joined
23.12.2011
Posts
29 067
Thanks
5 707
Points
113
Maybe you took the page source for the regex builder from a regular browser and not from PM (ProjectMaker)?
 

peterparker

I tried both: the source from my regular browser and the source from the PM browser. The regex works with both.
 

peterparker

OK, here are some pictures. The regex works fine:
p1.jpg

This is the short project.
As you can see, I use the same regex that works in the regex designer:
projekt_overview.jpg

This is the error log (ignore the first entry, which is from another test some hours ago):
log.jpg


Where is the error? I use no proxy, and the Bing SERP in the Zenno ProjectMaker browser looks correct, so there should be 10 results.
Any ideas?

By the way, I have the same problem with Google. The regex works fine, but scraping does not work with the project shown above.
 

rostonix

(?<=<h3><a\ href=").*?(?=")
I scraped the DOM, not the Source.
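
To illustrate why the shorter pattern is more forgiving, here is a minimal Python sketch comparing the two regexes from this thread. It is not ZennoPoster code. The `dom_like` string is hypothetical markup made up for this example: how the DOM text actually differs from the Source in ProjectMaker is not shown in the thread, so the extra `id` attribute is only an assumption. `collect` is an illustrative helper name.

```python
import re

# Both patterns are copied from the thread.
STRICT = r'(?<=<div\ class="sb_tlst"><h3><a\ href=").*?(?=")'
LOOSE = r'(?<=<h3><a\ href=").*?(?=")'

# Hypothetical markup (assumption): the wrapper <div> carries an extra
# attribute, so the literal prefix expected by STRICT never occurs.
dom_like = (
    '<div class="sb_tlst" id="r1"><h3><a href="http://example.com/1">One</a></h3></div>'
    '<div class="sb_tlst" id="r2"><h3><a href="http://example.com/2">Two</a></h3></div>'
)


def collect(pattern, text):
    """Return all matches, like 'What to take: all' into a list."""
    return re.findall(pattern, text)


print(collect(STRICT, dom_like))  # []
print(collect(LOOSE, dom_like))   # ['http://example.com/1', 'http://example.com/2']
```

The shorter lookbehind depends only on the `<h3><a href="` part, so changes to the surrounding `<div>` no longer break the match. The trade-off is that it would also pick up any other `<h3>` link on the page.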
 
