Amazon crawl: 2 versions of the page

Trptc

Newbie
Registered: 16.04.2018
Messages: 3
Thanks: 0
Points: 1
Hi everybody!

I'm having trouble crawling Amazon: the page (and HTML) that Amazon returns can be one of 2 versions:
- The first is the one served to everybody.
- The second is the one served to the Googlebot user agent.

I can't easily reproduce which one will be returned while ZennoPoster is running.
Once an instance starts, it keeps returning the same version of the page.

I use the default UserAgent values in the project.
I had no problems for 2 months (I only ever got the first version of the page).

I hit the same problem even if I create a new project from scratch.
For information: I use PIA as my VPN.

Has anyone run into this problem?

Thank you very much!
 

Trptc

Newbie
Registered: 16.04.2018
Messages: 3
Thanks: 0
Points: 1
I found a workaround: it works when I delete the cookies.
But I don't understand why Amazon gives me 2 versions of its website.
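
Outside ZennoPoster, the cookie workaround above can be sketched in plain Python. This is only an illustration using the standard library; the helper names `new_session` and `reset_cookies` are made up for this example and are not part of any ZennoPoster API.

```python
import http.cookiejar
import urllib.request


def new_session():
    """Return (opener, jar): an opener that stores cookies in an empty jar."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(jar)
    )
    return opener, jar


def reset_cookies(jar):
    """Drop every stored cookie and return how many are left (normally 0)."""
    jar.clear()
    return len(jar)
```

Calling `reset_cookies(jar)` before each crawl run (or simply building a new session) means every run starts without the cookies left by the previous one, which is what deleting cookies in the project achieves.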
 

lokiys

Moderator
Registered: 01.02.2012
Messages: 4 812
Thanks: 1 187
Points: 113
Hi. Try using up-to-date user agents, or use the newest Zenno version, where the emulated user agents match the newest browser versions.
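
For anyone crawling outside ZennoPoster, a minimal sketch of sending an explicit User-Agent with Python's standard library follows. The `USER_AGENT` value is a deliberate placeholder, not a real browser string, and `build_request` is a hypothetical helper name.

```python
import urllib.request

# Placeholder only: replace with the User-Agent string of a current browser release.
USER_AGENT = "Mozilla/5.0 (placeholder; replace with a current browser string)"


def build_request(url, user_agent=USER_AGENT):
    """Build a request that carries an explicit User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})
```

Keeping the string in one constant makes it easy to update when browsers release new versions.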
 

Trptc

Newbie
Registered: 16.04.2018
Messages: 3
Thanks: 0
Points: 1
Thank you lokiys. That's what I did yesterday and it's much better.
 

MD. Shamid Islam

Newbie
Registered: 13.02.2019
Messages: 3
Thanks: 0
Points: 1
Thank you
 

russya

Client
Registered: 08.07.2014
Messages: 743
Thanks: 78
Points: 28
I also get 2 versions of the site, so I wrote my own parser that handles both. The program checks which version of the site it received and parses it accordingly.
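
The detect-then-dispatch idea can be sketched roughly as below. This is not russya's actual parser: the layout markers and the title patterns are invented for illustration, and you would need to replace them with strings that really distinguish the two pages you receive.

```python
import re

# Hypothetical markers: pick a string that appears in only one of the two layouts.
LAYOUT_MARKERS = {
    "layout_a": 'data-layout="a"',
    "layout_b": 'data-layout="b"',
}


def detect_layout(html):
    """Return the name of the first layout whose marker occurs in the HTML."""
    for name, marker in LAYOUT_MARKERS.items():
        if marker in html:
            return name
    return None


def parse_layout_a(html):
    # Hypothetical pattern for the first version of the page.
    match = re.search(r'<h1 class="title-a">(.*?)</h1>', html, re.S)
    return match.group(1).strip() if match else None


def parse_layout_b(html):
    # Hypothetical pattern for the second version of the page.
    match = re.search(r'<span class="title-b">(.*?)</span>', html, re.S)
    return match.group(1).strip() if match else None


PARSERS = {"layout_a": parse_layout_a, "layout_b": parse_layout_b}


def parse_page(html):
    """Detect which version was served and run the matching parser."""
    layout = detect_layout(html)
    if layout is None:
        raise ValueError("unknown page layout")
    return PARSERS[layout](html)
```

Raising an error on an unrecognized layout, rather than guessing, makes it obvious when the site starts serving a third version.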
 
