Content parser

lokiys · 01.05.2019

Perfecto сказал(а):
Hi,

What is the best way to parse a content inside html balise (p, h1, h2...)
It must be compatible with different website.

Take a look at Html Agility pack https://html-agility-pack.net/

lokiys · 01.05.2019

Perfecto сказал(а):
Thanks for your answer. I don't understand how to use this with Zenno. The parsing data module in Zenno is not enough ?

You have asked about the best way to parse HTML, so I think HTML agility pack is the best way.
No, it is not the default option for zennoposter...
But sure You can use parsing module what is zenno default option as well.

lokiys · 03.05.2019

Not sure what You mean compatible with different sites as You scrape exact content from exact site usually.
But using Regex is fine for scrapping.
Take a look at Parse brick in zenno...
Right click on content You want to parse and ''Parse content''

lokiys · 03.05.2019

If we talk about scrapping data then how to use regex is what You have to learn. Wiki - Regex
Parse content is just helper action. So go with Regex and learn it.
About logic for Your scrapping, I can not answer much, because I do not understand what is Your goals...

Поиск

Content parser

Perfecto

Client

lokiys

Moderator

Perfecto

Client

lokiys

Moderator

Perfecto

Client

lokiys

Moderator

Perfecto

Client

lokiys

Moderator

Кто просматривает тему: (Всего: 1, Пользователи: 0, Гости: 1)