Welcome, Guest
Python Scripts for ComicRack

TOPIC: Bonelli (www.sergiobonelli.it) Scraper v4 BETA (Italian publisher)

Re: Bonelli (www.sergiobonelli.it) Scraper v3 BETA (Italian publisher) 3 years 8 months ago #38926

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
I tested on the previously highlighted bugs.

The results are as follows:
  1. Selecting "Pubblicato" still leads to error message and no date being picked up
  2. Magico Vento 101+ still need to be called "Magico Vento Bimestrale" to be picked-up (setting format to Bimestrale is not sufficient)
  3. Le Storie 7 is correctly picked (not as 17)
  4. Tex 17 is correctly picked up (not as Maxi Tex 17)
  5. Julia 4 is correctly picked up (not as 184)
  6. Speciale Dampyr 8: Cover Artist is still picked up as Enea RiboldiOrrore tra gli Amish
    This appears to be true for all Speciale Dampyr, i.e. Cover Artist is set to "ArtistTitle" - could it be an opportunity to extract the real title of the speciale?
  7. Quickscraper works

As always happy to do any testing needed.
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v3 BETA (Italian publisher) 3 years 8 months ago #38927

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
I see... The Bimestrale thing it's not really possible to correct, as the series' name changes in the site.

I'll see if the split of Speciale Dampyr's cover author from the title is easy to do... It is not the only case i had to intercept.

ANd I'll work on that damn date thing, I have maybe some ideas...

Thanks,

M
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v3 BETA (Italian publisher) 3 years 7 months ago #39149

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
Thank you

By the way I have a new bug to report:

Dylan Dog Color Fest 11 (http://www.sergiobonelli.it/scheda/10125/Dylan-Dog-Color-Fest-11.html)
  • Writer is set to Poltergeist, serial killer, un sinistro obitorio, una strana due-ruote tutti a colori! which is the strapline
  • Penciller is set to Poltergeist, serial killer, un sinistro obitorio, una strana due-ruote tutti a colori! which is the strapline
  • Summary is set to Per il verso sbagliato which is the first title of the 4 stories
The administrator has disabled public write access.

Bonelli (www.sergiobonelli.it) Scraper v3.1 BETA (Italian publisher) 3 years 6 months ago #39327

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
Not so much a bug but for your information.

Scraping Lukas #1.
  • The series is currently not being found even after a refresh (not too much of an issue as I can do manual editing of the collane file)
  • Quickscraper with www.sergiobonelli.it/scheda/18468/Deathropolis.html works, however
  • it does not capture any of the following data:
    Uscita: 21/03/2014
    Soggetto: Michele Medda
    Sceneggiatura: Michele Medda
    Disegni: Michele Benevento
    Copertina: Michele Benevento

Edit
Actually the information above is missing also when I quickscrape Le Storie 15 (www.sergiobonelli.it/scheda/10462/I-fiori-del-massacro.html) - so it might be the expected behaviour?
Last Edit: 3 years 6 months ago by rmagere.
The administrator has disabled public write access.

Bonelli (www.sergiobonelli.it) Scraper v3.1 BETA (Italian publisher) 3 years 5 months ago #39892

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
Just come across a scraping issue:

When I scrape Martin Mystere 5 (with format set to "Mensile") instead of getting La casa ai confini del mondo it scrapes Maxi Martin Mystere 5 which is currently in Edicola (and not in Arretrati).

Scraping this number was successful a few months ago before the new Maxi had been released.

Thanks - and as always happy to test any new version of the scraper.
The administrator has disabled public write access.

Bonelli (www.sergiobonelli.it) Scraper v3.1 BETA (Italian publisher) 3 years 5 months ago #39901

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
Hi, i just came back from a beach... This week, still off, will try to kick this.

Thanks!

M
The administrator has disabled public write access.

Bonelli (www.sergiobonelli.it) Scraper v3.1 BETA (Italian publisher) 3 years 5 months ago #39905

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
mizio66 wrote:
Hi, i just came back from a beach...
Ah I am envious now... ;)
The administrator has disabled public write access.

Bonelli (www.sergiobonelli.it) Scraper v3.1 BETA (Italian publisher) 3 years 5 months ago #39953

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
New version available at:

Link


- Various fixes
- added an additional code to intercept date issue for RMagere

Please test it |

M
Last Edit: 3 years 5 months ago by mizio66.
The administrator has disabled public write access.
The following user(s) said Thank You: rmagere, luke_70it

Bonelli (www.sergiobonelli.it) Scraper v3.1 BETA (Italian publisher) 3 years 5 months ago #39975

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
Thanks - look forward to testing it this weekend!

Just one note though: I get a 404 error when I try the link you have shared.
The administrator has disabled public write access.

Bonelli (www.sergiobonelli.it) Scraper v3.1 BETA (Italian publisher) 3 years 5 months ago #39977

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
rmagere wrote:
Thanks - look forward to testing it this weekend!

Just one note though: I get a 404 error when I try the link you have shared.

edited

M
The administrator has disabled public write access.
The following user(s) said Thank You: rmagere
Time to create page: 0.245 seconds

Who's Online

We have 227 guests and 2 members online