Python Scripts for ComicRack

TOPIC: Diabolik (www.diabolik.it.it) Scraper v1.6 (Italian comic)

Diabolik (www.diabolik.it.it) Scraper v1.6 (Italian comic) 5 years 8 months ago #21637

  • mizio66
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
- NEW Version 1.6 -

A script to scrape data from www.diabolik.it.

It will scrape (read, retrieve and save) data for eComics/fileless comics from the www.diabolik.it website, the reference for one of the oldest comic books in Italy.

It is (almost) fully configurable and, despite being a beta, should be able to scrape a good 90% of the available issues.

Give it a try and report any bugs or suggestions; they're always welcome!

Enjoy !

M

Scraper 1.6

Manual 1.6
Last Edit: 11 months 1 week ago by mizio66.
The administrator has disabled public write access.
The following user(s) said Thank You: rmagere, luke_70it, duque

Re: Diabolik (www.diabolik.it.it) Scraper v1.00 BETA (Italian comic) 5 years 8 months ago #22489

  • rmagere
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
Thanks for another great scraper for Italian comic readers :cheer:
Will try it and let you know how it goes.

Re: Diabolik (www.diabolik.it.it) Scraper v1.00 BETA (Italian comic) 5 years 7 months ago #22538

  • rmagere
I seem not to have understood the manual properly: so far I have not been able to scrape a single Diabolik issue.

To keep it simple, let's look at the March 2012 issue (http://www.diabolik.it/popup_pubblicazioni_dk.asp?id=954).

This is issue 3 of year 51. As such I have inserted the following information in the comic info:
  • Series = Diabolik
  • Year = 51
  • Issue = 3
I then ran the scraper and the result is that the issue was "Ignored".

I have attached screenshots of my settings for the scraper.
I also know that the Collane_Diabolik includes the information for the comic in question (see spoiler box)


Any idea on what I am doing wrong?




Re: Diabolik (www.diabolik.it.it) Scraper v1.00 BETA (Italian comic) 5 years 7 months ago #22541

  • mizio66
For a moment I thought I had written the guideline/manual wrongly...

But no... there is a misunderstanding (anyhow, I'll make it clearer in the next release of the manual): YEAR is not the field to use... it is VOLUME.

So, your 51 should go in the volume, not the year... give it a second try...
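
A minimal sketch of why the field matters, assuming the scraper matches a comic on Series, Volume and Issue. The `ISSUE_INDEX` table and the `find_issue_url` helper are hypothetical and not taken from the actual script; only the page id 954 and the URL pattern come from the link quoted earlier in the thread:

```python
# Hypothetical index: (series, volume, issue) -> page id on the website.
# Only the 954 entry comes from the thread; the structure is an assumption.
ISSUE_INDEX = {
    ("Diabolik", 51, 3): 954,
}

BASE_URL = "http://www.diabolik.it/popup_pubblicazioni_dk.asp?id={0}"


def find_issue_url(series, volume, issue):
    """Return the page URL for a comic, or None if it cannot be matched."""
    page_id = ISSUE_INDEX.get((series, volume, issue))
    if page_id is None:
        return None  # a scraper would typically report this comic as "Ignored"
    return BASE_URL.format(page_id)


# 51 entered in Year leaves Volume empty, so nothing matches.
wrong = find_issue_url("Diabolik", None, 3)

# 51 entered in Volume matches the March 2012 issue.
right = find_issue_url("Diabolik", 51, 3)
```

With the year number in Volume the lookup succeeds; with Volume left empty it returns `None`, which would correspond to the "Ignored" result described above.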

Ciao!

M

Re: Diabolik (www.diabolik.it.it) Scraper v1.00 BETA (Italian comic) 5 years 7 months ago #22550

  • rmagere
Thanks that worked flawlessly :cheer:

I think the manual (v1.00b) is actually not very clear on how to enter the information. The only section I found that describes what should be present, and where, is "Tips for Scraping", and as highlighted in the image it seems to imply that the year number should be inserted into the Year field rather than the Volume field.

I will now test it with more issues and let you know if I encounter any bugs or have suggestions for future development.

Thanks again :)



Re: Diabolik (www.diabolik.it.it) Scraper v1.00 BETA (Italian comic) 5 years 7 months ago #22551

  • mizio66
Yes, that version is probably outdated... I corrected it, and with the new release of the scraper it will be clearer!!

thanks !

M

Re: Diabolik (www.diabolik.it.it) Scraper v1.20 BETA (Italian comic) 4 years 11 months ago #28263

  • mizio66
A new release is up! v1.2b!!

See the first post for the link, and read the manual!!!

Change log:
- website changed, scraper adjusted (the old version no longer worked)
- adjustments to scraping and general improvements

Enjoy,

M

Re: Diabolik (www.diabolik.it.it) Scraper v1.20 BETA (Italian comic) 4 years 3 months ago #36123

  • rmagere
Hi

I think the new versions of ComicRack have broken this script. Specifically, I have noticed the following issues:
  • It looks like the month information is scraped into the year field
  • It looks like the day information is scraped into the month field
  • The year is not scraped anymore
  • Issue identification is a bit hit and miss (i.e. even though the books have been correctly named the scraper might ignore the comic or scrape the wrong one)
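
The first three symptoms look like date parts landing one field off. As a minimal sketch of a defensive approach, assuming the date arrives as a `dd/mm/yyyy` string (the format, the `parse_cover_date` helper and the sample value are all hypothetical, not taken from the script or the website), each part can be unpacked by name and range-checked so that a shifted field fails loudly instead of being saved:

```python
def parse_cover_date(text):
    """Split a 'dd/mm/yyyy' string into named Year, Month and Day values."""
    parts = text.strip().split("/")
    if len(parts) != 3:
        raise ValueError("expected dd/mm/yyyy, got %r" % text)
    # Unpack by name so each value ends up in the field it belongs to.
    day, month, year = (int(p) for p in parts)
    # A month above 12 or a day above 31 means the parts are out of order.
    if not (1 <= month <= 12 and 1 <= day <= 31):
        raise ValueError("day or month out of range in %r" % text)
    return {"Year": year, "Month": month, "Day": day}


# Sample value is an assumption; only "March 2012" appears in the thread.
fields = parse_cover_date("01/03/2012")
```

A string in a different order, such as `2012/03/01`, raises `ValueError` here rather than silently writing the year into the Day field.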

Let me know if you want me to run any debug logs or anything else that could help.

Re: Diabolik (www.diabolik.it.it) Scraper v1.20 BETA (Italian comic) 4 years 3 months ago #36133

  • mizio66
I'll check it... Thanks!

Re: Diabolik (www.diabolik.it.it) Scraper v1.30 BETA (Italian comic) 4 years 3 months ago #36274

  • mizio66
A new release is up! v1.3b!!

Change log:
- date scraping corrected
- adjustments to scraping and general improvements

Manual

Scraper

Enjoy,

M