Welcome, Guest
Python Scripts for ComicRack

TOPIC: Bonelli (www.sergiobonelli.it) Scraper v4 BETA (Italian publisher)

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36342

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
It's a typo!!!!!! Shoulbe strtime... If you want to try to fix it, it's easy, strange it worked for me though...

I will fix and resend...

Thx

M
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36345

  • luke_70it
  • luke_70it's Avatar
  • Offline
  • Senior Boarder
  • Posts: 64
  • Thank you received: 1
  • Karma: 2
mizio66 wrote:
It's a typo!!!!!! Shoulbe strtime... If you want to try to fix it, it's easy, strange it worked for me though...

I will fix and resend...

Thx

M

Ciao Mizio,
I've modified bonelli.py and changed strtime to strftime.
Now I don't have previous error in debug log, but in the rename log I find this error:

Caught TypeError: expected datetime, got str
C:\Users\lazzarini\AppData\Roaming\cYo\ComicRack\Scripts\Bonelli Scraper\Bonelli.py,836,parseAlbumInfo

Unfortunately my programming skills are, at best, very poor... :-(

Why for you and rmagere it's ok and I have problems? I I'm afraid it's me!! Probably it's safer for me collecting stamps... :P

Edit: I'm using CR v 0.9.172. I don't think this may be an issue, but you never know...
Last Edit: 4 years 3 months ago by luke_70it.
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36356

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
I will test it on Dampyr to check if it is linked to the series (don't think so but just in case)
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36369

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
Actually it is not a typo, but me needing new glasses... :-)

the strptime statement is correct and it is working for me...

there must be smthg else, Luke... any sepcial album, issue... works with others...

thanks!

M
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36370

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
I have tried it on a few Dampyr issues (2-10) and it worked fine with me.
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36371

  • luke_70it
  • luke_70it's Avatar
  • Offline
  • Senior Boarder
  • Posts: 64
  • Thank you received: 1
  • Karma: 2
I tried with Shanghai Devil: same issue!

I redownloaded the script from here: docs.google.com/file/d/0Bzofljx4mD2xcWt2...THc/edit?usp=sharing

Is it the right one?

I removed script and re-installed: no luck.

Probably is something related to my configuration...

I run CR in debug mode, but there is nothing useful to understand my problem. BTW I paste the output:

Calling 'Bonelli_start'...
Compilation of 'C:\Users\lazzarini\AppData\Roaming\cYo\ComicRack\Scripts\Bonelli Scraper\Bonelli.py'
Configuration Used: True True True 1000000 1000000 EN
Collane_Bonelli.txt OK !
Series' Name = Shanghai Devil No = [1]
Searching the Web with the Series' Name: Shanghai Devil
Searching the Web with the Comic's [# 1]
** Comic's URL: www.sergiobonelli.it/sezioni/23/shanghai-devil
Format: Mensile
Notes: www.sergiobonellieditore.it - Friday 23 August 2013 09:09:45
Bonelli Scraper v2.0
Language: Italiano
Series : Shanghai Devil
Title:: Il trafficante d'oppio
Publisher: Sergio Bonelli Editore
Writing Log to C:\Users\lazzarini\AppData\Roaming\cYo\ComicRack\Scripts\Bonelli Scraper\Bonelli_debug_log.txt
Caught AttributeError : 'type' object has no attribute 'strptime'
1 - ('C:\\Users\\lazzarini\\AppData\\Roaming\\cYo\\ComicRack\\Scripts\\Bonelli Scraper\\Bonelli.py', 836, 'parseAlbumInfo')

It doesn't work for me even with other series (Dylan Dog, Tex) :( :(
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36402

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
let me retry :-)

The script scrapes the data from the webpage, in that case www.sergiobonellieditore.it/scheda/9114/...ficante-d-oppio.html

So, can you navigate there with your browser and then right click on the page and then view source ? Then copy all the code into a text file and send it to me :-)

I want to see what is different from my page !

thanks,

M
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36404

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
Having scrapped a few more comics and looked into them I am not sure that I am getting all the information either. (It feels that year/month/reviews/writers can be a bit hit and miss).

Just in case here is the source code for Shanghai Devil 1 http://pastebin.com/K6MigjMg.

Here is also the source code for Maxi Tex 16 (http://pastebin.com/78mT9NUh).
I am attaching it as it was the most recent scrape I have carried out and the result was as follows:

Data manually entered: Series=Maxi Tex, Number=16 no other information
Output box: 1 comic ignored
Data added by scraper: Format=Annuale, Publisher=Sergio Bonelli Editore, Title=MAXI TEX N. 16; Notes=http://www.sergiobonellieditore.it - Saturday 24 August 2013 20:18:45
Bonelli Scraper v2.0

No information was included for Year, Month, Soggetto e sceneggiatura, Disegni, Copertina, Summary


P.S. if it is easier to write in italian feel free to do so
Last Edit: 4 years 3 months ago by rmagere. Reason: added P.S.
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36406

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
I compared your code and my code, no difference. I scraped maxi tex 16 and had no issue...

this is strange...

Another attempt... start CR like this: "C:\Program Files\ComicRack\ComicRack.exe" -dso -ssc

debug will be activated. Activate the script debug in the options... then see what happens.

thanks!

M
The administrator has disabled public write access.

Re: Bonelli (www.sergiobonelli.it) Scraper v2 BETA (Italian publisher) 4 years 3 months ago #36412

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 223
  • Thank you received: 24
  • Karma: 7
Followed your suggestions the outcome is as follows:

The scraper log states (as per Luke's experience):
Warning: Spoiler! [ Click to expand ]


Comicrack's log states:
Warning: Spoiler! [ Click to expand ]


Hope this helps, otherwise let me know what else I should try (the above results were obtained after having uninstalled the script and having the reinstalled it again with a having had a couple of restarts in between uninstall and install)
The administrator has disabled public write access.
Time to create page: 0.213 seconds

Who's Online

We have 238 guests and one member online