Python Scripts for ComicRack

TOPIC: Bedetheque Scraper 2 - v4.9

Bedetheque Scraper 2 - v4.9 6 years 5 months ago #15097

  • mizio66
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
NEW VERSION 4.9

Thanks to ninjaw, atagal, Bert and Bruno for the thorough testing! This is a new release of the well-known Bedetheque.com scraper for BDFR (Bandes Dessinées Françaises).

It is an enhanced version of the original work by Franck (unreachable at the moment...), with some of the missing features added:
- All fields are now handled, including the ISBN and the covers for fileless entries;
- Renaming of some of the trickiest numbering schemes on Bedetheque.com (INT, HS, One Shot, x.y). For these, try using the same numbering as on the site;
- If a rename fails, check the naming on bedetheque.com and copy/paste the series title (my suggestion);
- Handles Le/Les/L', etc. at the beginning or the end of the series title. Some Dutch/German/English support as well;
- Debug log and rename log, each with an on/off switch;
- Settings accessible through the File/Automation/Configurer BD2 menu and the dropdown menu under the icon;
- Added some graphics/bars;
- Magazine (revues) scraping;
- Direct-link scraper for the toughest albums, with special edition identification;
- Enhanced search, so you are not forced to type all the French accents;
- Option to capitalize titles or not;
- Compatible with CR .178 and later.
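
To illustrate the article handling and the accent-insensitive search from the list above, here is a minimal sketch in plain Python 3. It is not the scraper's actual code: the function names, the article list and the "Title (article)" output form are assumptions made for the example, and the ComicRack version would need adapting to IronPython.

```python
import unicodedata

# Hypothetical article list (French, Dutch, German, English); the real
# scraper's list is not shown in this thread. "les" comes before "le" so
# the longer article is tried first.
ARTICLES = ["les", "le", "la", "l'", "de", "het", "een", "der", "die", "das", "the"]


def article_to_suffix(title):
    """Move a leading article to the end: 'Le Souffle du vent' -> 'Souffle du vent (Le)'."""
    title = title.strip()
    lowered = title.lower()
    for article in ARTICLES:
        if article.endswith("'"):
            # Elided article (L'), written without a following space.
            if lowered.startswith(article) and len(title) > len(article):
                rest = title[len(article):].strip()
                return "%s (%s)" % (rest, title[:len(article)])
        elif lowered.startswith(article + " "):
            rest = title[len(article) + 1:].strip()
            return "%s (%s)" % (rest, title[:len(article)])
    return title  # no leading article found


def strip_accents(text):
    """Drop combining accent marks so that 'étoiles' compares equal to 'etoiles'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


def search_key(title):
    """Build a lowercase, accent-free key for comparing series titles."""
    return strip_accents(article_to_suffix(title)).lower()
```

With this, `search_key("La Carotte aux étoiles")` and `search_key("Carotte aux etoiles (la)")` produce the same key, so a title typed without accents or with the article in front can still be compared with the form used on the site.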

See last post for more info on changes...

Enjoy,

M

BD2 Scraper 4.9

Manual 4.9
Last Edit: 4 months 5 days ago by mizio66.
The administrator has disabled public write access.
The following user(s) said Thank You: alex.braine, Ludwig, imtheyoyo, PHILIPPE, zetsubou.shin, adrien72, coloc, yipicai, Ptitprince, gmuret and 4 others

Re: Bedetheque Scraper 2 6 years 5 months ago #15098

  • 600WPMPO
  • Offline
  • Moderator
  • Posts: 3788
  • Thank you received: 557
  • Karma: 232
First a +1 karma to young mizio for the script..!

This is the first mizio script that has worked for me :P

I have a little suggestion..

It would be great if the script dialogs and the scraped data were in (translated) English. As it stands, it is not very useful for guys like me who love to read French/Dutch scanlations in English.

Now Playing: The ComicRack Manual (Online)

See my new comics & gadgets on: Tumblr!
Last Edit: 6 years 5 months ago by 600WPMPO.

Re: Bedetheque Scraper 2 6 years 5 months ago #15101

  • mizio66
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
Thanks for the "young"... but my nick gives away my birth year... :-(

About the English... ufff!!! With all the time it took me to get all the French accents right!

Will have to see what to do here... not easy (for me)...

ciao!

M

Re: Bedetheque Scraper 2 6 years 5 months ago #15102

  • 600WPMPO
  • Offline
  • Moderator
  • Posts: 3788
  • Thank you received: 557
  • Karma: 232
mizio66 wrote:
Thanks for the "young"... but my nick gives away my birth year...
We age in the forums with the number of posts made! :laugh:

Re: Bedetheque Scraper 2 6 years 5 months ago #15110

  • Ludwig
  • Offline
  • Gold Boarder
  • Posts: 197
  • Thank you received: 12
  • Karma: 9
Mizio

Great work. :)
Thanx for the effort.
You deserve Karma big time, for the scraper and because the reference site is not in your own language.
Respect :cheer:


Three remarks for users.
I would avoid leaving the 'of' field filled in.
Primo, the number is not always correct (Bedetheque includes Intégrales or DVDs in the count).
Secundo, it could give the wrong impression that a series is finished (although the script also uses the field 'Series complete' :woohoo: !!).
Tertio, the next time a new album appears and you add it, you'll also have to change the 'of' field of all the previous albums you already own (the same goes for the CBZ file names if they mention the 'of' number).

I don't see the benefit of using the Bookformat field myself; it gives 'Autre format', 'Format normal' or 'Grand format'. I only see the benefit if users possess a paper library, and even then... I never use this information.

People should be aware that the synopsis is mostly linked to the series, not to the specific album, so the content repeats itself quite often. I only use it in issue 1. For the others I use Ctrl-C / Ctrl-V from another site.


Well that's it.
Remember that you can easily change the info after scraping: select all the comics, call up the info pane and set the fields concerned to blank (empty).

Thanx again Mizio

Re: Bedetheque Scraper 2 6 years 5 months ago #15117

  • Ludwig
  • Offline
  • Gold Boarder
  • Posts: 197
  • Thank you received: 12
  • Karma: 9
Mizio

For your information and other users.

I had some trouble with the following one shots: Bri d'Alban, Carotte aux étoiles (la), Chienne de vie and Souffle du vent (le). These issues have no number, but with or without a number I got no scrape result. I checked on the site and in BDGest.

Then I tried putting the text underneath the title (site) in the number field.

I got Bri d'Alban to work by putting in the number field 'Aventure'.
For Chienne de vie 'One Shot' did the trick.
And for Souffle du vent (le), 'Adaptation'.
I wasn't able to scrape Carotte aux étoiles (la).

After the scrape, you can adjust the field.

;)

Re: Bedetheque Scraper 2 6 years 5 months ago #15118

  • mizio66
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
Thanks Ludwig, also for the karma ;-)

About your first post, I'm working on enhancing the scraping config so that everyone can choose what to scrape and what not to... won't be long.

And yes, 600, English will be part of it ;-)

Re: the last comments, I strongly suggest always using the same notation as bedetheque.com... One Shot, INT, HS, HS01, 19.1, etc... so your findings with 'Aventure', etc. are right. I have to say bedetheque.com does not always identify albums consistently on the site... with no API, you can only scrape (really?) the page source, and (even lately) a small change can disrupt the whole thing...
Not sure if anybody has better ideas, but I see no other way than this.
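
As a rough illustration of the notations mentioned above, here is a small Python 3 sketch that sorts a number-field value into categories. It is only an assumption of how such a check could look, not the scraper's own logic: the function name and the category labels are made up for the example, and reading INT as "intégrale" and HS as "hors série" is my interpretation.

```python
import re


def classify_number(number):
    """Classify a number-field value: One Shot, INT, HS, HS01, 19.1, plain numbers, labels."""
    value = number.strip()
    upper = value.upper()
    if upper == "ONE SHOT":
        return ("one_shot", None)
    # INT / HS, optionally followed by digits (e.g. HS01).
    match = re.match(r"^(INT|HS)\s*(\d+)?$", upper)
    if match:
        kind = "integrale" if match.group(1) == "INT" else "hors_serie"
        index = int(match.group(2)) if match.group(2) else None
        return (kind, index)
    # Sub-numbers in the x.y form, kept as text so "19.10" is not read as 19.1.
    if re.match(r"^\d+\.\d+$", value):
        return ("sub_number", value)
    if value.isdigit():
        return ("regular", int(value))
    # Anything else is treated as a free-text label such as 'Aventure'.
    return ("label", value)
```

For example, `classify_number("HS01")` gives `("hors_serie", 1)`, while a value such as 'Aventure' or 'Adaptation' falls through to the free-text `"label"` case, which matches Ludwig's workaround of typing the text shown under the title on the site.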

I will see if I have the time to write a small manual.... with some samples especially...

ciao and thanks again!

M

P.S. You can also try Dutch comics in the scraper...

Re: Bedetheque Scraper 2 - v1.1 6 years 5 months ago #15134

  • mizio66
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 451
  • Thank you received: 143
  • Karma: 67
The new version 1.1 is out !

Get it from the first post as usual.

Changes:
- French and English version !
- Full configure and setup from File/Automation menu.

A manual is available; I cannot post it here, so I will send it to 600 for publishing.

Enjoy,

M
The following user(s) said Thank You: 600WPMPO

Re: Bedetheque Scraper 2 - v1.1 6 years 5 months ago #15135

  • 600WPMPO
  • Offline
  • Moderator
  • Posts: 3788
  • Thank you received: 557
  • Karma: 232
+1 karma on its way...!

I suggest you unbold the letters in the config dialog..
mizio66 wrote:
A manual is available; I cannot post it here, so I will send it to 600 for publishing.
:P

Re: Bedetheque Scraper 2 - v1.1 6 years 5 months ago #15153

  • {Oo}
  • Offline
  • Platinum Boarder
  • The Chewie is a lie !
  • Posts: 672
  • Thank you received: 41
  • Karma: 15
I'll have to go through the FR version of the add-on to make sure there are no problems, but from the only screen I saw, there are going to be a few little gems :whistle:

It might take me some time since I don't have a lot of free time atm but I'll do my best and contact you mizio with the changes that need to be made ;)

Anyways, thx for updating the script and allowing me to scrape my library once again! :laugh:
Working on re-uploading the FR manuals.