Welcome, Guest
Python Scripts for ComicRack

TOPIC: Bedetheque Scraper 2 - v4.9

Bedetheque Scraper 2 - v4.9 1 year 3 months ago #49451

  • plipplop
  • plipplop's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 7
  • Karma: 1
Thanks guys. I emailed the Bédéthèque and they unblocked me. I set the timeout back to the default values and it works like a charm now.
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 1 year 3 months ago #49491

  • Nocta
  • Nocta's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 10
  • Thank you received: 1
  • Karma: 0
Hi guys,
I'm a big fan of the bedetheque scraper but there is one thing I didn't succeed to do with it.
If I scrap a comic with comicvine, then it's removed from the Unscraped playlist. And I'd like to do that with the bedetheque scraper so my french comics are not stuck in this playlist.

Is this possible?
Thanks!
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 1 year 3 months ago #49494

  • rmagere
  • rmagere's Avatar
  • Offline
  • Gold Boarder
  • Posts: 284
  • Thank you received: 33
  • Karma: 8
Nocta wrote:
If I scrap a comic with comicvine, then it's removed from the Unscraped playlist. And I'd like to do that with the bedetheque scraper so my french comics are not stuck in this playlist.

The unscraped list must be a smart list you have created - you just need to change it to include recognition of a bedetheque marker. E.g. I use both the bonelli and disney scraper - my unscraped list is based on comics not having any of the following text in any of the fields: cvdb, coa, bonelli.
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 1 year 3 months ago #49497

  • Nocta
  • Nocta's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 10
  • Thank you received: 1
  • Karma: 0
You are totally right rmagere, thanks for the tip!
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 1 year 2 months ago #49601

  • misakitchi
  • misakitchi's Avatar
  • Offline
  • Senior Boarder
  • Posts: 49
  • Karma: -1
Chgros wrote:
Well i am no python developper at all but i repaired the ISBN data

I don't know if there are other changes in the HTML on the Bedetheque side.


File : BedethequeScraper2.py

Changing
ALBUM_ISBN_PATTERN = r'<label>ISBN\s:\s.*?\">(.*?)<'
for
ALBUM_ISBN_PATTERN = r'<label>ISBN\s:\s?</label>(.*?)</'

Thanks Chgros ISBN scrape is now working! :)
Perhaps mizio66 can update the script?

Merci Chgros le scrape de ISBN fonctionne maintenant! :)
Peut-etre que mizio66 pourrait mettre à jour le script?


Now i have another issue: the "rescrape from web" is not working for me :(
Example: Giunchiglia
www.bedetheque.com/BD-Giunchiglia-Tome-1...d-Irlande-73034.html
Its always "ignored"... :(

Can you do something about it?


EDIT2: I find a little bug for ISBN
www.bedetheque.com/serie-26881-bd-betise...phixerox__10000.html
In ISBN: "<span itemprop="isbn">978-2-88890-369-7"
OK to fix it i put "Flip" in AltNumber
Last Edit: 1 year 2 months ago by misakitchi.
The administrator has disabled public write access.
Time to create page: 0.216 seconds

Who's Online

We have 127 guests and one member online