Welcome, Guest
Python Scripts for ComicRack

TOPIC: Bedetheque Scraper 2 - v4.9

Bedetheque Scraper 2 - v4.9 1 month 1 week ago #48798

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 453
  • Thank you received: 143
  • Karma: 67
Hi, ISBN has not been touched... by me. So, if BDTQ changed some of the HTML, might be not working.
As I have not a lo tod time to check the script, I cannot promise when, but I’ll have a look. I scraped quite some album last weeks, no issues, but I will check the things above.
Cheers!
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 1 month 1 week ago #48800

  • Chgros
  • Chgros's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 2
  • Karma: 0
Well i am no python developper at all but i repaired the ISBN data

I don't know if there are other changes in the HTML on the Bedetheque side.


File : BedethequeScraper2.py

Changing
ALBUM_ISBN_PATTERN = r'<label>ISBN\s:\s.*?\">(.*?)<'
for
ALBUM_ISBN_PATTERN = r'<label>ISBN\s:\s?</label>(.*?)</'
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 1 month 4 days ago #48849

  • Chgros
  • Chgros's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 2
  • Karma: 0
it seems something changed in Bedetheque HTML (or quickscrape is broquen)

If i use quickscrape, The title and series are wrong
If i scape the serie , i get the good informations.

expl with
www.bedetheque.com/BD-Star-Wars-Poe-Dame...on-Black-294102.html

Also if i try to quickscrappe some albums, it freez
exmpl : www.bedetheque.com/BD-Nyarlathotep-60843.html

Can someone look at this ?
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 1 month 4 days ago #48850

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 453
  • Thank you received: 143
  • Karma: 67
That is not an album link... Please check the manual for instructions...
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 14 hours 9 minutes ago #49252

  • pascal
  • pascal's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 1
  • Karma: 0
Hello,

First, thank you for the script.
I did have some try on few series, but I must admit I found the results quite poor.

----
The major problem I'm experiencing is a not found serie or book, ththat the scrapper seems to not found any reference I select in CR.
Well to be honest he found one (Incal (L')
But this typically end like this:
Calling 'BD_start'...

=========================- Begin! -=========================

vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
Nom Série = J.K.J. Bloche - T09 - Absent No = []
Recherche sur le Web avec Nom de Série: J.K.J. Bloche - T09 - Absent
Recherche générique dans www.bedetheque.com/search/tout?RechTexte...20absent&RechWhere=0
# Temps nécessaire: 0:00:02

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

My first thought is I'm doing wrong stuff.
I read the doc,
I went through this topic in the forum,
I try to understand the script thanks to the debug mode,
I understand it works fine with quickscrapper button. So I admit this is something to deal with the fact the tool does not recognize the book on the bedetheque

Should I do some kind of pre processing stuff on the titile of the file (the book) or is there anything I might have not done before launching the tool on my selection of book ?

Any thoughts ?

Thank you in advance.
Pascal
The administrator has disabled public write access.
Time to create page: 0.159 seconds

Who's Online

We have 205 guests and 3 members online