Welcome, Guest
Python Scripts for ComicRack

TOPIC: Bedetheque Scraper 2 - v4.9

Bedetheque Scraper 2 - v4.9 beta 11 months 5 days ago #47989

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 457
  • Thank you received: 148
  • Karma: 69
Here is the new 4.9 version.

First post for the links!

- Https replaces http
- other minor fixes.

Please report any issue.

Enjoy,

M
The administrator has disabled public write access.
The following user(s) said Thank You: JiminyC, ninjaw

Bedetheque Scraper 2 - v4.9 beta 11 months 2 days ago #48007

  • StudioNeuneu
  • StudioNeuneu's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 11
  • Thank you received: 2
  • Karma: 0
Thanks for the update. I will try it.
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 beta 11 months 13 hours ago #48014

  • StudioNeuneu
  • StudioNeuneu's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 11
  • Thank you received: 2
  • Karma: 0
So it looks like everything work like before !!
Thanks very much !
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 beta 10 months 3 weeks ago #48058

  • StudioNeuneu
  • StudioNeuneu's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 11
  • Thank you received: 2
  • Karma: 0
Hello !

I am not sure, but there is something which doesn't work like before. The "Only if finished" option.
Before, when a serie was not finished, there was no number in the "Count of", but now, even if the series is not finished, there is a number there.
I tried to change the option but it doesn't work like before. Only for the One Shot series I have no number.

Edit : Ok I find what happen. In the option, in data, if I don't use "S. Completed", it put some number in "Count of". But if I use it, it doesn"t put number if the serie isn't finished. Maybe it could be good if "Count of" work even without the "S. Completed" selected. In the next update maybe ?
Last Edit: 10 months 3 weeks ago by StudioNeuneu.
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 9 months 1 week ago #48146

  • Slaan
  • Slaan's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 2
  • Karma: 0
Hi !

It is possible to use this scrapper outside of your software ? Just only in python ?
For example, within a ubuntu terminal ?
Thanks !
Last Edit: 9 months 1 week ago by Slaan.
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 9 months 1 week ago #48149

  • mizio66
  • mizio66's Avatar
  • Offline
  • Platinum Boarder
  • Started reading comics at 4... and still counting!
  • Posts: 457
  • Thank you received: 148
  • Karma: 69
The code is Ironpython, so heavily leaning on .net. In theory the core function could be reused (that is python), but as it is, not possible to port. It would need to manage the info file in the .cbr/cbz autonomously, out of CR. Not impossible, but needs some time.
if someone is willing to do it, I can help explain the code (badly written by me, I admit).

cheers
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 9 months 1 week ago #48150

  • Slaan
  • Slaan's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 2
  • Karma: 0
Thanks for your answer !
I'm a Python Dev'. Autodidact but i'm not afraid about html parsing :)

Maybe can you explain where is the function of the html parser and how it did the work ?
What kind of tool did you use ? Beautifulsoup ?
Maybe did you have write any unit test ?

If not, or if it's too complicated, don't worry. Like any other dev', i'm lazy. If i can use the work of another folks, i did :)
I have to class a ton of cbr/cbz (60 Go...), but i can write my own parser.

Thanks again !
Last Edit: 9 months 1 week ago by Slaan.
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 9 months 1 week ago #48182

  • freMea
  • freMea's Avatar
  • Offline
  • Junior Boarder
  • Posts: 38
  • Thank you received: 5
  • Karma: 1
Works great. I just wanted to report a bug only when I click the quickscrape button. The normal scrape works flawlessly.



And the setup GUI needs updates, it still displays v4.8.
Last Edit: 9 months 1 week ago by freMea.
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 7 months 1 week ago #48546

  • benchapo
  • benchapo's Avatar
  • Offline
  • Fresh Boarder
  • Posts: 1
  • Karma: 0
Hi everyone,

I have a problem with this script, the error log is:

Friday 10 November 2017 18:51:27
Caught SystemError: The operation has timed out
C:\Users\Ftop\AppData\Roaming\cYo\ComicRack\Scripts\Bedetheque Scraper 2\BedethequeScraper2.py,560,SetSerieId
C:\Users\Ftop\AppData\Roaming\cYo\ComicRack\Scripts\Bedetheque Scraper 2\BedethequeScraper2.py,1619,_read_url

It happen when I try to scrap one or multiple files in CR, and I got a message like: "completed:0, skipped:1"

Someone have ab idea?

PS: I didn't change the initial configuration, I have bedetheque scrapper 2 v4.9 ant CR 0.9.178 64 bit
The administrator has disabled public write access.

Bedetheque Scraper 2 - v4.9 5 months 2 weeks ago #48797

  • misakitchi
  • misakitchi's Avatar
  • Offline
  • Senior Boarder
  • Posts: 49
  • Karma: -1
Hi! Thanks for script update!
But why did you remove ISBN?!!
Can you say me how have it back?
I try to read your code but its too complicated... :(


Edit: and i test with a fileless entry and have no ISBN... so its not working at all (i test in debug mode too)

Debug log:
Nom Série = Bouncer No = [10]
Recherche sur le Web avec Nom de Série: Bouncer
URL Série: www.bedetheque.com/serie-663-BD-Bouncer.html
Recherche sur le Web avec No. d'Album [ 10]
============================================================
parseSerieInfo a) www.bedetheque.com/serie-663-bd-bouncer__10000.html b) False
============================================================
Genre: Western
Résumé Serie: trouvé !
Série En cours
Langue: Fr
Val. Proposée No
No. Total Album: ---
** URL de l'Album dans la Série: www.bedetheque.com/BD-Bouncer-Tome-10-L-or-maudit-317957.html
============================================================
parseAlbumInfo a) www.bedetheque.com/BD-Bouncer-Tome-10-L-or-maudit-317957.html b) 10 c) False
============================================================
Titre: L'or maudit
** URL de l'Album: www.bedetheque.com/BD-Bouncer-Tome-10-L-or-maudit-317957.html
Scenariste(s): François Boucq
Dessinateur(s): François Boucq
Coloriste(s): Alexandre Boucq
Couverturiste:
Dèpot lègal: 1/2018
Editeur: Glénat
ISBN:
Lettrage:
Note Album: 4.0
Collection: Grafica
Taille: Grand format
Nom Série = Bouncer
Couverture trouvée: www.bedetheque.com/media/Couvertures/Couv_317957.jpg
N. de Planches: 0
Last Edit: 5 months 2 weeks ago by misakitchi.
The administrator has disabled public write access.
Time to create page: 0.334 seconds

Who's Online

We have 207 guests and 2 members online