#1 2011-10-15 14:11:34

Jacek Poplawski
Member
From: Poland
Registered: 2006-01-10
Posts: 736

[SOLVED] archive website into ebook?

(Not sure if this is the correct forum, but I assume text also counts as multimedia.)

Is there a way to archive a whole website into an offline document for reading on a device like the Kindle?
I know how to use wget, but wget just gives you a tree of HTML files. How do I organize that into an ebook?
Is there a tool for that?
I use calibre and I am aware of its "fetch news" feature, but I don't know how to use it to grab a whole website.

Example website I would like to read on the Kindle: http://archiwum.wiz.pl/spis_tematow.asp

Wikipedia is also a good example, but I am not sure how big it is now. I remember that there were CDs with offline versions years ago :)

PS. Please don't say "just browse the website with the Kindle over WiFi"; that's not what I am asking.

Last edited by Jacek Poplawski (2011-10-18 12:17:22)

#2 2011-10-15 14:18:42

karol
Archivist
Registered: 2009-05-06
Posts: 25,440

Re: [SOLVED] archive website into ebook?

What do you mean by "organize that into ebook"? Do you want to convert the HTML files into a PDF so you can either click on the links or browse, e.g., alphabetically?

#3 2011-10-15 14:21:13

Jacek Poplawski
Member
From: Poland
Registered: 2006-01-10
Posts: 736

Re: [SOLVED] archive website into ebook?

PDF is not the best format, but anyway... News processed by calibre is organized so that each page is a separate section.
(Install calibre from [community], then fetch some news and look at the result in mobi format.) I just need a way to fetch any website like this; maybe the solution is to read how calibre does it.

#4 2011-10-16 00:24:48

fschiff
Member
Registered: 2011-10-06
Posts: 71

Re: [SOLVED] archive website into ebook?

I use Print Friendly as a bookmarklet in my web browser. It converts a web page into a PDF file and lets you remove sections you don't want.

More of a quick & dirty solution than anything else.

#5 2011-10-16 01:26:24

Jacek Poplawski
Member
From: Poland
Registered: 2006-01-10
Posts: 736

Re: [SOLVED] archive website into ebook?

I tried "Print Friendly", but it doesn't work (it produces an empty PDF every time), so I am looking for other solutions, probably something that will generate a document from a tree mirrored by wget/curl.
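
One way to turn a mirrored tree into something a converter can follow is to generate a single contents page that links to every downloaded HTML file. Below is a minimal Python sketch of that idea, not a finished tool. The function name `build_index`, the file name `ebook_index.html`, and the default title are all made up for illustration, and it assumes the mirror is already on disk.

```python
import html
from pathlib import Path
from urllib.parse import quote


def build_index(mirror_dir, index_name="ebook_index.html", title="Archived site"):
    """Write an HTML contents page linking to every mirrored page.

    Returns the path of the generated index file.
    All names here are hypothetical; adjust them to your own mirror.
    """
    root = Path(mirror_dir)
    index_path = root / index_name

    # Collect every HTML page in the tree, in a stable order,
    # skipping an index left over from a previous run.
    pages = sorted(
        p for p in root.rglob("*")
        if p.is_file()
        and p.suffix.lower() in {".html", ".htm"}
        and p != index_path
    )

    items = []
    for page in pages:
        rel = page.relative_to(root).as_posix()
        # Percent-encode the link target (mirrored file names may contain "?")
        # and HTML-escape the visible label.
        items.append(f'<li><a href="{quote(rel)}">{html.escape(rel)}</a></li>')

    document = (
        "<!DOCTYPE html>\n"
        '<html><head><meta charset="utf-8">'
        f"<title>{html.escape(title)}</title></head>\n"
        f"<body><h1>{html.escape(title)}</h1>\n<ul>\n"
        + "\n".join(items)
        + "\n</ul>\n</body></html>\n"
    )
    index_path.write_text(document, encoding="utf-8")
    return index_path
```

Calling `build_index("mirror")` on a directory named `mirror` (a placeholder) would write `mirror/ebook_index.html`, which can then serve as the single entry point for whatever conversion tool is used. The pages are listed in path order, so a site with a meaningful reading order would need its own sorting.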

Last edited by Jacek Poplawski (2011-10-16 01:28:18)

#6 2011-10-16 13:59:42

skottish
Forum Fellow
From: Here
Registered: 2006-06-16
Posts: 7,942

Re: [SOLVED] archive website into ebook?

I've had some success in converting web pages to PDF with indexes using htmldoc, but it's far from perfect.

Last edited by skottish (2011-10-16 14:00:16)

#7 2011-10-16 16:14:24

Jacek Poplawski
Member
From: Poland
Registered: 2006-01-10
Posts: 736

Re: [SOLVED] archive website into ebook?

I read how calibre does it, but it's for RSS feeds, not static websites: http://manual.calibre-ebook.com/news.html

#8 2011-10-18 12:17:07

Jacek Poplawski
Member
From: Poland
Registered: 2006-01-10
Posts: 736

Re: [SOLVED] archive website into ebook?

OK, calibre handles it; no tools other than wget are needed:

http://www.mobileread.com/forums/showth … ?p=1792154
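
For anyone who cannot open the link, here is a rough sketch of what a wget plus calibre workflow could look like, written in Python. It is not necessarily what the linked post describes. The wget flags and calibre's `ebook-convert` command are real, but the directory name, the start page path, and the output file name are placeholders, and the helper names `build_commands` and `run_all` are invented for this example.

```python
import subprocess
from pathlib import Path


def build_commands(url, mirror_dir, start_page, output):
    """Return (wget_cmd, convert_cmd) as argument lists.

    start_page is the path of the entry HTML file relative to mirror_dir.
    It is an assumption: check where wget actually saved the page.
    """
    wget_cmd = [
        "wget",
        "--mirror",            # recursive download
        "--no-parent",         # stay below the starting directory
        "--convert-links",     # rewrite links so they work offline
        "--adjust-extension",  # save HTML pages with an .html suffix
        "--page-requisites",   # also fetch images and stylesheets
        f"--directory-prefix={mirror_dir}",
        url,
    ]
    # ebook-convert picks the output format from the file extension.
    convert_cmd = ["ebook-convert", str(Path(mirror_dir) / start_page), output]
    return wget_cmd, convert_cmd


def run_all(commands):
    """Run each command in order, stopping at the first failure."""
    for cmd in commands:
        subprocess.run(cmd, check=True)
```

Something like `run_all(build_commands("http://archiwum.wiz.pl/spis_tematow.asp", "mirror", "archiwum.wiz.pl/spis_tematow.asp.html", "wiz.mobi"))` would then do the download and the conversion in one go. The start page path in that call is a guess at how wget names the file, so look inside the mirror directory before running the conversion.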

#9 2011-10-18 18:28:45

R00KIE
Forum Fellow
From: Between a computer and a chair
Registered: 2008-09-14
Posts: 4,734

Re: [SOLVED] archive website into ebook?


R00KIE
Tm90aGluZyB0byBzZWUgaGVyZSwgbW92ZSBhbG9uZy4K
