You are not logged in.
(not sure is it correct forum, but I assume text is also multimedia)
Is there a way to archieve whole website into an offline document for reading on device like Kindle?
I know how to use wget, but wget gives you just a tree of html files. How to organize that into ebook?
Is there a tool for that?
I use calibre and I aware of its "fetch news" feature but don't know how to use it to take whole website.
example website I would like to read on Kindle: http://archiwum.wiz.pl/spis_tematow.asp
Wikipedia is also a good example but I am not sure how big it is now, I remember that there were CDs with offline versions years ago
PS. please don't say "just browse website with Kindle on WiFi", that's not what I am asking
Last edited by Jacek Poplawski (2011-10-18 12:17:22)
Offline
What do you mean "rganize that into ebook"? You want to convert the html files into a pdf so you can either click on the links or browse e.g. alphabetically?
Offline
PDF is not the best format, but anyway.... News processed by calibre are organized this way that each page is separate section.
(install calibre from [community] then fetch some news and see them in mobi format). I just need way to fetch this way any website, maybe the solution is to read how calibre does it.
Offline
I use print friendly as a bookmarklet in my web browser. Converts web page into a PDF file, allowing you to remove sections you don't want.
More of a quick & dirty solution then anything else.
Offline
I tried "print friendly" but it doesn't work (produces empty pdf every time) so I am searching for other solutions, probably something which will generate document from tree mirrored by wget/curl
Last edited by Jacek Poplawski (2011-10-16 01:28:18)
Offline
I've had some success in converting web pages to PDF with indexes using htmldoc, but it's far from perfect.
Last edited by skottish (2011-10-16 14:00:16)
Offline
I read how calibre does it, but it's for RSS feeds, not static websites: http://manual.calibre-ebook.com/news.html
Offline
OK calibre handles it, no need for other tools than wget:
Offline
You could try httrack too https://aur.archlinux.org/packages.php?ID=43508
R00KIE
Tm90aGluZyB0byBzZWUgaGVyZSwgbW92ZSBhbG9uZy4K
Offline