You are not logged in.

#1 2011-11-18 05:36:16

duke11235
Member
Registered: 2009-10-09
Posts: 221

Converting .pages files

I used Mac as my primary OS for years, and have accumulated a plethora of .pages files. I was wondering if anyone knew the simplest and easiest method for converting those to .txt or something else. I was hoping for something that can handle batches of files[there are a lot of them] and could run on Arch or Win7. Thanks

Offline

#2 2011-11-20 00:51:10

thisoldman
Member
From: Pittsburgh
Registered: 2009-04-25
Posts: 1,172

Re: Converting .pages files

From information given on wikipedia, http://en.wikipedia.org/wiki/Pages#Compatibility, this method may work.  No guarantees.

Try copying/renaming one of the '.pages' file to a '.zip' extension.  You may find a '.pdf'. or .'jpg' file inside if the files have been saved with previews enabled.  You should also find an xml file which will have some form of the actual text.

If needed, two tools that may work to convert the xml to plain text are 'xmlto' in extra and 'xml2' in community.  I have experience with neither.

It doesn't sound hard to write a script for this.  Someone has probably done it before, but I couldn't find an example.

Offline

#3 2016-07-20 19:17:56

ob7dev
Member
Registered: 2016-07-20
Posts: 1

Re: Converting .pages files

I had success by changing .pages to .zip, unzipping, then inside Quick folder there is a .pdf with text, and doing a pdftotext command output pure text of original document.  Sometimes .pages uncompress different though, and there is no pdf.  In my case there was still a .jpg though, and I used tesseract to OCR the text from image after converting .jpg to .tif with image magick.

Offline

#4 2016-07-20 19:57:12

Trilby
Forum Moderator
From: Massachusetts, USA
Registered: 2011-11-29
Posts: 14,766
Website

Re: Converting .pages files

Thanks for the input ob7dev, but please keep an eye on thread dates to avoid necrobumping ancient posts.

Closed.


InterrobangSlider
• How's my coding? See this page.
• How's my moderating? Feel free to email any concerns, complaints, or objections.

Offline

Board footer

Powered by FluxBB