print to podcast
November 10, 2013
The VipRiser can produce more then just a PDF thanks to its ability to customize processing workflow using a plugin.
Listening to your print-outs
I do a lot of reading in my work (and my private) life and I always wished I could turn some of that text into spoken word so that I can take it with me out biking or hiking rather then staring at it on my screen at home. The OS X had text to speech functionality built-in since 10.4 (I think), however only with the Mavericks the quality of the voice is good enough for it to be truly usable. If you are into this kind of thing, that is :-).
There are different ways how to turn text into a voice track already. One of them being Add to iTunes as a Spoken Track service, but I use VipRiser for that. One big advantage is that whatever text I can print, I can listen to.
Setting up a workflow
I thought ‘print to speech’ is kind of interesting workflow, therefore I added it as a sample into the VipRiser distribution itself. You can install it from Preferences.
The plugins (workflows) will appear in the menu along with Folder, Kindle etc. and you can use them as VipRiser’s destination.
Customizing the ‘PDF to Podcast’ workflow
Although this sample workflow is derived from the real one I personally use, it is meant to be further customized by the end user. You can look inside by going to the Plugins directory (use Open button) and opening the workflow in the Automator.
The Automator workflows can get quite complex. This is great if you a programmer but understandably scary if you are not. Luckily there are many good resources for the Automator on the web to help you along, therefore I will not go into deep technical details here.
What does it do
- extract text from the PDF
- reflow the text
- turn text into an audio file
- add it to the iTunes
The reflow of the text is one of the tricky point. The problem is that words extracted from a PDF will most likely be broken up to satisfy given layout. The text to speech cannot deal with this on its own. The shell script step (a perl script) attempts to take care of this by ‘un-breaking’ broken words and clean up the extracted text somewhat. …I do not expect anyone to understand this code 🙂 However, this part is directly influences the quality of text to speech conversion as extracted text may contain stuff that should not be spoken at all (e.g. headers, side notes) or that may not be in form of senses and paragraph (e.g. tables, lists). Getting this right very much depends on kind of documents you want to process.
How to use it
- set the destination to the workflow (e.g. Sample – PDF to podcast)
- the MP3 file will be added to the iTunes (playlist is configurable in the last step)
TIP: What works really great is printing articles out of the reader mode in the Safari.