Extract PDF Contents

This tool extracts plain text from a PDF document and, using language detection and special cleanup rules, rejoins words that were split across line breaks into whole words. It also cleans up encoding peculiarities in the different Acrobat formats, for example by standardising bullet points.
Please note that only the first 20,000 characters will be processed.
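
The cleanup described above can be illustrated with a rough Python sketch. This is not the tool's actual implementation: the function name `clean_extracted_text`, the `vocabulary` argument (standing in for a word list chosen after language detection) and the list of bullet glyphs are all assumptions made for illustration. It shows one plausible way to rejoin hyphenated line breaks, standardise bullets and apply the 20,000-character limit.

```python
import re

MAX_CHARS = 20_000  # processing limit stated in the text above

# Assumed set of bullet glyphs that commonly appear in extracted PDF text
BULLETS = "\u2022\u25cf\u25aa\u25e6\u2023\uf0b7"


def clean_extracted_text(raw: str, vocabulary: set[str]) -> str:
    """Hypothetical cleanup pass over text already extracted from a PDF."""
    # Only the first MAX_CHARS characters are processed
    text = raw[:MAX_CHARS]

    # Standardise any leading bullet glyph to a plain "- "
    text = re.sub(rf"(?m)^[ \t]*[{BULLETS}][ \t]*", "- ", text)

    def rejoin(match: re.Match) -> str:
        head, tail = match.group(1), match.group(2)
        joined = head + tail
        # Drop the hyphen only when the joined form is a known word;
        # otherwise treat it as a genuine hyphenated compound
        if joined.lower() in vocabulary:
            return joined
        return f"{head}-{tail}"

    # Rejoin words that were split by a hyphen at the end of a line
    return re.sub(r"(\w+)-[ \t]*\n[ \t]*(\w+)", rejoin, text)


sample = "The docu-\nment is well-\nknown.\n\u2022 first item"
print(clean_extracted_text(sample, {"document"}))
```

A real system would need a full dictionary per detected language; the small set passed here only demonstrates the lookup step.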


API

This tool is also available via a web-based API, so you can use it from your own applications and websites. Learn more about the API or sign up now.

Call now

Feel free to contact us at any time to discuss your multilingual or translation project.

Ph. +46 70 885 9690 (Sweden)
Ph. +33 6 76 59 21 07 (France)
Ph. +44 20 8133 8055 (UK)
Skype. sprawk