Jason
posted this on March 01, 2009 07:55 pm
The plain text version of a document is usually extracted from a PDF or other rich file format. It's often difficult to retain line breaks and other text formatting when text is extracted from another format.