Damaged DOCX2TXT 1.03b Damaged DOCX2TXT needs the Microsoft .Net version 2 framework. Using a GUI interface and a Perl coded back end, Damaged DOCX2TXT will extract the text even from damaged or corrupted Word 2007 docx files where Word 2007 itself fails to salvage text.
Published by
S2 Services
File Size: 3,585 K OS: Win XP Released: 03-Mar-2010
Free to try, $0.00 to buy
Description Damaged DOCX2TXT requires the installation of the Microsoft .Net version 2 framework.
Using a GUI front and a Perl coded back end, Damaged DOCX2TXT extracts the text from damaged or corrupted Word 2007 docx files where Word 2007 fails to salvage text.
Word 2007 files are really zipped collections of mostly XML files. XML is not tolerant of file corruption. The text from a Word 2007 document is found in the document.xml file within the zipped collection. From the errors it generates Word 2007 appears to be using using both an inadequately corruption tolerant unzipper as well as an inadequate corruption tolerant XML reading algorithm to salvage text from the mentioned XML file within corrupt Word 2007 docx files. Damaged DOCX2TXT on the other hand uses a more corruption tolerant unzipper and a corruption tolerant XML reading algorithm as well, succeeding where MS Word fails.
Damaged DOCX2TXT can also be simply used as a an undamaged Word DOCX file viewer, without having Word 2007 or 2010 installed (or earlier version of Word with the Compatibility Pack). It also works as a text editor of the extracted docx text.
Requirements Damaged DOCX2TXT 1.03b requires .Net Version 2