I need a piece of software that I can use to compare two versions of multiple files (using an A/B/X process) which are not quite the same .doc/.docx/.odt/.txt files as their name may indicate. I have consolidated a multitude of backups and recovered archives, leaving me multiple examples of a 'document' that has 10-200 versions. Unfortunately, the 200th versions is not always the most-current, or had the most-current updates. In one case, the 7th version is considered the gospel, and everything beyond that is duplicates of older versions, clones, or cast-offs from the incremental process. And they are not necessarily identical - sometimes the only thing that changed was the metadata properties (usually the subversion numbers)
ExamDiff has been suggested. It would work, but it would be laborious. There are less than 100 documents, so it is doable. However, if anyone knows of any other software that might work, I would welcome the input. This is strictly for text comparison - metadata comparison is sufficiently workable from the embedded document properties.
EXAMPLE: Document 22 is 57 pages long. The 24th version is the most-current - it has four words removed in comparison to the 22nd version. The four words describe a historical event that is now considered 'non-Canon' after some changes in the historical timeline in other documents, so they were removed. The 23rd version is an incremental backup of the 22nd version that was triggered by a metadata change in the document itself (likely a migration from Word to Writer, but we're not sure yet). ExamDiff could have found the differences between 22 and 24, but not 22 and 23 - that was found in the document properties metadata.
Why ask here? Smart people hang out here. This I have learned through the years...
