I want to extract the text from both doc and docx file with track changes in a batch mode (preferably in commandline) in Linux (Ubuntu) environment.
There are some options as below
soffice --headless --convert-to txt:Text input-file.doc[x]
But it creates the txt file containing text both before and after the track changes.
Another option is :
abiword -t txt inputfile.doc[x]
But it gives the file after track changes (final) .
But I want two text files from the docx file
- Text containing before track changes
- Text containing after track changes