I have a LibreOffice Writer document saved as a “flat” XML file (
*.fodt file type). I am trying to apply regular expressions to it, using an external text editor.
My efforts are hampered because the document is littered with dozens of
<text:span text:style-name="Tnn"> ... </text:span> wrappers. They seem to appear haphazardly, even between characters of a single word without any apparent change of “style” in the Writer view of the document itself. The
T10) numbers appear to be related to style declarations including something like:
Of course, this makes it impossible to construct any regex that works across the document as a whole.
So, two questions:
- What are these wrappers and
- Is there an easy way to get rid of them?
Trying to remove them manually would be ridiculously difficult.
Note: this Q&A is related to the following:
· “Regular expressions to move punctuation from after to before superscripts”