I am asked to post-process a lot of .doc files that contains both U+22EF Midline Horizontal Ellipsis and U+2026 Horizontal Ellipsis. Somehow they are used as markup. I am supposed to replace them with different things. I tried search and replace. But Writer treat them as the same character. I tried to export them to .txt file in the hope that I can use sed. But the resulting .txt file already replaced all U+22EF with U+2026.
Is there settings that I am not aware of, so that Writer differentiate them?