Find-replace time stamps in video transcripts

I hope you are well and in good spirits.

I have transcripts copied from online videos. The text of the transcripts is broken up by time stamps. I would like to use LibreOffice Writer to search for the time stamps and delete them, while preserving the paragraph breaks that come before and after each time stamp.

Here is an excerpt from a transcript.
“…also has lawlessness and everything is
1:18:39
dictated by presidency are those situations comparable or not you think
1:18:44
uh i i’m not in a position to judge i know so little about Taiwan uh you know
1:18:50
aside from reading the economist type thing…”

I would like it to become:
"…also has lawlessness and everything is

dictated by presidency are those situations comparable or not you think

uh i i’m not in a position to judge i know so little about Taiwan uh you know

aside from reading the economist type thing…"

Time stamps vary in the number of characters, with those on the first few pages consisting of four characters (such as 0:05); later pages containing time stamps of five characters (e.g., 10:35); and the last pages containing time stamps of seven characters (e.g., 1:13:22).

How can I automate removing the time stamps?

Thank you.

Find/Replace. In the Find box, type [0-9]:[0-9][0-9]:[0-9][0-9], in the Replace box, nothing, tick Other options, select Regular expressions, then Replace all.

2 Likes

Thank you. This works fine.
For the shorter time stamps, of course, I deleted portions of the “formula” and ran the find-replace again. So I did
[0-9]:[0-9][0-9]:[0-9][0-9] first, which removed the seven-character time stamps; then
[0-9][0-9]:[0-9][0-9] which removed the five-character time stamps; then
[0-9]:[0-9][0-9] which removed the four-character time stamps.

Catlike