IMHO, Kevin didn’t give full details about the original file. It isn’t a simple PDF. It contains an image (the top layer) with white background and some text boxes below (the bottom layer) created by the OCR application.
You can’t follow the same procedure unless your OCR-file has the same structure. And this depends on the OCR software.
But, from experience with an OP on this site, PDF is not the best format to amend OCR files. You should save plain text (yes, you lose formatting information) and proceed through some macro generator. You write correction rules as macros and you launch the macro-generator. However, not all macro-generators will allow for “comfort”. Best choice is for 2-layer macro-gnerators, i.e. those which operates not on characters but on “tokens” (meaningful group of characters) because you’re handling natural language and natural language is based on words. Using “tokens” intermediate form allows to ignore easily spaces and other “noise” sequences.