Ask Your Question
0

Search for German consonant ß but not ss

asked 2016-07-14 09:56:37 +0100

Huskey gravatar image

Hi there

When I do a search in Writer for words with the German consonant ß, it shows me also words with ss. How can I avoid this? I want only words containing the ß.

Many thanks for your help.

Win 10; LO 5.1.4.2

edit retag flag offensive close merge delete

3 Answers

Sort by » oldest newest most voted
0

answered 2016-07-14 10:47:56 +0100

floris v gravatar image

You probably can't. Not only does this happen in Apache OO as well, but the reverse works in both: searching for "dass" also finds "daß". It's probably hardwired into the compare text algorithm. However, you can use that to your advantage: you can spot inconsistent spelling this way.

edit flag offensive delete link more

Comments

@floris v: I would back your suspicion. Even if you search by regular expression for \u00DF (which is the codepoint of ß), you also get found the double-s. \u1E9E is treated the same way. It is the same with the alternative search extension.
This seems to lie very deep, as the regex search for [A-Z] is including ß while it does not include ä, ö, ü. Probably the ß is replaced by ss when searching at any occurrence.
By the way: Searching for ss or s{2} with regex will also find ß.

Lupp gravatar imageLupp ( 2016-07-14 16:28:30 +0100 )edit
0

answered 2016-07-14 12:23:06 +0100

karolus gravatar image

Hallo

Enable Option: …[x]upper/lower case

edit flag offensive delete link more

Comments

@karolus: I didn't know that. Did you notice it by accident, or is there a specification to that effect? As ss is surely lower case, this may indicate that ß is treated as an upper case letter?

Lupp gravatar imageLupp ( 2016-07-14 16:34:54 +0100 )edit

@Lupp: ß is a lowercase letter with no direct uppercase counterpart, so It seems a common agreement is uppercase('ß') == 'SS'

karolus gravatar imagekarolus ( 2016-07-14 17:24:36 +0100 )edit

Hadn't thought of that, I almost deleted my answer on seeing your reply.

floris v gravatar imagefloris v ( 2016-07-14 17:44:48 +0100 )edit

@karolus What does your answer '...[x]upper/lower case' mean? I've just did a try with a simple text editor. There, searches for ss give me only ss results and searches for ß give me only ß results.

Huskey gravatar imageHuskey ( 2016-07-14 20:26:51 +0100 )edit

@Huskey: The setting on the dialog box is [x] Match case. With that selected, A is different from a.

Many years ago I wrote a routine for sorting text, including accented letters. Every word to be sorted was copied, then in the copy every accented letter was replaced by its unaccented counterpart. Like ë by e. My, and karolus' guess is that the search routine in AOO and Libre does something similar, and certainly with ß, especially because it works both ways.

floris v gravatar imagefloris v ( 2016-07-14 21:53:40 +0100 )edit
0

answered 2016-07-15 21:44:06 +0100

Huskey gravatar image

Many thanks to all.

edit flag offensive delete link more
Login/Signup to Answer

Question Tools

1 follower

Stats

Asked: 2016-07-14 09:56:37 +0100

Seen: 124 times

Last updated: Jul 15 '16