There were some questions concerning kruti Dev and I am curious (and tried to help).
...
However any source I could find about the topic talks of cruti dev fonts and most of them suggest downloads. That’s useless for me. I’m not interested in using that stuff, but in understanding.
...
On the other hand a questioner is talking about conversion of cruti Dev to unicode and reverse. This should mean that kruti Dev is a way of coding for characters in a way probably especially adequate for character sets used with devanagari or related fonts.
...
Please point me to a source clearly explaining the situation in thoroughly chosen words without mixing up concepts and without making a mess of terms.
https://gist.github.com/tripleee/b82a79f5b3e57dc6a487ae45077cdbd3
In short: »kruti dev font« seems to be a pseudo-font wich is in real a kind of keyboard-makro to translate sequences of ascii-letters into some single-unicode-letter out of the Devanagari-subset.
Yes. It’s hard. And I still miss the needed information. Got a guess.
A keyboard macro might be used on the system to get a kruti Dev multi-character code for a single stroke on a key showing a character of an alien script. This way may be created a file which our questioner would like to feed into his converter.
Now the multi-character representations need to be separated, recognised and used to get the one character (or its unicode representation) which once shall be printed or ...
BTW: The code you posted gives 171 entries. The mentioned questioner had listed 1286 entries. See attachment:
conversionListAnniruddhaMohod.ods (20.6 KB)
… and filtered [x]case sensitive … [x]no duplicates … the list has 1008 entries!
And also a number of extra-replacements which are applied after the first-processing.
(post deleted by author)
And entries aren’t sorted according to length, for example lines 35 & 36
That means the character U+0936 will be replaced always, but U+0936&U+093E never.
If we look at the assignments we find at least some kruti Dev codes which are case sensitive. But still this isn’t the most relevant questions.
- How are the codes per character separated?
- How is made sure that longer codes aren’t destroyed by the replacement of shorter ones starting with an identical part when running a conversion in situ?
- (Different category:) Why is such a complicated code still used?
Why isn’t there a standard document defining the system clearly?
Such documents exist for some dfecades now for unicode in general and for utf-8 , utf-16, too.
that means the list should first sorted by »len( column A) descending« and then in alphabetic-order
Yes, but why did the questioner not regard this. He claimed nevertheless that his code worked correctly with selected text.
Imaging you have nothing but a keyboard with english-layout.
Ask the questioner!
Why?
Who would and who could type (from natural memory) the mystic codes on an English keyboard.