Lowercase and uppercase do not handle Mathematical Letters as expected

benjwweber · February 17, 2024, 3:17pm

I have recently noticed that

lowercase('𝐴')
# '𝐴': Unicode U+1D434 (category Lu: Letter, uppercase)

and

uppercase('𝑎')
# '𝑎': Unicode U+1D44E (category Ll: Letter, lowercase)

do not behave as I would have expected, as 𝐴 and 𝑎 have the same(-ish) relation as A and a, one being the capital form, the other the small form.
This seems to be the case for all Characters in the Unicode Block “Mathematical Alphanumeric Symbols” that have both lower-case and upper-case (Small and Capital) forms, as well as some Characters where the corresponding lower-case or upper-case form is in the Unicode block “Letterlike Symbols” (for example ‘ℛ’ and ‘𝓇’. )
Is this behaviour intended?

mnemnion · February 17, 2024, 5:07pm

Yes, this is intentional. The name of that character in Unicode is MATHEMATICAL ITALIC CAPITAL A, and if you search the Case Folding Dataset, you will see that it isn’t included.

To guess at the reasoning, a mathematical variable 𝐴 and a variable 𝑎 always refer to different things, so transforming the former into the latter would be undesirable.

benjwweber · February 17, 2024, 6:38pm

Thanks for the reply! Reasoning from the perspective of 𝐴 and 𝑎 as mathematical variables makes a lot of sense.

Topic		Replies	Views
Running out of letters: Pitfalls of Unicode? New to Julia unicode	11	1311	May 14, 2021
Unicode \epsilon\_y New to Julia	33	5771	October 10, 2019
Math notation General Usage question	4	187	July 9, 2025
Array argument naming convention: upper case or lower case? New to Julia	6	2090	January 12, 2018
Unicode: a bad idea, in general General Usage	83	4118	June 17, 2023

Lowercase and uppercase do not handle Mathematical Letters as expected

Related topics