Syntax: Escape hatch for unicode haters

mbauman · January 5, 2024, 3:09pm

Most modern languages support unicode identifiers these days:

Python (2. Lexical analysis — Python 3.3.7 documentation)
Rust (Identifiers - The Rust Reference)
Swift (Documentation)
Go (The Go Programming Language Specification - The Go Programming Language)
C# (C# identifier names - rules and conventions - C# | Microsoft Learn)
Raku (identifiers | Raku Documentation)
Java (Charsets and Unicode Identifiers in Java - DZone)
Ruby (Coding Ninjas Studio)
C++ (Identifiers - cppreference.com)
Heck, even C99 has rudimentary support… but as the oldest one here, it ironically allows int \U03B1 = 2; while leaving α = 2; implementation defined. (Identifier - cppreference.com).
Javascript also allows unicode as well as using unicode escapes in identifiers somewhat similarly to C (Valid JavaScript variable names in ES5 · Mathias Bynens)

Of the languages I thought of here, only Perl, R, and Fortran don’t seem to support unicode. And only C and Javascript support using \U or \u escapes. None support latex- or html-like entity names.

Topic		Replies	Views
Non-unicode versions of unicode functions in base/stdlib? Internals & Design	10	1309	May 16, 2021
Warning against Unicode confusables Internals & Design unicode	51	1924	January 13, 2024
Fun with Unicode: TemplateᐸTᐳ syntax and more General Usage syntax , unicode	4	92	August 9, 2024
Rationale behind excluding some unicode characters from identifiers Internals & Design	10	386	March 3, 2023
String conversion from Symbol with Unicode does not yield a string, which is intended to be the same New to Julia question , bug	6	763	December 5, 2020

Syntax: Escape hatch for unicode haters

Related topics