Substring function?

Sukera · February 19, 2022, 7:52am

As far as I understand, the reason for this is to be able to support invalid UTF-8 in String as well. It’s quite common to have some corrupted data that’s treated as a string. It’s usually seen as a strength to be able to do that - I wouldn’t call it a “idiosyncracy”.

You may also be interested in some of these previous discussions about various parts of the String type in julia:

Java is using UTF-16, right? The same problems mentioned in the two links above should apply as well, as it’s a variable length encoding like UTF-8. I don’t know how java would treat those bad encodings though. I think java works around this problem by just not having strings decompose into an iterator of char easily, which can get quite hairy to implement in a performant way (I can’t find the links to previous discussions about that though, sorry).

Topic		Replies	Views
Julia substring return empty string New to Julia	8	1008	April 23, 2019
SubString doesn't work with unicode New to Julia question , unicode	13	1435	June 17, 2022
Counting special characters ü, å, ø, etc General Usage strings , unicode	11	759	April 1, 2022
String slicing General Usage	3	2714	October 25, 2018
Any difference between : or , in the SubString() method? New to Julia	2	280	September 24, 2020

Substring function?

Related topics