Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow identity matching without decoding to scalar values #225

Open
aphillips opened this issue Aug 30, 2023 · 0 comments
Open

Allow identity matching without decoding to scalar values #225

aphillips opened this issue Aug 30, 2023 · 0 comments

Comments

@aphillips
Copy link
Contributor

RDF is struggling to find a definition for "Unicode string" that fits their needs. Addison has made some suggestions, but notes that we don't provide a string definition. In addition, there is the problem that Charmod-norm doesn't provide the specific "code unit vs. code point" performance out being discussed here.

Particularly germane to this might be the suggestion:

Provide normative text to allow for the efficient comparison of strings, along the lines of:

A string is identical to another string if it consists of the same sequence of code points. An implementation MAY determine string equality by comparing the code units of two strings using the same Unicode character encoding form (UTF-8 or UTF-16) without decoding the string into a scalar value sequence.

This is a topic for the 2023-08-31 call.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant