Check against i18n Review Checklist

philarcher opened this issue 2 years ago

This short review is for the following spec: RDF Dataset Canonicalization.

Short i18n review checklist is here

Gregg Kellogg · Answer 1 · Sat Jan 21 2023 02:38:52 GMT+0800 (China Standard Time)

If the spec (or its implementation) contains any natural language text that will be read by a human (this includes error messages or other UI text, JSON strings, etc, etc), ensure that there’s metadata about and support for basic things such as language and text direction. Also check the detailed guidance for Language and Text direction.

RDF Datasets may contain natural language text. The methods for encoding such text are described in existing RDF specs. See, for example, the [Turtle specification on RDF Literals](https://www.w3.org/TR/2014/REC-turtle-20140225/#literals). The RDF Canonicalization spec accepts any serialization of RDF but does not create a new one. The majority of the c14n spec is concerned with the algorithm for labeling blank nodes and ordering the resulting quads to create a canonical form. At the time of writing, the WG has not settled on whether such a canonical form has a canonical serialization (although it seems likely that it will). In other words, the current working assumption is that i18n issues are covered by the existing RDF standards and that no new issues are created by canonicalization. It is on this point that we would be very grateful for feedback.

Note that JSON-LD instituted a pattern for defining text direction along with language using a datatype (see The i18n Namespace). In principle, this can be done in any serialization and, IMHO, should be a work item for the RDF-star WG, although it may have some semantic implications. Still, we may want to mention it as part of the Internationalization checklist.
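As a rough illustration of that pattern (a sketch, not normative text): the datatype IRI is built from the `https://www.w3.org/ns/i18n#` namespace, the language tag, an underscore, and the direction. The helper name `i18n_datatype` is hypothetical, and lowercasing the language tag is an assumption made here for consistency; check the JSON-LD spec for the exact rules.

```python
# Hypothetical helper sketching the JSON-LD "i18n namespace" convention:
# language and base direction are folded into a single datatype IRI.
I18N_NS = "https://www.w3.org/ns/i18n#"


def i18n_datatype(language: str, direction: str) -> str:
    """Build a datatype IRI such as https://www.w3.org/ns/i18n#ar-eg_rtl."""
    if direction not in ("ltr", "rtl"):
        raise ValueError(f"unsupported direction: {direction!r}")
    # Assumption: the language tag is lowercased before being appended.
    return f"{I18N_NS}{language.lower()}_{direction}"


def typed_literal(lexical_form: str, language: str, direction: str) -> str:
    """Render an N-Quads-style typed literal (no escaping of the lexical form)."""
    return f'"{lexical_form}"^^<{i18n_datatype(language, direction)}>'
```

Because the result is an ordinary datatyped literal, canonicalization would treat it like any other literal; nothing in this sketch depends on the c14n algorithm.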

If the spec (or its implementation) sorts text ensure that it does so in locally relevant ways. Also check the detailed guidance for Text-processing.

We believe the existing RDF specs offer sufficient clarity in this regard but, again, we'd be grateful for any feedback on this. The RDF Dataset c14n algorithm does include a step where all quads are arranged in lexical order.

We do sort text as part of the algorithm function, and added specific text on using Unicode code point order for doing that. See Unicode code point order in Canonicalization Algorithm Terms.
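To make the distinction concrete, here is a minimal sketch (not taken from the spec) contrasting Unicode code point order with UTF-16 code unit order, which is what some languages use by default when comparing strings. The function names and sample strings are illustrative assumptions.

```python
# Sketch: code point order vs. UTF-16 code unit order.
# Python compares str values code point by code point, so plain sorted()
# already yields Unicode code point order.


def code_point_order(strings):
    """Sort strings by Unicode code point order."""
    return sorted(strings)


def utf16_code_unit_order(strings):
    """Sort strings by UTF-16 code unit order (shown for contrast only).

    Big-endian UTF-16 bytes compare lexicographically in the same order
    as the underlying 16-bit code units.
    """
    return sorted(strings, key=lambda s: s.encode("utf-16-be"))


# U+FF5E is in the Basic Multilingual Plane; U+1F600 is a supplementary
# character that UTF-16 encodes as the surrogate pair D83D DE00.
sample = ["\U0001F600", "\uFF5E", "a"]

by_code_point = code_point_order(sample)
by_utf16_unit = utf16_code_unit_order(sample)
```

With this sample the two orders disagree: code point order places U+FF5E before U+1F600, while UTF-16 code unit order places the surrogate pair first. That is why pinning the comparison to code points matters for getting the same canonical output across implementations.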

Addison Phillips · Answer 2 · Tue Jan 24 2023 08:28:57 GMT+0800 (China Standard Time)

Thanks @philarcher and @gkellogg. I have put this self-review on I18N's agenda for this week and have reviewed the above briefly.

> We do sort text as part of the algorithm function

I recall us discussing this in previous issues, and you should be fine with code point order.

Addison Phillips · Answer 3 · Sun Jan 29 2023 01:50:39 GMT+0800 (China Standard Time)

Reviewing the above--thank you for the self-review--a few comments.

> We believe the existing RDF specs offer sufficient clarity in this regard but, again, we'd be grateful for any feedback on this. The RDF Dataset c14n algorithm does include a step where all quads are arranged in lexical order.

In looking at the current WD, I see that (as noted by @gkellogg) you define and use Unicode code point order throughout. This is the right thing to do and you look to be in good shape here.

If the spec (or its implementation) allows searching or matching of text, including syntax and identifiers understand the implications of normalisation, case folding, etc. Also check the detailed guidance for Text-processing.

Regarding matching/searching/find processing, I don't see anywhere that you are defining text searching/find operations or textual regular expressions, so I think that section doesn't apply to your spec.

> We believe this is covered by the base RDF standards and won't be affected by c14n. (comment in regard to localizability)

I agree. This is N/A for your spec.

In general, a quick perusal of the WD didn't turn up any additional issues. As always, if you have questions or need assistance, please call out to I18N!