GCB Prepend codepoints improperly break when before NFG_QC=1 codepoints #6126

Closed
p6rt opened this issue Mar 5, 2017 · 5 comments

p6rt commented Mar 5, 2017

Migrated from rt.perl.org#130922 (status was 'resolved')

Searchable as RT130922$


p6rt commented Mar 5, 2017

From zefram@fysh.org

"\x[111c2,111c4]".chars
2

The above is erroneous: these two codepoints make up a single extended
grapheme cluster. U+111c2 "Sharada sign jihvamuliya" has the property
Indic_Syllabic_Category=Consonant_Prefixed, which per UAX #29 gives it
Grapheme_Cluster_Break=Prepend, which means that it falls under rule GB9b,
so there should not be an extended grapheme cluster break following it
(except when what follows is a control character or end of text).

Strangely, the rule about Prepend characters is not being ignored
entirely. It does take effect if the following character is
also a Prepend:

"\x[111c2,111c3]".chars
1

-zefram
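
For readers who want to check the expected counts by hand, here is a minimal Python sketch of the GB9b reasoning in the report above. It is not MoarVM's implementation. The `GCB` table is a hand-written excerpt covering only the codepoints discussed here (an assumption; a real implementation would load the full Grapheme_Cluster_Break data from the UCD), the names `gcb`, `is_break` and `count_graphemes` are hypothetical, and only rules GB3, GB4, GB5, GB9b and GB999 are modeled.

```python
# Hand-written excerpt of Grapheme_Cluster_Break values. This is an assumption
# for the sketch; every codepoint not listed is treated as "Other".
GCB = {
    0x000A: "LF",
    0x000D: "CR",
    0x111C2: "Prepend",  # SHARADA SIGN JIHVAMULIYA
    0x111C3: "Prepend",  # SHARADA SIGN UPADHMANIYA
}

BREAKING = ("Control", "CR", "LF")


def gcb(cp):
    """Look up the (excerpted) Grapheme_Cluster_Break value of a codepoint."""
    return GCB.get(cp, "Other")


def is_break(prev, cur):
    """Return True if a cluster boundary falls between two adjacent codepoints."""
    p, c = gcb(prev), gcb(cur)
    if p == "CR" and c == "LF":
        return False  # GB3: never split CR LF
    if p in BREAKING or c in BREAKING:
        return True   # GB4, GB5: always break around controls
    if p == "Prepend":
        return False  # GB9b: no break after a Prepend codepoint
    return True       # GB999: otherwise break everywhere


def count_graphemes(cps):
    """Count clusters in a list of codepoints under the simplified rules."""
    if not cps:
        return 0
    return 1 + sum(is_break(a, b) for a, b in zip(cps, cps[1:]))


print(count_graphemes([0x111C2, 0x111C4]))  # the case reported as 2
print(count_graphemes([0x111C2, 0x111C3]))  # the case that already gave 1
```

Under these simplified rules both sequences count as one cluster, which matches the behaviour the report asks for. A Prepend followed by a line feed still splits in two, because GB5 takes precedence over GB9b.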


p6rt commented Mar 20, 2017

From @samcv

We did not support Prepend until a month or two ago, when I added it. Currently it only works when it comes before a character that has property `NFG_QC = 0` (not an official Unicode property).

The plan is to add support for saving state when putting codepoints into graphemes and improving some other things.


p6rt commented Mar 20, 2017

The RT System itself - Status changed from 'new' to 'open'


p6rt commented Jul 7, 2017

From @samcv

This has been fixed as of MoarVM/MoarVM@78b60512 in MoarVM. Yay.


p6rt commented Jul 7, 2017

@samcv - Status changed from 'open' to 'resolved'

@p6rt p6rt closed this as completed Jul 7, 2017
@p6rt p6rt added the uni label Jan 5, 2020