Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some Graphemes throw 'Malformed UTF-8 string' #927

Closed
p6rt opened this issue Apr 21, 2009 · 5 comments
Closed

Some Graphemes throw 'Malformed UTF-8 string' #927

p6rt opened this issue Apr 21, 2009 · 5 comments

Comments

@p6rt
Copy link

p6rt commented Apr 21, 2009

Migrated from rt.perl.org#64918 (status was 'resolved')

Searchable as RT64918$

@p6rt
Copy link
Author

p6rt commented Apr 21, 2009

From @wollmers

Hi,

rakudo$ ./perl6 -e '"\c[LATIN SMALL LETTER A WITH DIAERESIS,COMBINING CEDILLA]";'
Malformed UTF-8 string

current instr.​: 'parrot;PAST;Compiler;escape' pc 9067 (src/POST/Node.pir​:90)
called from Sub 'parrot;PAST;Compiler;escape' pc 1731 (src/PAST/Compiler.pir​:188)
called from Sub 'parrot;PAST;Compiler;as_post' pc 8758 (src/PAST/Compiler.pir​:2313)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 (src/PAST/Compiler.pir​:415)
called from Sub 'parrot;PAST;Compiler;as_post' pc 2600 (src/PAST/Compiler.pir​:602)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 (src/PAST/Compiler.pir​:415)
called from Sub 'parrot;PAST;Compiler;as_post' pc 3633 (src/PAST/Compiler.pir​:866)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 (src/PAST/Compiler.pir​:415)
called from Sub 'parrot;PAST;Compiler;pirop' pc 4256 (src/PAST/Compiler.pir​:1044)
called from Sub 'parrot;PAST;Compiler;post_children' pc 2185 (src/PAST/Compiler.pir​:415)
called from Sub 'parrot;PAST;Compiler;as_post' pc 3633 (src/PAST/Compiler.pir​:866)
called from Sub 'parrot;PCT;HLLCompiler;compile' pc 428 (src/PCT/HLLCompiler.pir​:301)
called from Sub 'parrot;PCT;HLLCompiler;eval' pc 920 (src/PCT/HLLCompiler.pir​:519)
called from Sub 'parrot;PCT;HLLCompiler;command_line' pc 1510 (src/PCT/HLLCompiler.pir​:798)
called from Sub 'parrot;Perl6;Compiler;main' pc 23985 (perl6.pir​:164)

This works​:
rakudo$ ./perl6 -e '"\c[LATIN CAPITAL LETTER D,COMBINING DOT BELOW,COMBINING DOT ABOVE,COMBINING HORN]";'

My versions​:
rakudo$ ./perl6 -v
This is Rakudo Perl 6, revision 37980 built on parrot 1.0.0-devel
for i486-linux-gnu-thread-multi

rakudo$ icu-config --unicode-version
5.1

In comparison the following Perl 8.10 script does not croak​:

use strict;
use warnings;

use charnames qw(​:full);

my $s= "\N{LATIN SMALL LETTER A WITH DIAERESIS}\N{COMBINING CEDILLA}";

use Encode;
decode("utf8",encode("utf8",$s,1),1);

Helmut Wollmersdorfer

@p6rt
Copy link
Author

p6rt commented Apr 23, 2009

From @moritz

On Tue Apr 21 03​:23​:10 2009, helmut@​wollmersdorfer.at wrote​:

Hi,

rakudo$ ./perl6 -e '"\c[LATIN SMALL LETTER A WITH DIAERESIS,COMBINING
CEDILLA]";'
Malformed UTF-8 string

Added as a test to t/spec/S02-literals/char-by-name.t - thanks for the
report.

Cheers,
Moritz

@p6rt
Copy link
Author

p6rt commented Apr 23, 2009

The RT System itself - Status changed from 'new' to 'open'

@p6rt
Copy link
Author

p6rt commented Nov 16, 2009

From @kyleha

On Thu Apr 23 09​:34​:21 2009, moritz wrote​:

Added as a test to t/spec/S02-literals/char-by-name.t - thanks for the
report.

The test passes now, so I'm marking this resolved.

@p6rt
Copy link
Author

p6rt commented Nov 16, 2009

@kyleha - Status changed from 'open' to 'resolved'

@p6rt p6rt closed this as completed Nov 16, 2009
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant