Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

qx[] dies when multibyte UTF-8 characters split across 2048-byte chunk boundary in Rakudo #3120

Closed
p6rt opened this issue May 2, 2013 · 6 comments
Labels

Comments

@p6rt
Copy link

p6rt commented May 2, 2013

Migrated from rt.perl.org#117841 (status was 'resolved')

Searchable as RT117841$

@p6rt
Copy link
Author

p6rt commented May 2, 2013

From @masak

<masak> could someone try and reproduce this?
https://gist.github.com/masak/5505540 -- I thought we had gotten rid
of those problems, but it seems not :/
<masak> hm, I can create a Perl script that just recreates q from hex numbers...
<FROGGS> .u
<yoleaux> U+2014 EM DASH [Pd] ( )
<FROGGS> this might cause it
<FROGGS> last text line col 89
<flussence> ...*sigh*
<flussence> that char is RIGHT ON a 1024-byte boundary
<FROGGS> masak​: shortest example evar!!​: perl6 -e 'qx[perl6 -e "say 1
x 2047,q| |" | cat]'
<masak> FROGGS++
* masak submits rakudobug
<FROGGS> looks like chunksize is 2048
<FROGGS> since 1023 and 2048 is fine
<masak> in short, Parrot blows chunks. and it's not pretty.

@p6rt
Copy link
Author

p6rt commented May 2, 2013

From @pmichaud

On Thu, May 02, 2013 at 02​:48​:46PM -0700, Carl Mäsak wrote​:

<FROGGS> masak​: shortest example evar!!​: perl6 -e 'qx[perl6 -e "say 1
x 2047,q|—|" | cat]'
<masak> FROGGS++
* masak submits rakudobug

I'm not able to reproduce this bug on my system. Can we get more
details, such as Rakudo version, Parrot version, and ICU version?

Thanks,

Pm

@p6rt
Copy link
Author

p6rt commented May 2, 2013

The RT System itself - Status changed from 'new' to 'open'

@p6rt
Copy link
Author

p6rt commented May 3, 2013

From @FROGGS

$ perl6 -V | grep ver
parrot​::flex_version=
parrot​::gccversion=4.7
parrot​::gmake_version=3.81
parrot​::msvcversion=
parrot​::osvers=3.5.0-27-generic
parrot​::versiondir=/parrot/5.2.0-devel
perl6​::version=2013.03-138-g639d7f8

$ icuinfo
<ICUINFO>
International Components for Unicode for C/C++
Copyright (C) 2011, International Business Machines Corporation and
others. All Rights Reserved.
Compiled-Version​: 4.8.1.1
Runtime-Version​: 4.8.1.1
Compiled-Unicode-Version​: 6.0
Runtime-Unicode-Version​: 6.0
Platform​: U_LINUX
Build​: x86_64-pc-linux-gnu
C compiler​: gcc
C++ compiler​: g++
ICUDATA​: icudt48l
Data Directory​:
ICU Initialization returned​: U_ZERO_ERROR
Default locale​: de_DE
CLDR-Version​: 2.0.1
Default converter​: UTF-8
Default TZ​: Europe/Berlin
TZ data version​: 2011k
Plugin file is​: /usr/lib/x86_64-linux-gnu/icu/icuplugins48.txt
</ICUINFO>

@p6rt
Copy link
Author

p6rt commented Aug 30, 2013

From @FROGGS

Fixed upstream​:
parrot/parrot@c823d3e
parrot/parrot@28c83e3

Tests​: Raku/roast@36108480b8

@p6rt
Copy link
Author

p6rt commented Nov 7, 2013

@FROGGS - Status changed from 'open' to 'resolved'

@p6rt p6rt closed this as completed Nov 7, 2013
@p6rt p6rt added the Bug label Jan 5, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant