Some parts of regex engine impose I32 limit on code points #12467
From @khwilliamson
This is a bug report for perl from khw@karl.(none),
Some parts of the regex engine and documentation say that it can handle
Flags:
Site configuration information for perl 5.17.5:
Configured by khw at Sun Oct 7 09:13:22 MDT 2012.
Summary of my perl5 (revision 5 version 17 subversion 5) configuration:
Locally applied patches:
@INC for perl 5.17.5:
/home/khw/blead/lib/perl5/site_perl/5.17.5/i686-linux-thread-multi-64int-ld
Environment for perl 5.17.5:
PATH=/home/khw/bin:/home/khw/print/bin:/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/usr/games:/home/khw/cxoffice/bin
From @khwilliamson
On 10/07/2012 03:13 PM, karl williamson (via RT) wrote:
I'm currently thinking that it was/is a mistake to allow anything larger
It turns out that some code uses negative code points as markers, and
I therefore propose that we break potential backwards compatibility by
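The conflict between a signed 32-bit field and negative marker values can be sketched in C, the language of the perl core. This is only an illustration under stated assumptions: `int32_t` stands in for perl's `I32`, and the names `HYP_MARKER_NONE`, `hyp_fits_i32`, and `hyp_store_i32` are hypothetical. It is not the regex engine's actual code.

```c
#include <stdint.h>

/* Hypothetical sentinel: negative values are reserved as markers,
 * so they can never also denote a real code point. */
#define HYP_MARKER_NONE ((int32_t)-1)

/* Returns 1 if the code point fits in a signed 32-bit field without
 * colliding with the negative marker range, otherwise 0. */
static int hyp_fits_i32(uint64_t code_point)
{
    return code_point <= (uint64_t)INT32_MAX;
}

/* Stores a code point in a signed 32-bit field, or returns the marker
 * when the value is out of range (assumption: callers check for it). */
static int32_t hyp_store_i32(uint64_t code_point)
{
    if (!hyp_fits_i32(code_point))
        return HYP_MARKER_NONE;
    return (int32_t)code_point;
}
```

Under these assumptions, every value from 0x80000000 upward is unrepresentable: the sign bit is already spoken for by the markers.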
From @Tux
On Sun, 07 Oct 2012 15:35:46 -0600, Karl Williamson
+1
The RT System itself - Status changed from 'new' to 'open'
From @khwilliamson
On 10/08/2012 12:10 AM, H. Merijn Brand via RT wrote:
I started to implement this, and ran into two problems. One is that
The other is that complementing a scalar which contains a UTF-8 string
"When complementing strings, if all characters have ordinal values under
I think, if we were to go ahead and restrict to IV_MAX, that we would
But if we don't restrict to IV_MAX, we have issues in that some places
for (my $code_point = high_number;
Thus there is a dilemma that I don't see a good answer to.
From @cpansprout
On Sun Oct 07 14:36:18 2012, public@khwilliamson.com wrote:
What I say may not be of any consequence, but I do see uses for code
--
Father Chrysostomos
From @khwilliamson
On 07/26/2013 07:09 PM, Father Chrysostomos via RT wrote:
Since, I've found that tr/// imposes an undocumented IV limit on
From @cpansprout
On Sep 11, 2015, at 12:06 PM, Karl Williamson <public@khwilliamson.com> wrote:
It has been a while since I said that, and now I do not remember clearly. It may be that I was thinking of using strings for sequences of arbitrary 32-bit integers, just as I currently use them for 16-bit integers. That may be too vague to deserve consideration. If you are right that chars over IV_MAX are currently not handled correctly, then it is probably OK to do as you suggest. But it may require a deprecation cycle to weed out code that puts high codepoints in strings. While such code won’t be common, it is likely to occur in tests; and breaking people’s tests without a deprecation is not nice.
From @khwilliamson
Commit 760c7c2 deprecates code points above IV_MAX
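The commit itself is not reproduced in this thread. As a rough sketch of the boundary it refers to, the following C fragment classifies a code point against the Unicode maximum (0x10FFFF) and against IV_MAX. It assumes a 64-bit `IV`; `HYP_IV_MAX`, `HYP_UNICODE_MAX`, and `hyp_classify` are hypothetical names used only for illustration, not identifiers from the perl source.

```c
#include <stdint.h>

/* Assumption: IV is a 64-bit signed integer, so IV_MAX == INT64_MAX. */
#define HYP_IV_MAX      ((uint64_t)INT64_MAX)
#define HYP_UNICODE_MAX ((uint64_t)0x10FFFF)

enum hyp_cp_class {
    HYP_CP_UNICODE,       /* 0 .. 0x10FFFF */
    HYP_CP_ABOVE_UNICODE, /* above Unicode, still representable as an IV */
    HYP_CP_ABOVE_IV_MAX   /* the range the thread says is now deprecated */
};

/* Classifies an unsigned code point value into one of the three ranges. */
static enum hyp_cp_class hyp_classify(uint64_t code_point)
{
    if (code_point <= HYP_UNICODE_MAX)
        return HYP_CP_UNICODE;
    if (code_point <= HYP_IV_MAX)
        return HYP_CP_ABOVE_UNICODE;
    return HYP_CP_ABOVE_IV_MAX;
}
```

On a build where `IV` is 32 bits wide, the same boundary would sit at 0x7FFFFFFF instead.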
From @khwilliamson
I'm closing this, as the usage is now deprecated
@khwilliamson - Status changed from 'open' to 'resolved'
Migrated from rt.perl.org#115166 (status was 'resolved')
Searchable as RT115166$