Philip.Hazel
4a7dfab0ec
Unicode upper/lower casing is now used when UCP is set, even if UTF is not set.
...
This is not yet documented, and it not yet implemented in JIT.
2020-02-23 16:40:05 +00:00
Philip.Hazel
e413f3147c
Optimize certain starting code unit bit maps into a single starting code unit.
2019-09-13 17:02:06 +00:00
Philip.Hazel
bf15267c30
Optimize classes such as [Aa] to be a single caseless character.
2019-09-09 17:00:19 +00:00
Philip.Hazel
1ebc2c50cc
Another extension to minimum length calculation.
2019-06-17 16:26:44 +00:00
Philip.Hazel
16c046ce50
Implement support for invalid UTF in the pcre2_match() interpreter.
2019-05-24 17:15:48 +00:00
Philip.Hazel
d90de8b053
Previout bug-fix was bad. This properly fixes an overrun while reading a
...
Unicode group name.
2019-02-07 17:59:37 +00:00
Philip.Hazel
866750fd53
Basic "script run" implementation. Not yet complete, and not yet documented.
2018-10-02 15:25:58 +00:00
Philip.Hazel
a69267246f
Implement callouts from pcre2_substitute().
2018-09-18 16:31:30 +00:00
Philip.Hazel
814cc96bc5
Save extra compile options with the compiled pattern, and add an info call to
...
retrieve them.
2017-12-16 17:49:26 +00:00
Philip.Hazel
b7d5cee61f
Allow anchored patterns to use "first code unit" optimization.
2017-06-30 16:00:33 +00:00
Philip.Hazel
dfc9712bcd
Add additional compile options and PCRE2_EXTRA_ALLOW_SURROGATE_ESCAPES.
2017-05-17 17:55:11 +00:00
Zoltán Herczeg
bdac9df4cf
Fix character type detection when 32-bit and UCP are enabled but UTF is not in JIT.
2017-03-31 05:40:37 +00:00
Philip.Hazel
53bf29d689
Fix 32-bit non-UTF property test crash.
2017-02-24 18:25:32 +00:00
Philip.Hazel
8933d999d8
Fix class bug when UCP but not UTF was set and all wide characters need to be
...
included.
2016-12-26 17:11:18 +00:00
Philip.Hazel
dda1e79060
Fix bug that caused chars > 255 not to be matched by classes like [\W\pL] when
...
PCRE2_UCP was not set.
2016-08-03 17:22:59 +00:00
Philip.Hazel
69c9d81e43
Update pcre2test with the /utf8_input option, for generating wide characters in
...
non-UTF 16-bit and 32-bit modes.
2016-08-03 09:01:02 +00:00
Philip.Hazel
3263d44b97
Implement --never-backslash-C
2015-10-17 13:50:56 +00:00
Philip.Hazel
be7366f6f0
Tidied up a lot of comments in the test files.
2015-09-16 16:49:48 +00:00
Philip.Hazel
ee41aa906f
Runtime UTF checks now take not of the starting offset.
2015-08-18 10:34:05 +00:00
Philip.Hazel
c9ac9e23ec
Refactor named group handling by adding a pre-pass that generates a list of
...
named groups with their numbers before the rest of the compiling code is run.
This has simplified the main compiling code and removed some sources of error.
2015-06-03 16:27:47 +00:00
Philip.Hazel
91f2e97474
More documentation and test updates.
2014-11-23 18:38:38 +00:00
Philip.Hazel
fd555f266c
Test tidies and additional tests.
2014-08-06 09:35:11 +00:00
Philip.Hazel
e022475d54
All tests except JIT and save/reload are implemented.
2014-08-05 16:51:32 +00:00