Fix some documentation typos.
This commit is contained in:
parent
a5389db88d
commit
d70da76dfb
|
@ -2643,10 +2643,10 @@ lookbehind. For example, consider the pattern
|
|||
</pre>
|
||||
which finds occurrences of "iss" in the middle of words. (\B matches only if
|
||||
the current position in the subject is not a word boundary.) When applied to
|
||||
the string "Mississipi" the first call to <b>pcre2_match()</b> finds the first
|
||||
the string "Mississippi" the first call to <b>pcre2_match()</b> finds the first
|
||||
occurrence. If <b>pcre2_match()</b> is called again with just the remainder of
|
||||
the subject, namely "issipi", it does not match, because \B is always false at
|
||||
the start of the subject, which is deemed to be a word boundary. However, if
|
||||
the subject, namely "issippi", it does not match, because \B is always false
|
||||
at the start of the subject, which is deemed to be a word boundary. However, if
|
||||
<b>pcre2_match()</b> is passed the entire string again, but with
|
||||
<i>startoffset</i> set to 4, it finds the second occurrence of "iss" because it
|
||||
is able to look behind the starting point to discover that it is preceded by a
|
||||
|
|
|
@ -215,8 +215,8 @@ if (rc < 0)
|
|||
return 1;
|
||||
}
|
||||
|
||||
/* Match succeded. Get a pointer to the output vector, where string offsets are
|
||||
stored. */
|
||||
/* Match succeeded. Get a pointer to the output vector, where string offsets
|
||||
are stored. */
|
||||
|
||||
ovector = pcre2_get_ovector_pointer(match_data);
|
||||
printf("Match succeeded at offset %d\n", (int)ovector[0]);
|
||||
|
@ -453,7 +453,7 @@ for (;;)
|
|||
return 1;
|
||||
}
|
||||
|
||||
/* Match succeded */
|
||||
/* Match succeeded */
|
||||
|
||||
printf("\nMatch succeeded again at offset %d\n", (int)ovector[0]);
|
||||
|
||||
|
|
|
@ -54,6 +54,7 @@ platforms:
|
|||
<pre>
|
||||
ARM 32-bit (v5, v7, and Thumb2)
|
||||
ARM 64-bit
|
||||
IBM s390x 64 bit
|
||||
Intel x86 32-bit and 64-bit
|
||||
MIPS 32-bit and 64-bit
|
||||
Power PC 32-bit and 64-bit
|
||||
|
@ -286,7 +287,7 @@ inefficient solution, and not recommended.
|
|||
This is a suggestion for how a multithreaded program that needs to set up
|
||||
non-default JIT stacks might operate:
|
||||
<pre>
|
||||
During thread initalization
|
||||
During thread initialization
|
||||
thread_local_var = pcre2_jit_stack_create(...)
|
||||
|
||||
During thread exit
|
||||
|
|
|
@ -745,7 +745,7 @@ Unicode support is not needed for these characters to be recognized.
|
|||
<P>
|
||||
It is possible to restrict \R to match only CR, LF, or CRLF (instead of the
|
||||
complete set of Unicode line endings) by setting the option PCRE2_BSR_ANYCRLF
|
||||
at compile time. (BSR is an abbrevation for "backslash R".) This can be made
|
||||
at compile time. (BSR is an abbreviation for "backslash R".) This can be made
|
||||
the default when PCRE2 is built; if this is the case, the other behaviour can
|
||||
be requested via the PCRE2_BSR_UNICODE option. It is also possible to specify
|
||||
these settings by starting a pattern string with one of the following
|
||||
|
@ -1090,7 +1090,7 @@ additional characters according to the following rules for ending a cluster:
|
|||
3. Do not break Hangul (a Korean script) syllable sequences. Hangul characters
|
||||
are of five types: L, V, T, LV, and LVT. An L character may be followed by an
|
||||
L, V, LV, or LVT character; an LV or V character may be followed by a V or T
|
||||
character; an LVT or T character may be follwed only by a T character.
|
||||
character; an LVT or T character may be followed only by a T character.
|
||||
</P>
|
||||
<P>
|
||||
4. Do not end before extending characters or spacing marks or the "zero-width
|
||||
|
@ -3607,7 +3607,7 @@ successful match if there is a later mismatch. Consider:
|
|||
</pre>
|
||||
If the subject is "aaaac...", after the first match attempt fails (starting at
|
||||
the first character in the string), the starting point skips on to start the
|
||||
next attempt at "c". Note that a possessive quantifer does not have the same
|
||||
next attempt at "c". Note that a possessive quantifier does not have the same
|
||||
effect as this example; although it would suppress backtracking during the
|
||||
first match attempt, the second attempt would start at the second character
|
||||
instead of skipping on to "c".
|
||||
|
|
|
@ -185,8 +185,8 @@ REVISION
|
|||
Last updated: 28 April 2021
|
||||
Copyright (c) 1997-2021 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2API(3) Library Functions Manual PCRE2API(3)
|
||||
|
||||
|
||||
|
@ -2583,14 +2583,14 @@ MATCHING A PATTERN: THE TRADITIONAL FUNCTION
|
|||
|
||||
which finds occurrences of "iss" in the middle of words. (\B matches
|
||||
only if the current position in the subject is not a word boundary.)
|
||||
When applied to the string "Mississipi" the first call to pcre2_match()
|
||||
finds the first occurrence. If pcre2_match() is called again with just
|
||||
the remainder of the subject, namely "issipi", it does not match, be-
|
||||
cause \B is always false at the start of the subject, which is deemed
|
||||
to be a word boundary. However, if pcre2_match() is passed the entire
|
||||
string again, but with startoffset set to 4, it finds the second occur-
|
||||
rence of "iss" because it is able to look behind the starting point to
|
||||
discover that it is preceded by a letter.
|
||||
When applied to the string "Mississippi" the first call to
|
||||
pcre2_match() finds the first occurrence. If pcre2_match() is called
|
||||
again with just the remainder of the subject, namely "issippi", it does
|
||||
not match, because \B is always false at the start of the subject,
|
||||
which is deemed to be a word boundary. However, if pcre2_match() is
|
||||
passed the entire string again, but with startoffset set to 4, it finds
|
||||
the second occurrence of "iss" because it is able to look behind the
|
||||
starting point to discover that it is preceded by a letter.
|
||||
|
||||
Finding all the matches in a subject is tricky when the pattern can
|
||||
match an empty string. It is possible to emulate Perl's /g behaviour by
|
||||
|
@ -3834,8 +3834,8 @@ REVISION
|
|||
Last updated: 04 November 2020
|
||||
Copyright (c) 1997-2020 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2BUILD(3) Library Functions Manual PCRE2BUILD(3)
|
||||
|
||||
|
||||
|
@ -4428,8 +4428,8 @@ REVISION
|
|||
Last updated: 20 March 2020
|
||||
Copyright (c) 1997-2020 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2CALLOUT(3) Library Functions Manual PCRE2CALLOUT(3)
|
||||
|
||||
|
||||
|
@ -4858,8 +4858,8 @@ REVISION
|
|||
Last updated: 03 February 2019
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2COMPAT(3) Library Functions Manual PCRE2COMPAT(3)
|
||||
|
||||
|
||||
|
@ -5071,8 +5071,8 @@ REVISION
|
|||
Last updated: 06 October 2020
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2JIT(3) Library Functions Manual PCRE2JIT(3)
|
||||
|
||||
|
||||
|
@ -5107,6 +5107,7 @@ AVAILABILITY OF JIT SUPPORT
|
|||
|
||||
ARM 32-bit (v5, v7, and Thumb2)
|
||||
ARM 64-bit
|
||||
IBM s390x 64 bit
|
||||
Intel x86 32-bit and 64-bit
|
||||
MIPS 32-bit and 64-bit
|
||||
Power PC 32-bit and 64-bit
|
||||
|
@ -5326,7 +5327,7 @@ CONTROLLING THE JIT STACK
|
|||
This is a suggestion for how a multithreaded program that needs to set
|
||||
up non-default JIT stacks might operate:
|
||||
|
||||
During thread initalization
|
||||
During thread initialization
|
||||
thread_local_var = pcre2_jit_stack_create(...)
|
||||
|
||||
During thread exit
|
||||
|
@ -5496,8 +5497,8 @@ REVISION
|
|||
Last updated: 23 May 2019
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2LIMITS(3) Library Functions Manual PCRE2LIMITS(3)
|
||||
|
||||
|
||||
|
@ -5566,8 +5567,8 @@ REVISION
|
|||
Last updated: 02 February 2019
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2MATCHING(3) Library Functions Manual PCRE2MATCHING(3)
|
||||
|
||||
|
||||
|
@ -5790,8 +5791,8 @@ REVISION
|
|||
Last updated: 23 May 2019
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2PARTIAL(3) Library Functions Manual PCRE2PARTIAL(3)
|
||||
|
||||
|
||||
|
@ -6170,8 +6171,8 @@ REVISION
|
|||
Last updated: 04 September 2019
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2PATTERN(3) Library Functions Manual PCRE2PATTERN(3)
|
||||
|
||||
|
||||
|
@ -6816,7 +6817,7 @@ BACKSLASH
|
|||
|
||||
It is possible to restrict \R to match only CR, LF, or CRLF (instead of
|
||||
the complete set of Unicode line endings) by setting the option
|
||||
PCRE2_BSR_ANYCRLF at compile time. (BSR is an abbrevation for "back-
|
||||
PCRE2_BSR_ANYCRLF at compile time. (BSR is an abbreviation for "back-
|
||||
slash R".) This can be made the default when PCRE2 is built; if this is
|
||||
the case, the other behaviour can be requested via the PCRE2_BSR_UNI-
|
||||
CODE option. It is also possible to specify these settings by starting
|
||||
|
@ -7016,8 +7017,8 @@ BACKSLASH
|
|||
3. Do not break Hangul (a Korean script) syllable sequences. Hangul
|
||||
characters are of five types: L, V, T, LV, and LVT. An L character may
|
||||
be followed by an L, V, LV, or LVT character; an LV or V character may
|
||||
be followed by a V or T character; an LVT or T character may be follwed
|
||||
only by a T character.
|
||||
be followed by a V or T character; an LVT or T character may be fol-
|
||||
lowed only by a T character.
|
||||
|
||||
4. Do not end before extending characters or spacing marks or the
|
||||
"zero-width joiner" character. Characters with the "mark" property al-
|
||||
|
@ -9388,7 +9389,7 @@ BACKTRACKING CONTROL
|
|||
If the subject is "aaaac...", after the first match attempt fails
|
||||
(starting at the first character in the string), the starting point
|
||||
skips on to start the next attempt at "c". Note that a possessive quan-
|
||||
tifer does not have the same effect as this example; although it would
|
||||
tifier does not have the same effect as this example; although it would
|
||||
suppress backtracking during the first match attempt, the second at-
|
||||
tempt would start at the second character instead of skipping on to
|
||||
"c".
|
||||
|
@ -9618,8 +9619,8 @@ REVISION
|
|||
Last updated: 06 October 2020
|
||||
Copyright (c) 1997-2020 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2PERFORM(3) Library Functions Manual PCRE2PERFORM(3)
|
||||
|
||||
|
||||
|
@ -9853,8 +9854,8 @@ REVISION
|
|||
Last updated: 03 February 2019
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2POSIX(3) Library Functions Manual PCRE2POSIX(3)
|
||||
|
||||
|
||||
|
@ -10187,8 +10188,8 @@ REVISION
|
|||
Last updated: 26 April 2021
|
||||
Copyright (c) 1997-2021 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2SAMPLE(3) Library Functions Manual PCRE2SAMPLE(3)
|
||||
|
||||
|
||||
|
@ -10466,8 +10467,8 @@ REVISION
|
|||
Last updated: 27 June 2018
|
||||
Copyright (c) 1997-2018 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2SYNTAX(3) Library Functions Manual PCRE2SYNTAX(3)
|
||||
|
||||
|
||||
|
@ -10982,8 +10983,8 @@ REVISION
|
|||
Last updated: 28 December 2019
|
||||
Copyright (c) 1997-2019 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
PCRE2UNICODE(3) Library Functions Manual PCRE2UNICODE(3)
|
||||
|
||||
|
||||
|
@ -11417,5 +11418,5 @@ REVISION
|
|||
Last updated: 23 February 2020
|
||||
Copyright (c) 1997-2020 University of Cambridge.
|
||||
------------------------------------------------------------------------------
|
||||
|
||||
|
||||
|
||||
|
||||
|
|
|
@ -2628,10 +2628,10 @@ lookbehind. For example, consider the pattern
|
|||
.sp
|
||||
which finds occurrences of "iss" in the middle of words. (\eB matches only if
|
||||
the current position in the subject is not a word boundary.) When applied to
|
||||
the string "Mississipi" the first call to \fBpcre2_match()\fP finds the first
|
||||
the string "Mississippi" the first call to \fBpcre2_match()\fP finds the first
|
||||
occurrence. If \fBpcre2_match()\fP is called again with just the remainder of
|
||||
the subject, namely "issipi", it does not match, because \eB is always false at
|
||||
the start of the subject, which is deemed to be a word boundary. However, if
|
||||
the subject, namely "issippi", it does not match, because \eB is always false
|
||||
at the start of the subject, which is deemed to be a word boundary. However, if
|
||||
\fBpcre2_match()\fP is passed the entire string again, but with
|
||||
\fIstartoffset\fP set to 4, it finds the second occurrence of "iss" because it
|
||||
is able to look behind the starting point to discover that it is preceded by a
|
||||
|
|
|
@ -215,8 +215,8 @@ if (rc < 0)
|
|||
return 1;
|
||||
}
|
||||
|
||||
/* Match succeded. Get a pointer to the output vector, where string offsets are
|
||||
stored. */
|
||||
/* Match succeeded. Get a pointer to the output vector, where string offsets
|
||||
are stored. */
|
||||
|
||||
ovector = pcre2_get_ovector_pointer(match_data);
|
||||
printf("Match succeeded at offset %d\en", (int)ovector[0]);
|
||||
|
@ -453,7 +453,7 @@ for (;;)
|
|||
return 1;
|
||||
}
|
||||
|
||||
/* Match succeded */
|
||||
/* Match succeeded */
|
||||
|
||||
printf("\enMatch succeeded again at offset %d\en", (int)ovector[0]);
|
||||
|
||||
|
|
|
@ -267,7 +267,7 @@ inefficient solution, and not recommended.
|
|||
This is a suggestion for how a multithreaded program that needs to set up
|
||||
non-default JIT stacks might operate:
|
||||
.sp
|
||||
During thread initalization
|
||||
During thread initialization
|
||||
thread_local_var = pcre2_jit_stack_create(...)
|
||||
.sp
|
||||
During thread exit
|
||||
|
|
|
@ -741,7 +741,7 @@ Unicode support is not needed for these characters to be recognized.
|
|||
.P
|
||||
It is possible to restrict \eR to match only CR, LF, or CRLF (instead of the
|
||||
complete set of Unicode line endings) by setting the option PCRE2_BSR_ANYCRLF
|
||||
at compile time. (BSR is an abbrevation for "backslash R".) This can be made
|
||||
at compile time. (BSR is an abbreviation for "backslash R".) This can be made
|
||||
the default when PCRE2 is built; if this is the case, the other behaviour can
|
||||
be requested via the PCRE2_BSR_UNICODE option. It is also possible to specify
|
||||
these settings by starting a pattern string with one of the following
|
||||
|
@ -1082,7 +1082,7 @@ additional characters according to the following rules for ending a cluster:
|
|||
3. Do not break Hangul (a Korean script) syllable sequences. Hangul characters
|
||||
are of five types: L, V, T, LV, and LVT. An L character may be followed by an
|
||||
L, V, LV, or LVT character; an LV or V character may be followed by a V or T
|
||||
character; an LVT or T character may be follwed only by a T character.
|
||||
character; an LVT or T character may be followed only by a T character.
|
||||
.P
|
||||
4. Do not end before extending characters or spacing marks or the "zero-width
|
||||
joiner" character. Characters with the "mark" property always have the
|
||||
|
@ -3658,7 +3658,7 @@ successful match if there is a later mismatch. Consider:
|
|||
.sp
|
||||
If the subject is "aaaac...", after the first match attempt fails (starting at
|
||||
the first character in the string), the starting point skips on to start the
|
||||
next attempt at "c". Note that a possessive quantifer does not have the same
|
||||
next attempt at "c". Note that a possessive quantifier does not have the same
|
||||
effect as this example; although it would suppress backtracking during the
|
||||
first match attempt, the second attempt would start at the second character
|
||||
instead of skipping on to "c".
|
||||
|
|
|
@ -198,8 +198,8 @@ if (rc < 0)
|
|||
return 1;
|
||||
}
|
||||
|
||||
/* Match succeded. Get a pointer to the output vector, where string offsets are
|
||||
stored. */
|
||||
/* Match succeeded. Get a pointer to the output vector, where string offsets
|
||||
are stored. */
|
||||
|
||||
ovector = pcre2_get_ovector_pointer(match_data);
|
||||
printf("Match succeeded at offset %d\n", (int)ovector[0]);
|
||||
|
@ -436,7 +436,7 @@ for (;;)
|
|||
return 1;
|
||||
}
|
||||
|
||||
/* Match succeded */
|
||||
/* Match succeeded */
|
||||
|
||||
printf("\nMatch succeeded again at offset %d\n", (int)ovector[0]);
|
||||
|
||||
|
|
Loading…
Reference in New Issue