Initial commit of quickbook generated docs.

[SVN r37943]
This commit is contained in:
John Maddock
2007-06-08 09:23:23 +00:00
parent 5f96b68080
commit 7e4d1b570c
82 changed files with 24467 additions and 0 deletions

File diff suppressed because it is too large Load Diff

View File

@ -0,0 +1,657 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>POSIX Basic Regular Expression Syntax</title>
<link rel="stylesheet" href="../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../index.html" title="Boost.Regex">
<link rel="up" href="../syntax.html" title="Regular Expression Syntax">
<link rel="prev" href="basic_extended.html" title="POSIX Extended Regular Expression Syntax">
<link rel="next" href="character_classes.html" title="Character Class Names">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="basic_extended.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="character_classes.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h3 class="title">
<a name="boost_regex.syntax.basic_syntax"></a><a href="basic_syntax.html" title="POSIX Basic Regular Expression Syntax"> POSIX Basic Regular
Expression Syntax</a></h3></div></div></div>
<a name="boost_regex.syntax.basic_syntax.synopsis"></a><h4>
<a name="id518494"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.synopsis">Synopsis</a>
</h4>
<p>
The POSIX-Basic regular expression syntax is used by the Unix utility <code class="computeroutput"><span class="identifier">sed</span></code>, and variations are used by <code class="computeroutput"><span class="identifier">grep</span></code> and <code class="computeroutput"><span class="identifier">emacs</span></code>.
You can construct POSIX basic regular expressions in Boost.Regex by passing
the flag <code class="computeroutput"><span class="identifier">basic</span></code> to the regex
constructor (see <a href="../ref/syntax_option_type.html" title="syntax_option_type"><code class="computeroutput"><span class="identifier">syntax_option_type</span></code></a>), for example:
</p>
<pre class="programlisting">
<span class="comment">// e1 is a case sensitive POSIX-Basic expression:
</span><span class="identifier">boost</span><span class="special">::</span><span class="identifier">regex</span> <span class="identifier">e1</span><span class="special">(</span><span class="identifier">my_expression</span><span class="special">,</span> <span class="identifier">boost</span><span class="special">::</span><span class="identifier">regex</span><span class="special">::</span><span class="identifier">basic</span><span class="special">);</span>
<span class="comment">// e2 a case insensitive POSIX-Basic expression:
</span><span class="identifier">boost</span><span class="special">::</span><span class="identifier">regex</span> <span class="identifier">e2</span><span class="special">(</span><span class="identifier">my_expression</span><span class="special">,</span> <span class="identifier">boost</span><span class="special">::</span><span class="identifier">regex</span><span class="special">::</span><span class="identifier">basic</span><span class="special">|</span><span class="identifier">boost</span><span class="special">::</span><span class="identifier">regex</span><span class="special">::</span><span class="identifier">icase</span><span class="special">);</span>
</pre>
<a name="boost_regex.posix_basic"></a><p>
</p>
<a name="boost_regex.syntax.basic_syntax.posix_basic_syntax"></a><h4>
<a name="id518788"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.posix_basic_syntax">POSIX
Basic Syntax</a>
</h4>
<p>
In POSIX-Basic regular expressions, all characters are match themselves except
for the following special characters:
</p>
<pre class="programlisting">.[\*^$</pre>
<a name="boost_regex.syntax.basic_syntax.wildcard_"></a><h5>
<a name="id518829"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.wildcard_">Wildcard:</a>
</h5>
<p>
The single character '.' when used outside of a character set will match
any single character except:
</p>
<div class="itemizedlist"><ul type="disc">
<li>
The NULL character when the flag <code class="computeroutput"><span class="identifier">match_no_dot_null</span></code>
is passed to the matching algorithms.
</li>
<li>
The newline character when the flag <code class="computeroutput"><span class="identifier">match_not_dot_newline</span></code>
is passed to the matching algorithms.
</li>
</ul></div>
<a name="boost_regex.syntax.basic_syntax.anchors_"></a><h5>
<a name="id518899"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.anchors_">Anchors:</a>
</h5>
<p>
A '^' character shall match the start of a line when used as the first character
of an expression, or the first character of a sub-expression.
</p>
<p>
A '$' character shall match the end of a line when used as the last character
of an expression, or the last character of a sub-expression.
</p>
<a name="boost_regex.syntax.basic_syntax.marked_sub_expressions_"></a><h5>
<a name="id518937"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.marked_sub_expressions_">Marked
sub-expressions:</a>
</h5>
<p>
A section beginning <code class="computeroutput"><span class="special">\(</span></code> and ending
<code class="computeroutput"><span class="special">\)</span></code> acts as a marked sub-expression.
Whatever matched the sub-expression is split out in a separate field by the
matching algorithms. Marked sub-expressions can also repeated, or referred-to
by a back-reference.
</p>
<a name="boost_regex.syntax.basic_syntax.repeats_"></a><h5>
<a name="id518996"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.repeats_">Repeats:</a>
</h5>
<p>
Any atom (a single character, a marked sub-expression, or a character class)
can be repeated with the * operator.
</p>
<p>
For example <code class="computeroutput"><span class="identifier">a</span><span class="special">*</span></code>
will match any number of letter a's repeated zero or more times (an atom
repeated zero times matches an empty string), so the expression <code class="computeroutput"><span class="identifier">a</span><span class="special">*</span><span class="identifier">b</span></code>
will match any of the following:
</p>
<pre class="programlisting">b
ab
aaaaaaaab
</pre>
<p>
An atom can also be repeated with a bounded repeat:
</p>
<p>
<code class="computeroutput"><span class="identifier">a</span><span class="special">\{</span><span class="identifier">n</span><span class="special">\}</span></code> Matches
'a' repeated exactly n times.
</p>
<p>
<code class="computeroutput"><span class="identifier">a</span><span class="special">\{</span><span class="identifier">n</span><span class="special">,\}</span></code> Matches
'a' repeated n or more times.
</p>
<p>
<code class="computeroutput"><span class="identifier">a</span><span class="special">\{</span><span class="identifier">n</span><span class="special">,</span> <span class="identifier">m</span><span class="special">\}</span></code> Matches 'a' repeated between n and m times
inclusive.
</p>
<p>
For example:
</p>
<pre class="programlisting">^a{2,3}$</pre>
<p>
Will match either of:
</p>
<pre class="programlisting">aa
aaa
</pre>
<p>
But neither of:
</p>
<pre class="programlisting">a
aaaa
</pre>
<p>
It is an error to use a repeat operator, if the preceding construct can not
be repeated, for example:
</p>
<pre class="programlisting">a(*)</pre>
<p>
Will raise an error, as there is nothing for the * operator to be applied
to.
</p>
<a name="boost_regex.syntax.basic_syntax.back_references_"></a><h5>
<a name="id519242"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.back_references_">Back references:</a>
</h5>
<p>
An escape character followed by a digit <span class="emphasis"><em>n</em></span>, where <span class="emphasis"><em>n</em></span>
is in the range 1-9, matches the same string that was matched by sub-expression
<span class="emphasis"><em>n</em></span>. For example the expression:
</p>
<pre class="programlisting">^\(a*\).*\1$</pre>
<p>
Will match the string:
</p>
<pre class="programlisting">aaabbaaa</pre>
<p>
But not the string:
</p>
<pre class="programlisting">aaabba</pre>
<a name="boost_regex.syntax.basic_syntax.character_sets_"></a><h5>
<a name="id519318"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.character_sets_">Character
sets:</a>
</h5>
<p>
A character set is a bracket-expression starting with [ and ending with ],
it defines a set of characters, and matches any single character that is
a member of that set.
</p>
<p>
A bracket expression may contain any combination of the following:
</p>
<a name="boost_regex.syntax.basic_syntax.single_characters_"></a><h6>
<a name="id519355"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.single_characters_">Single
characters:</a>
</h6>
<p>
For example <code class="computeroutput"><span class="special">[</span><span class="identifier">abc</span><span class="special">]</span></code>, will match any of the characters 'a', 'b',
or 'c'.
</p>
<a name="boost_regex.syntax.basic_syntax.character_ranges_"></a><h6>
<a name="id519408"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.character_ranges_">Character
ranges:</a>
</h6>
<p>
For example <code class="computeroutput"><span class="special">[</span><span class="identifier">a</span><span class="special">-</span><span class="identifier">c</span><span class="special">]</span></code>
will match any single character in the range 'a' to 'c'. By default, for
POSIX-Basic regular expressions, a character <span class="emphasis"><em>x</em></span> is within
the range <span class="emphasis"><em>y</em></span> to <span class="emphasis"><em>z</em></span>, if it collates
within that range; this results in locale specific behavior. This behavior
can be turned off by unsetting the <code class="computeroutput"><span class="identifier">collate</span></code>
option flag when constructing the regular expression - in which case whether
a character appears within a range is determined by comparing the code points
of the characters only.
</p>
<a name="boost_regex.syntax.basic_syntax.negation_"></a><h6>
<a name="id519501"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.negation_">Negation:</a>
</h6>
<p>
If the bracket-expression begins with the ^ character, then it matches the
complement of the characters it contains, for example <code class="computeroutput"><span class="special">[^</span><span class="identifier">a</span><span class="special">-</span><span class="identifier">c</span><span class="special">]</span></code> matches any character that is not in the
range a-c.
</p>
<a name="boost_regex.syntax.basic_syntax.character_classes_"></a><h6>
<a name="id519563"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.character_classes_">Character
classes:</a>
</h6>
<p>
An expression of the form <code class="computeroutput"><span class="special">[[:</span><span class="identifier">name</span><span class="special">:]]</span></code>
matches the named character class "name", for example <code class="computeroutput"><span class="special">[[:</span><span class="identifier">lower</span><span class="special">:]]</span></code> matches any lower case character. See
<a href="character_classes.html" title="Character Class Names">character class names</a>.
</p>
<a name="boost_regex.syntax.basic_syntax.collating_elements_"></a><h6>
<a name="id519647"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.collating_elements_">Collating
Elements:</a>
</h6>
<p>
An expression of the form <code class="computeroutput"><span class="special">[[.</span><span class="identifier">col</span><span class="special">.]</span></code> matches
the collating element <span class="emphasis"><em>col</em></span>. A collating element is any
single character, or any sequence of characters that collates as a single
unit. Collating elements may also be used as the end point of a range, for
example: <code class="computeroutput"><span class="special">[[.</span><span class="identifier">ae</span><span class="special">.]-</span><span class="identifier">c</span><span class="special">]</span></code>
matches the character sequence "ae", plus any single character
in the rangle "ae"-c, assuming that "ae" is treated as
a single collating element in the current locale.
</p>
<p>
Collating elements may be used in place of escapes (which are not normally
allowed inside character sets), for example <code class="computeroutput"><span class="special">[[.^.]</span><span class="identifier">abc</span><span class="special">]</span></code> would
match either one of the characters 'abc^'.
</p>
<p>
As an extension, a collating element may also be specified via its symbolic
name, for example:
</p>
<pre class="programlisting">[[.NUL.]]</pre>
<p>
matches a 'NUL' character. See <a href="collating_names.html" title="Collating Names">collating
element names</a>.
</p>
<a name="boost_regex.syntax.basic_syntax.equivalence_classes_"></a><h6>
<a name="id519794"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.equivalence_classes_">Equivalence
classes:</a>
</h6>
<p>
An expression of theform <code class="computeroutput"><span class="special">[[=</span><span class="identifier">col</span><span class="special">=]]</span></code>,
matches any character or collating element whose primary sort key is the
same as that for collating element <span class="emphasis"><em>col</em></span>, as with collating
elements the name <span class="emphasis"><em>col</em></span> may be a <a href="collating_names.html" title="Collating Names">collating
symbolic name</a>. A primary sort key is one that ignores case, accentation,
or locale-specific tailorings; so for example <code class="computeroutput"><span class="special">[[=</span><span class="identifier">a</span><span class="special">=]]</span></code> matches
any of the characters: a, <20>, <20>, <20>, <20>, <20>, <20>, A, <20>, <20>, <20>, <20>, <20> and <20>. Unfortunately implementation
of this is reliant on the platform's collation and localisation support;
this feature can not be relied upon to work portably across all platforms,
or even all locales on one platform.
</p>
<a name="boost_regex.syntax.basic_syntax.combinations_"></a><h6>
<a name="id519904"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.combinations_">Combinations:</a>
</h6>
<p>
All of the above can be combined in one character set declaration, for example:
<code class="computeroutput"><span class="special">[[:</span><span class="identifier">digit</span><span class="special">:]</span><span class="identifier">a</span><span class="special">-</span><span class="identifier">c</span><span class="special">[.</span><span class="identifier">NUL</span><span class="special">.]].</span></code>
</p>
<a name="boost_regex.syntax.basic_syntax.escapes"></a><h5>
<a name="id519982"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.escapes">Escapes</a>
</h5>
<p>
With the exception of the escape sequences \{, \}, \(, and \), which are
documented above, an escape followed by any character matches that character.
This can be used to make the special characters
</p>
<pre class="programlisting">.[\*^$</pre>
<p>
"ordinary". Note that the escape character loses its special meaning
inside a character set, so <code class="computeroutput"><span class="special">[\^]</span></code>
will match either a literal '\' or a '^'.
</p>
<a name="boost_regex.syntax.basic_syntax.what_gets_matched"></a><h4>
<a name="id520041"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.what_gets_matched">What Gets
Matched</a>
</h4>
<p>
When there is more that one way to match a regular expression, the "best"
possible match is obtained using the <a href="leftmost_longest_rule.html" title="The Leftmost Longest Rule">leftmost-longest
rule</a>.
</p>
<a name="boost_regex.syntax.basic_syntax.variations"></a><h4>
<a name="id520082"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.variations">Variations</a>
</h4>
<a name="boost_regex.grep_syntax"></a><p>
</p>
<a name="boost_regex.syntax.basic_syntax.grep"></a><h5>
<a name="id520116"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.grep">Grep</a>
</h5>
<p>
When an expression is compiled with the flag <code class="computeroutput"><span class="identifier">grep</span></code>
set, then the expression is treated as a newline separated list of <a href="basic_syntax.html#boost_regex.posix_basic">POSIX-Basic expressions</a>, a match
is found if any of the expressions in the list match, for example:
</p>
<pre class="programlisting">
<span class="identifier">boost</span><span class="special">::</span><span class="identifier">regex</span> <span class="identifier">e</span><span class="special">(</span><span class="string">"abc\ndef"</span><span class="special">,</span> <span class="identifier">boost</span><span class="special">::</span><span class="identifier">regex</span><span class="special">::</span><span class="identifier">grep</span><span class="special">);</span>
</pre>
<p>
will match either of the <a href="basic_syntax.html#boost_regex.posix_basic">POSIX-Basic
expressions</a> "abc" or "def".
</p>
<p>
As its name suggests, this behavior is consistent with the Unix utility grep.
</p>
<a name="boost_regex.syntax.basic_syntax.emacs"></a><h5>
<a name="id520263"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.emacs">emacs</a>
</h5>
<p>
In addition to the <a href="basic_syntax.html#boost_regex.posix_basic">POSIX-Basic features</a>
the following characters are also special:
</p>
<div class="informaltable"><table class="table">
<colgroup>
<col>
<col>
</colgroup>
<thead><tr>
<th>
<p>
Character
</p>
</th>
<th>
<p>
Description
</p>
</th>
</tr></thead>
<tbody>
<tr>
<td>
<p>
+
</p>
</td>
<td>
<p>
repeats the preceding atom one or more times.
</p>
</td>
</tr>
<tr>
<td>
<p>
?
</p>
</td>
<td>
<p>
repeats the preceding atom zero or one times.
</p>
</td>
</tr>
<tr>
<td>
<p>
*?
</p>
</td>
<td>
<p>
A non-greedy version of *.
</p>
</td>
</tr>
<tr>
<td>
<p>
+?
</p>
</td>
<td>
<p>
A non-greedy version of +.
</p>
</td>
</tr>
<tr>
<td>
<p>
??
</p>
</td>
<td>
<p>
A non-greedy version of ?.
</p>
</td>
</tr>
</tbody>
</table></div>
<p>
And the following escape sequences are also recognised:
</p>
<div class="informaltable"><table class="table">
<colgroup>
<col>
<col>
</colgroup>
<thead><tr>
<th>
<p>
Escape
</p>
</th>
<th>
<p>
Description
</p>
</th>
</tr></thead>
<tbody>
<tr>
<td>
<p>
\|
</p>
</td>
<td>
<p>
specifies an alternative.
</p>
</td>
</tr>
<tr>
<td>
<p>
\(?: ... )
</p>
</td>
<td>
<p>
is a non-marking grouping construct - allows you to lexically group
something without spitting out an extra sub-expression.
</p>
</td>
</tr>
<tr>
<td>
<p>
\w
</p>
</td>
<td>
<p>
matches any word character.
</p>
</td>
</tr>
<tr>
<td>
<p>
\W
</p>
</td>
<td>
<p>
matches any non-word character.
</p>
</td>
</tr>
<tr>
<td>
<p>
\sx
</p>
</td>
<td>
<p>
matches any character in the syntax group x, the following emacs
groupings are supported: 's', ' ', '_', 'w', '.', ')', '(', '"',
'\'', '&gt;' and '&lt;'. Refer to the emacs docs for details.
</p>
</td>
</tr>
<tr>
<td>
<p>
\Sx
</p>
</td>
<td>
<p>
matches any character not in the syntax grouping x.
</p>
</td>
</tr>
<tr>
<td>
<p>
\c and \C
</p>
</td>
<td>
<p>
These are not supported.
</p>
</td>
</tr>
<tr>
<td>
<p>
\`
</p>
</td>
<td>
<p>
matches zero characters only at the start of a buffer (or string
being matched).
</p>
</td>
</tr>
<tr>
<td>
<p>
\'
</p>
</td>
<td>
<p>
matches zero characters only at the end of a buffer (or string being
matched).
</p>
</td>
</tr>
<tr>
<td>
<p>
\b
</p>
</td>
<td>
<p>
matches zero characters at a word boundary.
</p>
</td>
</tr>
<tr>
<td>
<p>
\B
</p>
</td>
<td>
<p>
matches zero characters, not at a word boundary.
</p>
</td>
</tr>
<tr>
<td>
<p>
\&lt;
</p>
</td>
<td>
<p>
matches zero characters only at the start of a word.
</p>
</td>
</tr>
<tr>
<td>
<p>
\&gt;
</p>
</td>
<td>
<p>
matches zero characters only at the end of a word.
</p>
</td>
</tr>
</tbody>
</table></div>
<p>
Finally, you should note that emacs style regular expressions are matched
according to the <a href="perl_syntax.html#boost_regex.syntax.perl_syntax.what_gets_matched">Perl
"depth first search" rules</a>. Emacs expressions are matched
this way because they contain Perl-like extensions, that do not interact
well with the <a href="leftmost_longest_rule.html" title="The Leftmost Longest Rule">POSIX-style
leftmost-longest rule</a>.
</p>
<a name="boost_regex.syntax.basic_syntax.options"></a><h4>
<a name="id520760"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.options">Options</a>
</h4>
<p>
There are a <a href="../ref/syntax_option_type/syntax_option_type_basic.html" title="Options for POSIX Basic Regular Expressions">variety
of flags</a> that may be combined with the <code class="computeroutput"><span class="identifier">basic</span></code>
and <code class="computeroutput"><span class="identifier">grep</span></code> options when constructing
the regular expression, in particular note that the <a href="../ref/syntax_option_type/syntax_option_type_basic.html" title="Options for POSIX Basic Regular Expressions"><code class="computeroutput"><span class="identifier">newline_alt</span></code>, <code class="computeroutput"><span class="identifier">no_char_classes</span></code>,
<code class="computeroutput"><span class="identifier">no</span><span class="special">-</span><span class="identifier">intervals</span></code>, <code class="computeroutput"><span class="identifier">bk_plus_qm</span></code>
and <code class="computeroutput"><span class="identifier">bk_plus_vbar</span></code></a> options
all alter the syntax, while the <a href="../ref/syntax_option_type/syntax_option_type_basic.html" title="Options for POSIX Basic Regular Expressions"><code class="computeroutput"><span class="identifier">collate</span></code> and <code class="computeroutput"><span class="identifier">icase</span></code>
options</a> modify how the case and locale sensitivity are to be applied.
</p>
<a name="boost_regex.syntax.basic_syntax.references"></a><h4>
<a name="id520935"></a>
<a href="basic_syntax.html#boost_regex.syntax.basic_syntax.references">References</a>
</h4>
<p>
<a href="http://www.opengroup.org/onlinepubs/000095399/basedefs/xbd_chap09.html" target="_top">IEEE
Std 1003.1-2001, Portable Operating System Interface (POSIX ), Base Definitions
and Headers, Section 9, Regular Expressions (FWD.1).</a>
</p>
<p>
<a href="http://www.opengroup.org/onlinepubs/000095399/utilities/grep.html" target="_top">IEEE
Std 1003.1-2001, Portable Operating System Interface (POSIX ), Shells and
Utilities, Section 4, Utilities, grep (FWD.1).</a>
</p>
<p>
<a href="http://www.gnu.org/software/emacs/" target="_top">Emacs Version 21.3.</a>
</p>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="basic_extended.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="character_classes.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,45 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Character Class Names</title>
<link rel="stylesheet" href="../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../index.html" title="Boost.Regex">
<link rel="up" href="../syntax.html" title="Regular Expression Syntax">
<link rel="prev" href="basic_syntax.html" title="POSIX Basic Regular Expression Syntax">
<link rel="next" href="character_classes/std_char_clases.html" title="Character Classes that are Always Supported">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="basic_syntax.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="character_classes/std_char_clases.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h3 class="title">
<a name="boost_regex.syntax.character_classes"></a><a href="character_classes.html" title="Character Class Names"> Character Class
Names</a></h3></div></div></div>
<div class="toc"><dl>
<dt><span class="section"><a href="character_classes/std_char_clases.html">
Character Classes that are Always Supported</a></span></dt>
<dt><span class="section"><a href="character_classes/optional_char_class_names.html">
Character classes that are supported by Unicode Regular Expressions</a></span></dt>
</dl></div>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="basic_syntax.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="character_classes/std_char_clases.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,542 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Character classes that are supported by Unicode Regular Expressions</title>
<link rel="stylesheet" href="../../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../../index.html" title="Boost.Regex">
<link rel="up" href="../character_classes.html" title="Character Class Names">
<link rel="prev" href="std_char_clases.html" title="Character Classes that are Always Supported">
<link rel="next" href="../collating_names.html" title="Collating Names">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="std_char_clases.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../character_classes.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="../collating_names.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h4 class="title">
<a name="boost_regex.syntax.character_classes.optional_char_class_names"></a><a href="optional_char_class_names.html" title="Character classes that are supported by Unicode Regular Expressions">
Character classes that are supported by Unicode Regular Expressions</a></h4></div></div></div>
<p>
The following character classes are only supported by Unicode Regular Expressions:
that is those that use the <code class="computeroutput"><span class="identifier">u32regex</span></code>
type. The names used are the same as those from Chapter 4 of the Unicode
standard.
</p>
<div class="informaltable"><table class="table">
<colgroup>
<col>
<col>
</colgroup>
<thead><tr>
<th>
<p>
Short Name
</p>
</th>
<th>
<p>
Long Name
</p>
</th>
</tr></thead>
<tbody>
<tr>
<td>
<p>
</p>
</td>
<td>
<p>
ASCII
</p>
</td>
</tr>
<tr>
<td>
<p>
</p>
</td>
<td>
<p>
Any
</p>
</td>
</tr>
<tr>
<td>
<p>
</p>
</td>
<td>
<p>
Assigned
</p>
</td>
</tr>
<tr>
<td>
<p>
C*
</p>
</td>
<td>
<p>
Other
</p>
</td>
</tr>
<tr>
<td>
<p>
Cc
</p>
</td>
<td>
<p>
Control
</p>
</td>
</tr>
<tr>
<td>
<p>
Cf
</p>
</td>
<td>
<p>
Format
</p>
</td>
</tr>
<tr>
<td>
<p>
Cn
</p>
</td>
<td>
<p>
Not Assigned
</p>
</td>
</tr>
<tr>
<td>
<p>
Co
</p>
</td>
<td>
<p>
Private Use
</p>
</td>
</tr>
<tr>
<td>
<p>
Cs
</p>
</td>
<td>
<p>
Surrogate
</p>
</td>
</tr>
<tr>
<td>
<p>
L*
</p>
</td>
<td>
<p>
Letter
</p>
</td>
</tr>
<tr>
<td>
<p>
Ll
</p>
</td>
<td>
<p>
Lowercase Letter
</p>
</td>
</tr>
<tr>
<td>
<p>
Lm
</p>
</td>
<td>
<p>
Modifier Letter
</p>
</td>
</tr>
<tr>
<td>
<p>
Lo
</p>
</td>
<td>
<p>
Other Letter
</p>
</td>
</tr>
<tr>
<td>
<p>
Lt
</p>
</td>
<td>
<p>
Titlecase
</p>
</td>
</tr>
<tr>
<td>
<p>
Lu
</p>
</td>
<td>
<p>
Uppercase Letter
</p>
</td>
</tr>
<tr>
<td>
<p>
M*
</p>
</td>
<td>
<p>
Mark
</p>
</td>
</tr>
<tr>
<td>
<p>
Mc
</p>
</td>
<td>
<p>
Spacing Combining Mark
</p>
</td>
</tr>
<tr>
<td>
<p>
Me
</p>
</td>
<td>
<p>
Enclosing Mark
</p>
</td>
</tr>
<tr>
<td>
<p>
Mn
</p>
</td>
<td>
<p>
Non-Spacing Mark
</p>
</td>
</tr>
<tr>
<td>
<p>
N*
</p>
</td>
<td>
<p>
Number
</p>
</td>
</tr>
<tr>
<td>
<p>
Nd
</p>
</td>
<td>
<p>
Decimal Digit Number
</p>
</td>
</tr>
<tr>
<td>
<p>
Nl
</p>
</td>
<td>
<p>
Letter Number
</p>
</td>
</tr>
<tr>
<td>
<p>
No
</p>
</td>
<td>
<p>
Other Number
</p>
</td>
</tr>
<tr>
<td>
<p>
P*
</p>
</td>
<td>
<p>
Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
Pc
</p>
</td>
<td>
<p>
Connector Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
Pd
</p>
</td>
<td>
<p>
Dash Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
Pe
</p>
</td>
<td>
<p>
Close Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
Pf
</p>
</td>
<td>
<p>
Final Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
Pi
</p>
</td>
<td>
<p>
Initial Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
Po
</p>
</td>
<td>
<p>
Other Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
Ps
</p>
</td>
<td>
<p>
Open Punctuation
</p>
</td>
</tr>
<tr>
<td>
<p>
S*
</p>
</td>
<td>
<p>
Symbol
</p>
</td>
</tr>
<tr>
<td>
<p>
Sc
</p>
</td>
<td>
<p>
Currency Symbol
</p>
</td>
</tr>
<tr>
<td>
<p>
Sk
</p>
</td>
<td>
<p>
Modifier Symbol
</p>
</td>
</tr>
<tr>
<td>
<p>
Sm
</p>
</td>
<td>
<p>
Math Symbol
</p>
</td>
</tr>
<tr>
<td>
<p>
So
</p>
</td>
<td>
<p>
Other Symbol
</p>
</td>
</tr>
<tr>
<td>
<p>
Z*
</p>
</td>
<td>
<p>
Separator
</p>
</td>
</tr>
<tr>
<td>
<p>
Zl
</p>
</td>
<td>
<p>
Line Separator
</p>
</td>
</tr>
<tr>
<td>
<p>
Zp
</p>
</td>
<td>
<p>
Paragraph Separator
</p>
</td>
</tr>
<tr>
<td>
<p>
Zs
</p>
</td>
<td>
<p>
Space Separator
</p>
</td>
</tr>
</tbody>
</table></div>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="std_char_clases.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../character_classes.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="../collating_names.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,391 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Character Classes that are Always Supported</title>
<link rel="stylesheet" href="../../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../../index.html" title="Boost.Regex">
<link rel="up" href="../character_classes.html" title="Character Class Names">
<link rel="prev" href="../character_classes.html" title="Character Class Names">
<link rel="next" href="optional_char_class_names.html" title="Character classes that are supported by Unicode Regular Expressions">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="../character_classes.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../character_classes.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="optional_char_class_names.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h4 class="title">
<a name="boost_regex.syntax.character_classes.std_char_clases"></a><a href="std_char_clases.html" title="Character Classes that are Always Supported">
Character Classes that are Always Supported</a></h4></div></div></div>
<p>
The following character class names are always supported by Boost.Regex:
</p>
<div class="informaltable"><table class="table">
<colgroup>
<col>
<col>
<col>
</colgroup>
<thead><tr>
<th>
<p>
Name
</p>
</th>
<th>
<p>
POSIX-standard name
</p>
</th>
<th>
<p>
Description
</p>
</th>
</tr></thead>
<tbody>
<tr>
<td>
<p>
alnum
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any alpha-numeric character.
</p>
</td>
</tr>
<tr>
<td>
<p>
alpha
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any alphabetic character.
</p>
</td>
</tr>
<tr>
<td>
<p>
blank
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any whitespace character that is not a line separator.
</p>
</td>
</tr>
<tr>
<td>
<p>
cntrl
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any control character.
</p>
</td>
</tr>
<tr>
<td>
<p>
d
</p>
</td>
<td>
<p>
No
</p>
</td>
<td>
<p>
Any decimal digit
</p>
</td>
</tr>
<tr>
<td>
<p>
digit
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any decimal digit.
</p>
</td>
</tr>
<tr>
<td>
<p>
graph
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any graphical character.
</p>
</td>
</tr>
<tr>
<td>
<p>
l
</p>
</td>
<td>
<p>
No
</p>
</td>
<td>
<p>
Any lower case character.
</p>
</td>
</tr>
<tr>
<td>
<p>
lower
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any lower case character.
</p>
</td>
</tr>
<tr>
<td>
<p>
print
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any printable character.
</p>
</td>
</tr>
<tr>
<td>
<p>
punct
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any punctuation character.
</p>
</td>
</tr>
<tr>
<td>
<p>
s
</p>
</td>
<td>
<p>
No
</p>
</td>
<td>
<p>
Any whitespace character.
</p>
</td>
</tr>
<tr>
<td>
<p>
space
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any whitespace character.
</p>
</td>
</tr>
<tr>
<td>
<p>
unicode
</p>
</td>
<td>
<p>
No
</p>
</td>
<td>
<p>
Any extended character whose code point is above 255 in value.
</p>
</td>
</tr>
<tr>
<td>
<p>
u
</p>
</td>
<td>
<p>
No
</p>
</td>
<td>
<p>
Any upper case character.
</p>
</td>
</tr>
<tr>
<td>
<p>
upper
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any upper case character.
</p>
</td>
</tr>
<tr>
<td>
<p>
w
</p>
</td>
<td>
<p>
No
</p>
</td>
<td>
<p>
Any word character (alphanumeric characters plus the underscore).
</p>
</td>
</tr>
<tr>
<td>
<p>
word
</p>
</td>
<td>
<p>
No
</p>
</td>
<td>
<p>
Any word character (alphanumeric characters plus the underscore).
</p>
</td>
</tr>
<tr>
<td>
<p>
xdigit
</p>
</td>
<td>
<p>
Yes
</p>
</td>
<td>
<p>
Any hexadecimal digit character.
</p>
</td>
</tr>
</tbody>
</table></div>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="../character_classes.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../character_classes.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="optional_char_class_names.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,45 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Collating Names</title>
<link rel="stylesheet" href="../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../index.html" title="Boost.Regex">
<link rel="up" href="../syntax.html" title="Regular Expression Syntax">
<link rel="prev" href="character_classes/optional_char_class_names.html" title="Character classes that are supported by Unicode Regular Expressions">
<link rel="next" href="collating_names/digraphs.html" title="Digraphs">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="character_classes/optional_char_class_names.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="collating_names/digraphs.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h3 class="title">
<a name="boost_regex.syntax.collating_names"></a><a href="collating_names.html" title="Collating Names"> Collating Names</a></h3></div></div></div>
<div class="toc"><dl>
<dt><span class="section"><a href="collating_names/digraphs.html"> Digraphs</a></span></dt>
<dt><span class="section"><a href="collating_names/posix_symbolic_names.html">
POSIX Symbolic Names</a></span></dt>
<dt><span class="section"><a href="collating_names/named_unicode.html">
Named Unicode Characters</a></span></dt>
</dl></div>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="character_classes/optional_char_class_names.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="collating_names/digraphs.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,56 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Digraphs</title>
<link rel="stylesheet" href="../../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../../index.html" title="Boost.Regex">
<link rel="up" href="../collating_names.html" title="Collating Names">
<link rel="prev" href="../collating_names.html" title="Collating Names">
<link rel="next" href="posix_symbolic_names.html" title="POSIX Symbolic Names">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="../collating_names.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../collating_names.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="posix_symbolic_names.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h4 class="title">
<a name="boost_regex.syntax.collating_names.digraphs"></a><a href="digraphs.html" title="Digraphs"> Digraphs</a></h4></div></div></div>
<p>
The following are treated as valid digraphs when used as a collating name:
</p>
<p>
"ae", "Ae", "AE", "ch", "Ch",
"CH", "ll", "Ll", "LL", "ss",
"Ss", "SS", "nj", "Nj", "NJ",
"dz", "Dz", "DZ", "lj", "Lj",
"LJ".
</p>
<p>
So for example the expression:
</p>
<pre class="programlisting">[[.ae.]-c] </pre>
<p>
will match any character that collates between the digraph "ae"
and the character "c".
</p>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="../collating_names.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../collating_names.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="posix_symbolic_names.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,49 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Named Unicode Characters</title>
<link rel="stylesheet" href="../../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../../index.html" title="Boost.Regex">
<link rel="up" href="../collating_names.html" title="Collating Names">
<link rel="prev" href="posix_symbolic_names.html" title="POSIX Symbolic Names">
<link rel="next" href="../leftmost_longest_rule.html" title="The Leftmost Longest Rule">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="posix_symbolic_names.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../collating_names.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="../leftmost_longest_rule.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h4 class="title">
<a name="boost_regex.syntax.collating_names.named_unicode"></a><a href="named_unicode.html" title="Named Unicode Characters">
Named Unicode Characters</a></h4></div></div></div>
<p>
When using <a href="../../unicode.html" title="Unicode and Boost.Regex">Unicode aware regular expressions</a>
(with the <code class="computeroutput"><span class="identifier">u32regex</span></code> type),
all the normal symbolic names for Unicode characters (those given in Unidata.txt)
are recognised. So for example:
</p>
<pre class="programlisting">[[.CYRILLIC CAPITAL LETTER I.]] </pre>
<p>
would match the Unicode character 0x0418.
</p>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="posix_symbolic_names.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../collating_names.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="../leftmost_longest_rule.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,980 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>POSIX Symbolic Names</title>
<link rel="stylesheet" href="../../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../../index.html" title="Boost.Regex">
<link rel="up" href="../collating_names.html" title="Collating Names">
<link rel="prev" href="digraphs.html" title="Digraphs">
<link rel="next" href="named_unicode.html" title="Named Unicode Characters">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="digraphs.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../collating_names.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="named_unicode.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h4 class="title">
<a name="boost_regex.syntax.collating_names.posix_symbolic_names"></a><a href="posix_symbolic_names.html" title="POSIX Symbolic Names">
POSIX Symbolic Names</a></h4></div></div></div>
<p>
The following symbolic names are recognised as valid collating element
names, in addition to any single character, this allows you to write for
example:
</p>
<pre class="programlisting">[[.left-square-bracket.][.right-square-bracket.]]</pre>
<p>
if you wanted to match either "[" or "]".
</p>
<div class="informaltable"><table class="table">
<colgroup>
<col>
<col>
</colgroup>
<thead><tr>
<th>
<p>
Name
</p>
</th>
<th>
<p>
Character
</p>
</th>
</tr></thead>
<tbody>
<tr>
<td>
<p>
NUL
</p>
</td>
<td>
<p>
\x00
</p>
</td>
</tr>
<tr>
<td>
<p>
SOH
</p>
</td>
<td>
<p>
\x01
</p>
</td>
</tr>
<tr>
<td>
<p>
STX
</p>
</td>
<td>
<p>
\x02
</p>
</td>
</tr>
<tr>
<td>
<p>
ETX
</p>
</td>
<td>
<p>
\x03
</p>
</td>
</tr>
<tr>
<td>
<p>
EOT
</p>
</td>
<td>
<p>
\x04
</p>
</td>
</tr>
<tr>
<td>
<p>
ENQ
</p>
</td>
<td>
<p>
\x05
</p>
</td>
</tr>
<tr>
<td>
<p>
ACK
</p>
</td>
<td>
<p>
\x06
</p>
</td>
</tr>
<tr>
<td>
<p>
alert
</p>
</td>
<td>
<p>
\x07
</p>
</td>
</tr>
<tr>
<td>
<p>
backspace
</p>
</td>
<td>
<p>
\x08
</p>
</td>
</tr>
<tr>
<td>
<p>
tab
</p>
</td>
<td>
<p>
\t
</p>
</td>
</tr>
<tr>
<td>
<p>
newline
</p>
</td>
<td>
<p>
\n
</p>
</td>
</tr>
<tr>
<td>
<p>
vertical-tab
</p>
</td>
<td>
<p>
\v
</p>
</td>
</tr>
<tr>
<td>
<p>
form-feed
</p>
</td>
<td>
<p>
\f
</p>
</td>
</tr>
<tr>
<td>
<p>
carriage-return
</p>
</td>
<td>
<p>
\r
</p>
</td>
</tr>
<tr>
<td>
<p>
SO
</p>
</td>
<td>
<p>
\xE
</p>
</td>
</tr>
<tr>
<td>
<p>
SI
</p>
</td>
<td>
<p>
\xF
</p>
</td>
</tr>
<tr>
<td>
<p>
DLE
</p>
</td>
<td>
<p>
\x10
</p>
</td>
</tr>
<tr>
<td>
<p>
DC1
</p>
</td>
<td>
<p>
\x11
</p>
</td>
</tr>
<tr>
<td>
<p>
DC2
</p>
</td>
<td>
<p>
\x12
</p>
</td>
</tr>
<tr>
<td>
<p>
DC3
</p>
</td>
<td>
<p>
\x13
</p>
</td>
</tr>
<tr>
<td>
<p>
DC4
</p>
</td>
<td>
<p>
\x14
</p>
</td>
</tr>
<tr>
<td>
<p>
NAK
</p>
</td>
<td>
<p>
\x15
</p>
</td>
</tr>
<tr>
<td>
<p>
SYN
</p>
</td>
<td>
<p>
\x16
</p>
</td>
</tr>
<tr>
<td>
<p>
ETB
</p>
</td>
<td>
<p>
\x17
</p>
</td>
</tr>
<tr>
<td>
<p>
CAN
</p>
</td>
<td>
<p>
\x18
</p>
</td>
</tr>
<tr>
<td>
<p>
EM
</p>
</td>
<td>
<p>
\x19
</p>
</td>
</tr>
<tr>
<td>
<p>
SUB
</p>
</td>
<td>
<p>
\x1A
</p>
</td>
</tr>
<tr>
<td>
<p>
ESC
</p>
</td>
<td>
<p>
\x1B
</p>
</td>
</tr>
<tr>
<td>
<p>
IS4
</p>
</td>
<td>
<p>
\x1C
</p>
</td>
</tr>
<tr>
<td>
<p>
IS3
</p>
</td>
<td>
<p>
\x1D
</p>
</td>
</tr>
<tr>
<td>
<p>
IS2
</p>
</td>
<td>
<p>
\x1E
</p>
</td>
</tr>
<tr>
<td>
<p>
IS1
</p>
</td>
<td>
<p>
\x1F
</p>
</td>
</tr>
<tr>
<td>
<p>
space
</p>
</td>
<td>
<p>
\x20
</p>
</td>
</tr>
<tr>
<td>
<p>
exclamation-mark
</p>
</td>
<td>
<p>
!
</p>
</td>
</tr>
<tr>
<td>
<p>
quotation-mark
</p>
</td>
<td>
<p>
"
</p>
</td>
</tr>
<tr>
<td>
<p>
number-sign
</p>
</td>
<td>
<p>
#
</p>
</td>
</tr>
<tr>
<td>
<p>
dollar-sign
</p>
</td>
<td>
<p>
$
</p>
</td>
</tr>
<tr>
<td>
<p>
percent-sign
</p>
</td>
<td>
<p>
%
</p>
</td>
</tr>
<tr>
<td>
<p>
ampersand
</p>
</td>
<td>
<p>
&amp;
</p>
</td>
</tr>
<tr>
<td>
<p>
apostrophe
</p>
</td>
<td>
<p>
'
</p>
</td>
</tr>
<tr>
<td>
<p>
left-parenthesis
</p>
</td>
<td>
<p>
(
</p>
</td>
</tr>
<tr>
<td>
<p>
right-parenthesis
</p>
</td>
<td>
<p>
)
</p>
</td>
</tr>
<tr>
<td>
<p>
asterisk
</p>
</td>
<td>
<p>
*
</p>
</td>
</tr>
<tr>
<td>
<p>
plus-sign
</p>
</td>
<td>
<p>
+
</p>
</td>
</tr>
<tr>
<td>
<p>
comma
</p>
</td>
<td>
<p>
,
</p>
</td>
</tr>
<tr>
<td>
<p>
hyphen
</p>
</td>
<td>
<p>
-
</p>
</td>
</tr>
<tr>
<td>
<p>
period
</p>
</td>
<td>
<p>
.
</p>
</td>
</tr>
<tr>
<td>
<p>
slash
</p>
</td>
<td>
<p>
/
</p>
</td>
</tr>
<tr>
<td>
<p>
zero
</p>
</td>
<td>
<p>
0
</p>
</td>
</tr>
<tr>
<td>
<p>
one
</p>
</td>
<td>
<p>
1
</p>
</td>
</tr>
<tr>
<td>
<p>
two
</p>
</td>
<td>
<p>
2
</p>
</td>
</tr>
<tr>
<td>
<p>
three
</p>
</td>
<td>
<p>
3
</p>
</td>
</tr>
<tr>
<td>
<p>
four
</p>
</td>
<td>
<p>
4
</p>
</td>
</tr>
<tr>
<td>
<p>
five
</p>
</td>
<td>
<p>
5
</p>
</td>
</tr>
<tr>
<td>
<p>
six
</p>
</td>
<td>
<p>
6
</p>
</td>
</tr>
<tr>
<td>
<p>
seven
</p>
</td>
<td>
<p>
7
</p>
</td>
</tr>
<tr>
<td>
<p>
eight
</p>
</td>
<td>
<p>
8
</p>
</td>
</tr>
<tr>
<td>
<p>
nine
</p>
</td>
<td>
<p>
9
</p>
</td>
</tr>
<tr>
<td>
<p>
colon
</p>
</td>
<td>
<p>
:
</p>
</td>
</tr>
<tr>
<td>
<p>
semicolon
</p>
</td>
<td>
<p>
;
</p>
</td>
</tr>
<tr>
<td>
<p>
less-than-sign
</p>
</td>
<td>
<p>
&lt;
</p>
</td>
</tr>
<tr>
<td>
<p>
equals-sign
</p>
</td>
<td>
<p>
=
</p>
</td>
</tr>
<tr>
<td>
<p>
greater-than-sign
</p>
</td>
<td>
<p>
&gt;
</p>
</td>
</tr>
<tr>
<td>
<p>
question-mark
</p>
</td>
<td>
<p>
?
</p>
</td>
</tr>
<tr>
<td>
<p>
commercial-at
</p>
</td>
<td>
<p>
@
</p>
</td>
</tr>
<tr>
<td>
<p>
left-square-bracket
</p>
</td>
<td>
<p>
[
</p>
</td>
</tr>
<tr>
<td>
<p>
backslash
</p>
</td>
<td>
<p>
\
</p>
</td>
</tr>
<tr>
<td>
<p>
right-square-bracket
</p>
</td>
<td>
<p>
]
</p>
</td>
</tr>
<tr>
<td>
<p>
circumflex
</p>
</td>
<td>
<p>
~
</p>
</td>
</tr>
<tr>
<td>
<p>
underscore
</p>
</td>
<td>
<p>
_
</p>
</td>
</tr>
<tr>
<td>
<p>
grave-accent
</p>
</td>
<td>
<p>
`
</p>
</td>
</tr>
<tr>
<td>
<p>
left-curly-bracket
</p>
</td>
<td>
<p>
{
</p>
</td>
</tr>
<tr>
<td>
<p>
vertical-line
</p>
</td>
<td>
<p>
|
</p>
</td>
</tr>
<tr>
<td>
<p>
right-curly-bracket
</p>
</td>
<td>
<p>
}
</p>
</td>
</tr>
<tr>
<td>
<p>
tilde
</p>
</td>
<td>
<p>
~
</p>
</td>
</tr>
<tr>
<td>
<p>
DEL
</p>
</td>
<td>
<p>
\x7F
</p>
</td>
</tr>
</tbody>
</table></div>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="digraphs.html"><img src="../../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../collating_names.html"><img src="../../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../../index.html"><img src="../../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="named_unicode.html"><img src="../../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

View File

@ -0,0 +1,74 @@
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>The Leftmost Longest Rule</title>
<link rel="stylesheet" href="../../../../../../doc/html/boostbook.css" type="text/css">
<meta name="generator" content="DocBook XSL Stylesheets Vsnapshot_2006-12-17_0120">
<link rel="start" href="../../index.html" title="Boost.Regex">
<link rel="up" href="../syntax.html" title="Regular Expression Syntax">
<link rel="prev" href="collating_names/named_unicode.html" title="Named Unicode Characters">
<link rel="next" href="../format.html" title="Search and Replace Format String Syntax">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF">
<table cellpadding="2" width="100%">
<td valign="top"><img alt="Boost C++ Libraries" width="277" height="86" src="../../../../../../boost.png"></td>
<td align="center"><a href="../../../../../../index.htm">Home</a></td>
<td align="center"><a href="../../../../../../libs/libraries.htm">Libraries</a></td>
<td align="center"><a href="../../../../../../people/people.htm">People</a></td>
<td align="center"><a href="../../../../../../more/faq.htm">FAQ</a></td>
<td align="center"><a href="../../../../../../more/index.htm">More</a></td>
</table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="collating_names/named_unicode.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="../format.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
<div class="section" lang="en">
<div class="titlepage"><div><div><h3 class="title">
<a name="boost_regex.syntax.leftmost_longest_rule"></a><a href="leftmost_longest_rule.html" title="The Leftmost Longest Rule"> The Leftmost
Longest Rule</a></h3></div></div></div>
<p>
Often there is more than one way of matching a regular expression at a particular
location, for POSIX basic and extended regular expressions, the "best"
match is determined as follows:
</p>
<div class="orderedlist"><ol type="1">
<li>
Find the leftmost match, if there is only one match possible at this location
then return it.
</li>
<li>
Find the longest of the possible matches, along with any ties. If there
is only one such possible match then return it.
</li>
<li>
If there are no marked sub-expressions, then all the remaining alternatives
are indistinguishable; return the first of these found.
</li>
<li>
Find the match which has matched the first sub-expression in the leftmost
position, along with any ties. If there is only on such match possible
then return it.
</li>
<li>
Find the match which has the longest match for the first sub-expression,
along with any ties. If there is only one such match then return it.
</li>
<li>
Repeat steps 4 and 5 for each additional marked sub-expression.
</li>
<li>
If there is still more than one possible match remaining, then they are
indistinguishable; return the first one found.
</li>
</ol></div>
</div>
<table xmlns:rev="http://www.cs.rpi.edu/~gregod/boost/tools/doc/revision" width="100%"><tr>
<td align="left"></td>
<td align="right"><small>Copyright <20> 2007 John Maddock</small></td>
</tr></table>
<hr>
<div class="spirit-nav">
<a accesskey="p" href="collating_names/named_unicode.html"><img src="../../../../../../doc/html/images/prev.png" alt="Prev"></a><a accesskey="u" href="../syntax.html"><img src="../../../../../../doc/html/images/up.png" alt="Up"></a><a accesskey="h" href="../../index.html"><img src="../../../../../../doc/html/images/home.png" alt="Home"></a><a accesskey="n" href="../format.html"><img src="../../../../../../doc/html/images/next.png" alt="Next"></a>
</div>
</body>
</html>

File diff suppressed because it is too large Load Diff