=begin comment
# !!!!!!! DO NOT EDIT THIS FILE !!!!!!!
# This file is machine-generated by lib/unicore/mktables from the Unicode
# database, Version 13.0.0. Any changes made here will be lost!
To change this file, edit lib/unicore/mktables instead.
=end comment
=head1 NAME
perluniprops - Index of Unicode Version 13.0.0 character properties in Perl
=head1 DESCRIPTION
This document provides information about the portion of the Unicode database
that deals with character properties, that is the portion that is defined on
single code points. (L
below briefly mentions other data that Unicode provides.)
Perl can provide access to all non-provisional Unicode character properties,
though not all are enabled by default. The omitted ones are the Unihan
properties (accessible via the CPAN module L) and certain
deprecated or Unicode-internal properties. (An installation may choose to
recompile Perl's tables to change this. See L.)
For most purposes, access to Unicode properties from the Perl core is through
regular expression matches, as described in the next section.
For some special purposes, and to access the properties that are not suitable
for regular expression matching, all the Unicode character properties that
Perl handles are accessible via the standard L module, as
described in the section L.
Perl also provides some additional extensions and short-cut synonyms
for Unicode properties.
This document merely lists all available properties and does not attempt to
explain what each property really means. There is a brief description of each
Perl extension; see L for more information on
these. There is some detail about Blocks, Scripts, General_Category,
and Bidi_Class in L, but to find out about the intricacies of the
official Unicode properties, refer to the Unicode standard. A good starting
place is L.
Note that you can define your own properties; see
L.
=head1 Properties accessible through C<\p{}> and C<\P{}>
The Perl regular expression C<\p{}> and C<\P{}> constructs give access to
most of the Unicode character properties. The table below shows all these
constructs, both single and compound forms.
B consist of two components, separated by an equals sign or a
colon. The first component is the property name, and the second component is
the particular value of the property to match against, for example,
C<\p{Script_Extensions: Greek}> and C<\p{Script_Extensions=Greek}> both mean
to match characters whose Script_Extensions property value is Greek.
(C is an improved version of the C