YAML (3)
Leading comments
Automatically generated by Pod::Man 4.07 (Pod::Simple 3.32) Standard preamble: ========================================================================
NAME
YAML - YAML Ain't Markup LanguageXVERSION
This document describesNOTE
This module has been released toIf you want robust and fast
If you really need to use this version of
The rest of this documentation is left unchanged, until
SYNOPSIS
use YAML; # Load a YAML stream of 3 YAML documents into Perl data structures. my ($hashref, $arrayref, $string) = Load(<<'...'); --- name: ingy age: old weight: heavy # I should comment that I also like pink, but don't tell anybody. favorite colors: - red - green - blue --- - Clark Evans - Oren Ben-Kiki - Ingy do.t Net --- > You probably think YAML stands for "Yet Another Markup Language". It ain't! YAML is really a data serialization language. But if you want to think of it as a markup, that's OK with me. A lot of people try to use XML as a serialization format. "YAML" is catchy and fun to say. Try it. "YAML, YAML, YAML!!!" ... # Dump the Perl data structures back into YAML. print Dump($string, $arrayref, $hashref); # YAML::Dump is used the same way you'd use Data::Dumper::Dumper use Data::Dumper; print Dumper($string, $arrayref, $hashref);
DESCRIPTION
The
For information on the
WHY YAML IS COOL
- YAMLis readable for people.
-
It makes clear sense out of complex data structures. You should find that YAMLis an exceptional data dumping tool. Structure is shown through indentation,YAMLsupports recursive data, and hash keys are sorted by default. In addition,YAMLsupports several styles of scalar formatting for different types of data.
- YAMLis editable.
-
YAMLwas designed from the ground up to be an excellent syntax for configuration files. Almost all programs need configuration files, so why invent a new syntax for each one? And why subject users to the complexities ofXMLor native Perl code?
- YAMLis multilingual.
-
Yes, YAMLsupports Unicode. But I'm actually referring to programming languages.YAMLwas designed to meet the serialization needs of Perl, Python, Ruby, Tcl,PHP,Javascript and Java. It was also designed to be interoperable between those languages. That meansYAMLserializations produced by Perl can be processed by Python.
- YAMLis taint safe.
-
Using modules like Data::Dumper for serialization is fine as long as you can
be sure that nobody can tamper with your data files or transmissions. That's
because you need to use Perl's "eval()" built-in to deserialize the data.
Somebody could add a snippet of Perl to erase your files.
YAML's parser does not need to eval anything.
- YAMLis full featured.
-
YAMLcan accurately serialize all of the common Perl data structures and deserialize them again without losing data relationships. Although it is not 100% perfect (no serializer is or can be perfect), it fares as well as the popular current modules: Data::Dumper, Storable, XML::Dumper and Data::Denter.YAML.pm also has the ability to handle code (subroutine) references and typeglobs. (Still experimental) These features are not found in Perl's other serialization modules.
- YAMLis extensible.
-
The YAMLlanguage has been designed to be flexible enough to solve it's own problems. The markup itself has 3 basic construct which resemble Perl's hash, array and scalar. By default, these map to their Perl equivalents. But eachYAMLnode also supports a tagging mechanism (type system) which can cause that node to be interpreted in a completely different manner. That's howYAMLcan support object serialization and oddball structures like Perl's typeglob.
YAML IMPLEMENTATIONS IN PERL
This module,Different
Why would there be more than one implementation of
For this reason I am currently working on 3 different
- YAML
-
The main YAMLdistribution will keeping evolving to support the entireYAMLspecification in pure Perl. This may not be the fastest or most stable module though. Currently,YAML.pm has lots of known bugs. It is mostly a great tool for dumping Perl data structures to a readable form.
- YAML::Tiny
-
The point of YAML::Tiny is to strip YAMLdown to the 90% that people use most and offer that in a small, fast, stable, pure Perl form. YAML::Tiny will simply die when it is asked to do something it can't.
- YAML::Syck
-
"libsyck" is the C based YAMLprocessing library used by the Ruby programming language (and also Python,PHPand Pugs). YAML::Syck is the Perl binding to "libsyck". It should be very fast, but may have problems of its own. It will also require C compilation.NOTE:Audrey Tang has actually completed this module and it works great and is
10 times faster thanYAML.pm.
In the future, there will likely be even more
FUNCTIONAL USAGE
Exported Functions
The following functions are exported by
use YAML ();
- Dump(list-of-Perl-data-structures)
-
Turn Perl data into YAML.This function works very much like Data::Dumper::Dumper(). It takes a list of Perl data structures and dumps them into a serialized form. It returns a string containing theYAMLstream. The structures can be references or plain scalars.
- Load(string-containing-a-YAML-stream)
-
Turn YAMLinto Perl data. This is the opposite of Dump. Just like Storable's thaw() function or the eval() function in relation to Data::Dumper. It parses a string containing a validYAMLstream into a list of Perl data structures.
Exportable Functions
These functions are not exported by default but you can request them in an import list like this:
use YAML qw'freeze thaw Bless';
- freeze() and thaw()
-
Aliases to Dump() and Load() for Storable fans. This will also allow YAML.pm to be plugged directly into modules likePOE.pm, that use the freeze/thawAPIfor internal serialization.
- DumpFile(filepath, list)
-
Writes the YAMLstream to a file instead of just returning a string.
- LoadFile(filepath)
-
Reads the YAMLstream from a file instead of a string.
- Bless(perl-node, [yaml-node | class-name])
-
Associate a normal Perl node, with a yaml node. A yaml node is an object tied
to the YAML::Node class. The second argument is either a yaml node that you've
already created or a class (package) name that supports a "yaml_dump()"
function. A "yaml_dump()" function should take a perl node and return a yaml
node. If no second argument is provided, Bless will create a yaml node. This
node is not returned, but can be retrieved with the Blessed() function.
Here's an example of how to use Bless. Say you have a hash containing three keys, but you only want to dump two of them. Furthermore the keys must be dumped in a certain order. Here's how you do that:
use YAML qw(Dump Bless); $hash = {apple => 'good', banana => 'bad', cauliflower => 'ugly'}; print Dump $hash; Bless($hash)->keys(['banana', 'apple']); print Dump $hash;
produces:
--- apple: good banana: bad cauliflower: ugly --- banana: bad apple: good
Bless returns the tied part of a yaml-node, so that you can call the YAML::Node methods. This is the same thing that YAML::Node::ynode() returns. So another way to do the above example is:
use YAML qw(Dump Bless); use YAML::Node; $hash = {apple => 'good', banana => 'bad', cauliflower => 'ugly'}; print Dump $hash; Bless($hash); $ynode = ynode(Blessed($hash)); $ynode->keys(['banana', 'apple']); print Dump $hash;
Note that Blessing a Perl data structure does not change it anyway. The extra information is stored separately and looked up by the Blessed node's memory address.
- Blessed(perl-node)
-
Returns the yaml node that a particular perl node is associated with (see
above). Returns undef if the node is not (YAML) Blessed.
GLOBAL OPTIONS
For example, to change the indentation width, do something like:
local $YAML::Indent = 3;
The current options are:
- DumperClass
-
You can override which module/class YAMLuses for Dumping data.
- LoaderClass
-
You can override which module/class YAMLuses for Loading data.
- Indent
-
This is the number of space characters to use for each indentation level when
doing a Dump(). The default is 2.
By the way,
YAMLcan use any number of characters for indentation at any level. So if you are editingYAMLby hand feel free to do it anyway that looks pleasing to you; just be consistent for a given level. - SortKeys
-
Default is 1. (true)
Tells
YAML.pm whether or not to sort hash keys when storing a document.YAML::Node objects can have their own sort order, which is usually what you want. To override the YAML::Node order and sort the keys anyway, set SortKeys to 2.
- Stringify
-
Default is 0. (false)
Objects with string overloading should honor the overloading and dump the stringification of themselves, rather than the actual object's guts.
- UseHeader
-
Default is 1. (true)
This tells
YAML.pm whether to use a separator string for a Dump operation. This only applies to the first document in a stream. Subsequent documents must have aYAMLheader by definition. - UseVersion
-
Default is 0. (false)
Tells
YAML.pm whether to include theYAMLversion on the separator/header.--- %YAML:1.0
- AnchorPrefix
-
Default is ''.
Anchor names are normally numeric.
YAML.pm simply starts with '1' and increases by one for each new anchor. This option allows you to specify a string to be prepended to each anchor number. - UseCode
-
Setting the UseCode option is a shortcut to set both the DumpCode and LoadCode
options at once. Setting UseCode to '1' tells YAML.pm to dump Perl code references as Perl (using B::Deparse) and to load them back into memory using eval(). The reason this has to be an option is that using eval() to parse untrusted code is, well, untrustworthy.
- DumpCode
-
Determines if and how YAML.pm should serialize Perl code references. By defaultYAML.pm will dump code references as dummy placeholders (much like Data::Dumper). If DumpCode is set to '1' or 'deparse', code references will be dumped as actual Perl code.
DumpCode can also be set to a subroutine reference so that you can write your own serializing routine.
YAML.pm passes you the code ref. You pass back the serialization (as a string) and a format indicator. The format indicator is a simple string like: 'deparse' or 'bytecode'. - LoadCode
-
LoadCode is the opposite of DumpCode. It tells YAMLif and how to deserialize code references. When set to '1' or 'deparse' it will use "eval()". Since this is potentially risky, only use this option if you know where yourYAMLhas been.
LoadCode can also be set to a subroutine reference so that you can write your own deserializing routine.
YAML.pm passes the serialization (as a string) and a format indicator. You pass back the code reference. - UseBlock
-
YAML.pm uses heuristics to guess which scalar style is best for a given node. Sometimes you'll want all multiline scalars to use the 'block' style. If so, set this option to 1.NOTE: YAML's block style is akin to Perl's here-document.
- UseFold
-
If you want to force YAMLto use the 'folded' style for all multiline scalars, then set $UseFold to 1.NOTE: YAML's folded style is akin to the wayHTMLfolds text, except smarter.
- UseAliases
-
YAMLhas an alias mechanism such that any given structure in memory gets serialized once. Any other references to that structure are serialized only as alias markers. This is howYAMLcan serialize duplicate and recursive structures.
Sometimes, when you
KNOWthat your data is nonrecursive in nature, you may want to serialize such that every node is expressed in full. (ie as a copy of the original). Setting $YAML::UseAliases to 0 will allow you to do this. This also may result in faster processing because the lookup overhead is by bypassed.THIS OPTION CAN BE DANGEROUS.If your data is recursive, this option will cause Dump() to run in an endless loop, chewing up your computers memory. You have been warned. - CompressSeries
-
Default is 1.
Compresses the formatting of arrays of hashes:
- foo: bar - bar: foo
becomes:
- foo: bar - bar: foo
Since this output is usually more desirable, this option is turned on by default.
- QuoteNumericStrings
-
Default is 0. (false)
Adds detection mechanisms to encode strings that resemble numbers with mandatory quoting.
This ensures leading that things like leading/trailing zeros and other formatting are preserved.
YAML TERMINOLOGY
It is important to remember that although
- stream
-
A YAML stream is the full sequence of Unicode characters that a YAML parser would read or a YAML emitter would write. A stream may contain one or more YAML documents separated by YAML headers. --- a: mapping foo: bar --- - a - sequence
- document
-
A YAMLdocument is an independent data structure representation within a stream. It is a top level node. Each document in aYAMLstream must begin with aYAMLheader line. Actually the header is optional on the first document.
--- This: top level mapping is: - a - YAML - document
- header
-
A YAMLheader is a line that begins aYAMLdocument. It consists of three dashes, possibly followed by more info. Another purpose of the header line is that it serves as a place to put top level tag and anchor information.
--- !recursive-sequence &001 - * 001 - * 001
- node
-
A YAMLnode is the representation of a particular data structure. Nodes may contain other nodes. (In Perl terms, nodes are like scalars. Strings, arrayrefs and hashrefs. But this refers to the serialized format, not the in- memory structure.)
- tag
-
This is similar to a type. It indicates how a particular YAMLnode serialization should be transferred into or out of memory. For instance a Foo::Bar object would use the tag 'perl/Foo::Bar':
- !perl/Foo::Bar foo: 42 bar: stool
- collection
-
A collection is the generic term for a YAMLdata grouping.YAMLhas two types of collections: mappings and sequences. (Similar to hashes and arrays)
- mapping
-
A mapping is a YAMLcollection defined by unordered key/value pairs with unique keys. By defaultYAMLmappings are loaded into Perl hashes.
a mapping: foo: bar two: times two is 4
- sequence
-
A sequence is a YAMLcollection defined by an ordered list of elements. By defaultYAMLsequences are loaded into Perl arrays.
a sequence: - one bourbon - one scotch - one beer
- scalar
-
A scalar is a YAMLnode that is a single value. By defaultYAMLscalars are loaded into Perl scalars.
a scalar key: a scalar value
YAMLhas many styles for representing scalars. This is important because varying data will have varying formatting requirements to retain the optimum human readability. - plain scalar
-
A plain scalar is unquoted. All plain scalars are automatic candidates for
``implicit tagging''. This means that their tag may be determined automatically
by examination. The typical uses for this are plain alpha strings, integers,
real numbers, dates, times and currency.
- a plain string - -42 - 3.1415 - 12:34 - 123 this is an error
- single quoted scalar
-
This is similar to Perl's use of single quotes. It means no escaping except
for single quotes which are escaped by using two adjacent single quotes.
- 'When I say ''\n'' I mean "backslash en"'
- double quoted scalar
-
This is similar to Perl's use of double quotes. Character escaping can be
used.
- "This scalar\nhas two lines, and a bell -->\a"
- folded scalar
-
This is a multiline scalar which begins on the next line. It is indicated by a
single right angle bracket. It is unescaped like the single quoted scalar.
Line folding is also performed.
- > This is a multiline scalar which begins on the next line. It is indicated by a single carat. It is unescaped like the single quoted scalar. Line folding is also performed.
- block scalar
-
This final multiline form is akin to Perl's here-document except that (as in
all YAMLdata) scope is indicated by indentation. Therefore, no ending marker is required. The data is verbatim. No line folding.
- | QTY DESC PRICE TOTAL --- ---- ----- ----- 1 Foo Fighters $19.95 $19.95 2 Bar Belles $29.95 $59.90
- parser
-
A YAMLprocessor has four stages: parse, load, dump, emit.
A parser parses a
YAMLstream.YAML.pm's Load() function contains a parser. - loader
- The other half of the Load() function is a loader. This takes the information from the parser and loads it into a Perl data structure.
- dumper
- The Dump() function consists of a dumper and an emitter. The dumper walks through each Perl data structure and gives info to the emitter.
- emitter
-
The emitter takes info from the dumper and turns it into a YAMLstream.NOTE:InYAML.pm the parserloader and the dumperemitter code are currently
very closely tied together. In the future they may be broken into
separate stages.
For more information please refer to the immensely helpful
YSH - THE YAML SHELL
libyaml-shell-perl contains a script called 'ysh', theTo run ysh, (assuming you installed it along with
ysh [options]
Please read the "ysh" documentation for the full details. There are lots of options.
BUGS & DEFICIENCIES
If you find a bug in
RESOURCES
<lists.sourceforge.net/lists/listinfo/yaml-core> is the mailing list. This is where the language is discussed and designed.<www.yaml.org> is the official
<www.yaml.org/spec> is the
<yaml.kwiki.org> is the official
SEE ALSO
- *
-
YAML::XS
AUTHOR
Ingy do.t Net <ingy@cpan.org>COPYRIGHT AND LICENSE
Copyright 2001-2015. Ingy do.t Net.This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.