Gumbo
1.0
A C library for parsing HTML.
|
#include <gumbo.h>
A struct representing a single attribute on an HTML tag. This is a name-value pair, but also includes information about source locations and original source text.
GumboAttributeNamespaceEnum GumboAttribute::attr_namespace |
The namespace for the attribute. This will usually be GUMBO_ATTR_NAMESPACE_NONE, but some XLink/XMLNS/XML attributes take special values, per: http://www.whatwg.org/specs/web-apps/current-work/multipage/tree-construction.html#adjust-foreign-attributes
const char* GumboAttribute::name |
The name of the attribute. This is in a freshly-allocated buffer to deal with case-normalization, and is null-terminated.
GumboStringPiece GumboAttribute::original_name |
The original text of the attribute name, as a pointer into the original source buffer.
const char* GumboAttribute::value |
The value of the attribute. This is in a freshly-allocated buffer to deal with unescaping, and is null-terminated. It does not include any quotes that surround the attribute. If the attribute has no value (for example, 'selected' on a checkbox), this will be an empty string.
GumboStringPiece GumboAttribute::original_value |
The original text of the value of the attribute. This points into the original source buffer. It includes any quotes that surround the attribute, and you can look at original_value.data[0] and original_value.data[original_value.length - 1] to determine what the quote characters were. If the attribute has no value, this will be a 0-length string.
GumboSourcePosition GumboAttribute::name_start |
The starting position of the attribute name.
GumboSourcePosition GumboAttribute::name_end |
The ending position of the attribute name. This is not always derivable from the starting position of the value because of the possibility of whitespace around the = sign.
GumboSourcePosition GumboAttribute::value_start |
The starting position of the attribute value.
GumboSourcePosition GumboAttribute::value_end |
The ending position of the attribute value.
Gumbo
1.0
A C library for parsing HTML.
|