Android Dev & Splinters: XPath overview

XPath is a language for finding information in an XML document.

XPath Terminology

Nodes

In XPath, there are 7 kinds of nodes:  

element,
attribute,
text,
namespace,
processing-instruction,
comment,
document nodes.

XML documents are treated as trees of nodes. 

The topmost element of the tree is called the 
root element.

Look at the following XML document:

<?xml version="1.0" encoding="ISO-8859-1"?>

<bookstore>
 
<book>

    <title lang="en">Harry Potter</title>

    <author>J K. Rowling</author>

<year>2005</year>

    <price>29.99</price>

</book>
</bookstore>

Example of nodes in the XML document above:

<bookstore>  (root element node)

<author>J K. Rowling</author>  (element node)

lang="en" (attribute node)

Atomic values

Atomic values are nodes with no children or parent.

Example of atomic values:

J K. Rowling

"en"

Items

Items are atomic values or nodes.

Relationship of Nodes

Parent

Each element and attribute has one parent.

In the following example; the book element is the parent of the title, author, year, and price:

<book>

  <title>Harry Potter</title>

  <author>J K. Rowling</author>

  <year>2005</year>

  <price>29.99</price>

</book>

Children

Element nodes may have zero, one or more children.

In the following example; the title, author, year, and price elements are all children of the book element:

<book>

  <title>Harry Potter</title>

  <author>J K. Rowling</author>

  <year>2005</year>

  <price>29.99</price>

</book>

Siblings

Nodes that have the same parent.

In the following example; the title, author, year, and price elements are all siblings:

<book>

  <title>Harry Potter</title>

  <author>J K. Rowling</author>

  <year>2005</year>

  <price>29.99</price>

</book>

Ancestors

A node's parent, parent's parent, etc.

In the following example; the ancestors of the title element are the book element and the bookstore element:

<bookstore>

<book>

  <title>Harry Potter</title>

  <author>J K. Rowling</author>

  <year>2005</year>

  <price>29.99</price>

</book>

</bookstore>

Descendants

A node's children, children's children, etc.

In the following example; descendants of the bookstore element are the book, title, author, year, and price elements:

<bookstore>

<book>

  <title>Harry Potter</title>

  <author>J K. Rowling</author>

  <year>2005</year>

  <price>29.99</price>

</book>

</bookstore> 

What is XPath?

XPath is a syntax for defining parts of an XML document
XPath uses path expressions to navigate in XML documents
XPath contains a library of standard functions
XPath is a major element in XSLT
XPath is a W3C recommendation

XPath Path Expressions
XPath uses path expressionsto select nodes or node-sets in an XML document. These path 
expressions look very much like the expressions you see when you work with a traditional computer file system.

XPath Standard Functions
XPath includes over 100 built-in functions. There are functions for 
string values, numeric values, date and time comparison, node and QName 
manipulation, sequence manipulation, Boolean values, and more.

XPath is Used in XSLT
XPath is a major element in the XSLT standard. Without XPath knowledge you will not be able to create XSLT documents.

You can read more about XSLT in our XSLT tutorial.

XQuery and XPointer are both built on XPath expressions. XQuery 1.0 
and XPath 2.0 share the same data model and support the same functions 
and operators.

You can read more about XQuery in our XQuery tutorial.

XPATH is a W3C Recommendation
XPath became a W3C Recommendation 16. November 1999.

XPath was designed to be used by XSLT, XPointer and other XML parsing software.

XPath uses path expressions to select nodes or node-sets in an XML document. 
The node is selected by following a path or steps.

The XML Example Document
We will use the following XML document in the examples below.

<?xml version="1.0" encoding="ISO-8859-1"?>
 
  <bookstore>

<book>

<title lang="eng">Harry Potter</title>

<price>29.99</price>
    
</book>

<book>

<title lang="eng">Learning XML</title>

<price>39.95</price>
    
</book>

   </bookstore>

Selecting Nodes
XPath uses path expressions to select nodes in an XML document.

The node is selected by following a path or steps.

The most useful 
path expressions are listed below:

Expression
    Description
  
    nodename
    Selects all child nodes of the named node
    
    /
    Selects from the root node
    
    //
    Selects nodes in the document from the current node that 
 match the selection no matter where they are 
  
    .
    Selects the current node
  
    ..
    Selects the parent of the current node
  
    @
    Selects attributes   
  
In the table below we have listed some path expressions and the result of the 
expressions:

Path Expression
    Result
  
    bookstore
    Selects all the child nodes of the bookstore element
    
    /bookstore
    Selects the root element bookstoreNote: If the path starts with a slash ( / ) it 
 always represents an absolute 
path to an element!
    
    bookstore/book
    Selects all book elements that are children of bookstore
  
    //book
    Selects all book elements no matter where they are in the 
 document
  
    bookstore//book
    Selects all book elements that are descendant of the 
 bookstore element, no matter where they are under the bookstore element
  
    //@lang
    Selects all attributes that are named lang
  
Predicates
Predicates are used to find a specific node or a node that contains a specific value.

Predicates are always embedded in square brackets.

In the table below we have listed some path expressions with predicates and the result of the 
expressions:

Path Expression
    Result
  
    /bookstore/book[1] 
    Selects the first book element that is the child of the 
 bookstore element.Note: IE5 and later has implemented that [0] should be the first node, 
but according to the W3C standard it should have been [1]!!
    
    /bookstore/book[last()]
    Selects the last book element that is the child of the 
 bookstore element
    
    /bookstore/book[last()-1]
    Selects the last but one book element that is the child of the 
 bookstore element
  
    /bookstore/book[position()<3]
    Selects the first two book elements that are children of the 
 bookstore element
  
    //title[@lang]
    Selects all the title elements that have an attribute named 
 lang
  
    //title[@lang='eng']
    Selects all the title elements that have an attribute named lang 
 with a value of 'eng'
  
    /bookstore/book[price>35.00]
    Selects all the book elements of the bookstore element that 
 have a price element with a value greater than 35.00
  
    /bookstore/book[price>35.00]/title
    Selects all the title elements of the book elements of the 
 bookstore element that have a price element with a value greater than 35.00
  
Selecting Unknown Nodes
XPath wildcards can be used to select unknown XML elements.

 Wildcard
    Description
  
    *
    Matches any element node
    
    @*
    Matches any attribute node
  
    node()
    Matches any node of any kind
    
In the table below we have listed some path expressions and the result of the 
expressions:

 Path Expression
    Result
  
    /bookstore/*
    Selects all the child nodes of the bookstore element
    
    //*
    Selects all elements in the document
    
    //title[@*]
    Selects all title elements which have any attribute
  
Selecting Several Paths
By using the | operator in an XPath expression you can select several paths.

In the table below we have listed some path expressions and the result of the 
expressions:

 Path Expression
    Result
  
    //book/title | //book/price
    Selects all the title AND price elements of all book 
 elements
    
    //title | //price
    Selects all the title AND price elements in the document
    
    /bookstore/book/title | //price
    Selects all the title elements of the book element of the 
 bookstore element AND all the price elements in the document

Expression	Description
nodename	Selects all child nodes of the named node
/	Selects from the root node
//	Selects nodes in the document from the current node that match the selection no matter where they are
.	Selects the current node
..	Selects the parent of the current node
@	Selects attributes

Path Expression	Result
/bookstore/book[1]	Selects the first book element* that is the child of the bookstore element.*Note: IE5 and later has implemented that [0] should be the first node, but according to the W3C standard it should have been [1]!!
/bookstore/book[last()]	Selects the last book element that is the child of the bookstore element
/bookstore/book[last()-1]	Selects the last but one book element that is the child of the bookstore element
/bookstore/book[position()<3]	Selects the first two book elements that are children of the bookstore element
//title[@lang]	Selects all the title elements that have an attribute named lang
//title[@lang='eng']	Selects all the title elements that have an attribute named lang with a value of 'eng'
/bookstore/book[price>35.00]	Selects all the book elements of the bookstore element that have a price element with a value greater than 35.00
/bookstore/book[price>35.00]/title	Selects all the title elements of the book elements of the bookstore element that have a price element with a value greater than 35.00

Wildcard	Description
*	Matches any element node
@*	Matches any attribute node
node()	Matches any node of any kind

Android Dev & Splinters

mercoledì 25 aprile 2012

XPath overview

XPath is a language for finding information in an XML document.

XPath Terminology

Nodes

Atomic values

Items

Relationship of Nodes

Parent

Children

Siblings

Ancestors

Descendants

What is XPath?

XPath Path Expressions

XPath Standard Functions

XPath is Used in XSLT

XPATH is a W3C Recommendation

The XML Example Document

Selecting Nodes

Predicates

Selecting Unknown Nodes

Selecting Several Paths

Nessun commento:

Posta un commento

Path Expression	Result
bookstore	Selects all the child nodes of the bookstore element
/bookstore	Selects the root element bookstoreNote: If the path starts with a slash ( / ) it always represents an absolute path to an element!
bookstore/book	Selects all book elements that are children of bookstore
//book	Selects all book elements no matter where they are in the document
bookstore//book	Selects all book elements that are descendant of the bookstore element, no matter where they are under the bookstore element
//@lang	Selects all attributes that are named lang

Path Expression	Result
/bookstore/*	Selects all the child nodes of the bookstore element
//*	Selects all elements in the document
//title[@*]	Selects all title elements which have any attribute

Path Expression	Result
//book/title \| //book/price	Selects all the title AND price elements of all book elements
//title \| //price	Selects all the title AND price elements in the document
/bookstore/book/title \| //price	Selects all the title elements of the book element of the bookstore element AND all the price elements in the document