XML Parsers: SAX vs DOM Guide

This document discusses XML parsers and provides an overview of different types of parsers including validating versus non-validating, DOM, and SAX parsers. It explains that DOM parsers build a tree of the entire XML document in memory, while SAX parsers are event-based and read-only. SAX parsers are generally better for large documents or data streams while DOM is better if random access or modifications are needed. Popular parser products and the advantages and disadvantages of DOM and SAX are also summarized.

Uploaded by

Muthumanikandan

XML Parsers

Overview
- Types of parsers
- Using XML parsers
- SAX
- DOM
- DOM versus SAX
- Products
- Conclusion

Types of Parsers
There are several different ways to categorise parsers:
- Validating versus non-validating parsers
- Parsers that support the Document Object Model (DOM)
- Parsers that support the Simple API for XML (SAX)
- Parsers written in a particular language (Java, C++, Perl, etc.)

Non-validating Parsers
Speed and efficiency
- It takes a significant amount of effort for an XML parser to process a DTD and make sure that every element in an XML document follows the rules of the DTD.

If you only want to find tags and extract information, use a non-validating parser.
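
As a minimal sketch of what "non-validating" means in practice, the Python example below assumes the standard library's expat-based `xml.etree.ElementTree`, which checks well-formedness only. The sample document and its DTD are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: the DTD says <note> must contain exactly one <to>,
# but the body contains <from> instead.
DOC = """<!DOCTYPE note [
  <!ELEMENT note (to)>
  <!ELEMENT to (#PCDATA)>
]>
<note><from>Alice</from></note>"""

# A non-validating parser reads the DTD but does not enforce its rules,
# so parsing succeeds as long as the document is well-formed.
root = ET.fromstring(DOC)
child_tags = [child.tag for child in root]
print(root.tag, child_tags)
```

A validating parser would be expected to reject this document; the non-validating one simply hands back the tags it found, which is all that is needed for plain extraction.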

Using XML Parsers
Three basic steps to use an XML parser:
1. Create a parser object
2. Pass your XML document to the parser
3. Process the results

Generally, writing out XML is outside the scope of parsers (though some may implement proprietary mechanisms).
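
The three steps can be sketched as follows. This is only an illustration, assuming Python's standard `xml.etree.ElementTree.XMLParser`; the catalog document is a made-up sample, and other parsers expose the same steps under different names.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document used only for this illustration.
XML_TEXT = (
    "<catalog>"
    "<book><title>XML Basics</title></book>"
    "<book><title>SAX and DOM</title></book>"
    "</catalog>"
)

# Step 1: create a parser object.
parser = ET.XMLParser()

# Step 2: pass the XML document to the parser.
parser.feed(XML_TEXT)

# Step 3: process the results (here, the root element built by the parser).
root = parser.close()
titles = [book.findtext("title") for book in root.findall("book")]
print(titles)
```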

Parsing XML
Two established APIs:

SAX (Simple API for XML)
- Defines handlers whose methods are called as the XML is parsed

DOM (Document Object Model)
- Defines a logical tree representing the parsed XML

Parsing XML: DOM
Document Object Model
- standard API for accessing and creating XML data
- tree-based
- programming language independent
- developed by the W3C
- whole document is read into memory
- read and write

Creating a DOM Tree
A DOM implementation will have a method for passing an XML file to a factory object, which returns a Document object representing the whole document; the root element is reached through it.

After this, you may use the standard DOM interfaces to interact with the XML structure.
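
A minimal sketch of this flow, assuming Python's standard `xml.dom.minidom` as the DOM implementation (here `parseString` plays the role of the factory method). The catalog content and the ids `b1` and `b2` are hypothetical.

```python
from xml.dom import minidom

# Hypothetical sample document used only for this illustration.
XML_TEXT = "<catalog><book id='b1'><title>XML Basics</title></book></catalog>"

# The factory call returns a Document object for the whole document.
document = minidom.parseString(XML_TEXT)
root = document.documentElement  # the <catalog> root element

# Read: navigate the in-memory tree through the standard DOM interface.
first_title = root.getElementsByTagName("title")[0].firstChild.data

# Write: build a new <book> subtree and attach it to the tree.
new_book = document.createElement("book")
new_book.setAttribute("id", "b2")
new_title = document.createElement("title")
new_title.appendChild(document.createTextNode("SAX and DOM"))
new_book.appendChild(new_title)
root.appendChild(new_book)

book_count = len(root.getElementsByTagName("book"))
print(first_title, book_count)
```

Because the whole tree stays in memory, the same Document can be read and modified repeatedly without parsing the file again.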

Parsing XML: DOM
[Figure: the XML file is parsed into a DOM tree, which the application accesses through the DOM API]

DOM Interfaces
The DOM defines several interfaces:

Node      The base data type of the DOM
Element   Represents an element
Attr      Represents an attribute of an element
Text      The content of an element or attribute
Document  Represents the entire XML document. A Document object is often referred to as a DOM tree
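
To make the interfaces above concrete, here is a small sketch, again assuming Python's `xml.dom.minidom`, that obtains one node of each kind from a made-up one-element document and inspects its `nodeType`.

```python
from xml.dom import minidom, Node

# Hypothetical one-element document used only for this illustration.
document = minidom.parseString('<book id="b1">XML Basics</book>')

element = document.documentElement      # Element: the <book> element
attr = element.getAttributeNode("id")   # Attr: the id attribute
text = element.firstChild               # Text: the element's content

# Every object above is a Node; nodeType tells the kinds apart.
node_types = [document.nodeType, element.nodeType, attr.nodeType, text.nodeType]
is_expected = node_types == [
    Node.DOCUMENT_NODE,
    Node.ELEMENT_NODE,
    Node.ATTRIBUTE_NODE,
    Node.TEXT_NODE,
]
print(is_expected, attr.value, text.data)
```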

DOM Levels
DOM Level 1
- basic functionality for document navigation and manipulation

DOM Level 2
- includes a style sheet object model
- defines an event model and provides support for XML namespaces

DOM Level 3
- finalised as W3C Recommendations in 2004 (Core, Load and Save, Validation)
- addresses document loading and saving
- content model (DTDs and schemas) with document validation support

Parsing XML: SAX
Simple API for XML
- API for accessing XML data
- event-based
- programming language independent
- the application has to store any fragments it needs in memory
- read-only

Parsing XML: SAX
SAX is an interface to the XML parser based on streaming and callbacks.
You extend the HandlerBase class (SAX 1; SAX 2 replaces it with DefaultHandler) and override the callbacks you need:
- startDocument, endDocument
- startElement, endElement
- characters
- warning, error, fatalError
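
The callback style can be sketched as below. This is an illustration in Python rather than Java: it assumes the standard `xml.sax` module, where the document callbacks live on `ContentHandler` (the `warning`, `error` and `fatalError` callbacks sit on a separate `ErrorHandler` class and are left at their defaults here). The `EventLogger` class and the sample document are hypothetical.

```python
import xml.sax


class EventLogger(xml.sax.ContentHandler):
    """Records each SAX callback in the order the parser makes it."""

    def __init__(self):
        super().__init__()
        self.events = []

    def startDocument(self):
        self.events.append("startDocument")

    def endDocument(self):
        self.events.append("endDocument")

    def startElement(self, name, attrs):
        self.events.append(f"startElement {name}")

    def endElement(self, name):
        self.events.append(f"endElement {name}")

    def characters(self, content):
        # Text may arrive in several chunks; whitespace-only chunks are skipped.
        if content.strip():
            self.events.append(f"characters {content.strip()}")


handler = EventLogger()
xml.sax.parseString(b"<catalog><book>XML Basics</book></catalog>", handler)
events = handler.events
for event in events:
    print(event)
```

Nothing is kept by the parser itself: whatever the application wants to remember, it has to store inside the handler, as `events` does here.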

Parsing XML: SAX
[Figure: the XML file is read as a stream and reported to the application as a sequence of SAX calls]

SAX versus DOM
DOM:
- read and write
- need to move back and forth in the data
- document is human-created

SAX:
- read-only
- huge data or streams
- data is machine-generated

DOM pro and contra
PRO
- The file is parsed only once.
- Strong navigation abilities: this is the aim of the DOM design.

CONTRA
- More memory is needed, since the whole XML tree is held in memory.

SAX pro and contra
PRO
- Low memory needs, since the XML file is never entirely in memory
- Can deal with XML streams

CONTRA
- Nothing is kept after parsing, so the file has to be parsed again in full for each separate lookup: fetching the 10 nodes of a catalog one at a time means parsing the same file 10 times.
- Poor navigation abilities: there is no easy way to get the children of a given node or the list of "B" nodes.
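
The navigation gap can be illustrated with the list of "B" nodes mentioned above. The sketch below assumes Python's standard `xml.sax` and `xml.dom.minidom`; the document and the `BCollector` class are made up for the comparison. With SAX the application tracks state by hand during the pass; with DOM a single call on the in-memory tree returns the nodes.

```python
import xml.sax
from xml.dom import minidom

# Hypothetical sample document with <B> elements at different depths.
XML_BYTES = b"<A><B>one</B><C><B>two</B></C><B>three</B></A>"


class BCollector(xml.sax.ContentHandler):
    """Collects the text of every <B> element while the stream goes by."""

    def __init__(self):
        super().__init__()
        self.inside_b = False
        self.buffer = []
        self.values = []

    def startElement(self, name, attrs):
        if name == "B":
            self.inside_b = True
            self.buffer = []

    def characters(self, content):
        if self.inside_b:
            self.buffer.append(content)

    def endElement(self, name):
        if name == "B":
            self.values.append("".join(self.buffer))
            self.inside_b = False


# SAX: hand-written state tracking; nothing remains once the pass is over.
collector = BCollector()
xml.sax.parseString(XML_BYTES, collector)
sax_values = collector.values

# DOM: one call on the tree, which stays available for further queries.
dom_tree = minidom.parseString(XML_BYTES)
dom_values = [b.firstChild.data for b in dom_tree.getElementsByTagName("B")]

print(sax_values, dom_values)
```

A handler that knows in advance what to collect can gather everything in one pass, as here; the repeated parsing arises when lookups are issued one at a time and no state is carried between them.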

SAX versus DOM
- If your document is very large and you only need a few elements, use SAX
- If you need to process many elements and perform operations on the XML, use DOM
- If you need to access the XML many times, use DOM

Parser Products
- Xerces4J / Xerces4C++ (Apache)
- James Clark's XP (Java)
- IBM XML4J / XML4C++
- Java Project X (Sun)
- Oracle's XML Parser for Java
- MSXML (Microsoft)
- Dan Connolly's XML Parser (Python)

Conclusion
- The parser is a key building block of every XML application.
- When building XML applications, you have to think about how you will handle large chunks of data.
- Choosing between SAX and DOM is not always trivial.

The End

Questions?

Thank you!
