org.cyberneko.html

Class HTMLScanner.ContentScanner

Enclosing Class:
HTMLScanner
Implemented Interfaces:
HTMLScanner.Scanner

public class HTMLScanner.ContentScanner
extends java.lang.Object
implements HTMLScanner.Scanner

The primary HTML document scanner.
Author:
Andy Clark

Method Summary

protected void
addLocationItem(XMLAttributes attributes, int index)
Adds location augmentations to the specified attribute.
boolean
scan(boolean complete)
Scan.
protected boolean
scanAttribute(XMLAttributesImpl attributes, boolean[] empty)
Scans a real attribute.
protected boolean
scanAttribute(XMLAttributesImpl attributes, boolean[] empty, char endc)
Scans an attribute, pseudo or real.
protected void
scanCDATA()
Scans a CDATA section.
protected void
scanCharacters()
Scans characters.
protected void
scanComment()
Scans a comment.
protected void
scanEndElement()
Scans an end element.
protected boolean
scanMarkupContent(XMLStringBuffer buffer, char cend)
Scans markup content.
protected void
scanPI()
Scans a processing instruction.
protected boolean
scanPseudoAttribute(XMLAttributesImpl attributes)
Scans a pseudo attribute.
protected String
scanStartElement(boolean[] empty)
Scans a start element.

Method Details

addLocationItem

protected void addLocationItem(XMLAttributes attributes,
                               int index)
Adds location augmentations to the specified attribute.

scan

public boolean scan(boolean complete)
            throws IOException
Scan.
Specified by:
scan in interface HTMLScanner.Scanner

scanAttribute

protected boolean scanAttribute(XMLAttributesImpl attributes,
                                boolean[] empty)
            throws IOException
Scans a real attribute.
Parameters:
attributes - The list of attributes.
empty - Is used for a second return value to indicate whether the start element tag is empty (e.g. "/>").

scanAttribute

protected boolean scanAttribute(XMLAttributesImpl attributes,
                                boolean[] empty,
                                char endc)
            throws IOException
Scans an attribute, pseudo or real.
Parameters:
attributes - The list of attributes.
empty - Is used for a second return value to indicate whether the start element tag is empty (e.g. "/>").
endc - The end character that appears before the closing angle bracket ('>').

scanCDATA

protected void scanCDATA()
            throws IOException
Scans a CDATA section.

scanCharacters

protected void scanCharacters()
            throws IOException
Scans characters.

scanComment

protected void scanComment()
            throws IOException
Scans a comment.

scanEndElement

protected void scanEndElement()
            throws IOException
Scans an end element.

scanMarkupContent

protected boolean scanMarkupContent(XMLStringBuffer buffer,
                                    char cend)
            throws IOException
Scans markup content.

scanPI

protected void scanPI()
            throws IOException
Scans a processing instruction.

scanPseudoAttribute

protected boolean scanPseudoAttribute(XMLAttributesImpl attributes)
            throws IOException
Scans a pseudo attribute.
Parameters:
attributes - The list of attributes.

scanStartElement

protected String scanStartElement(boolean[] empty)
            throws IOException
Scans a start element.
Parameters:
empty - Is used for a second return value to indicate whether the start element tag is empty (e.g. "/>").

(C) Copyright 2002-2005, Andy Clark. All rights reserved.