LCC 6310 Computation as an Expressive Medium - PowerPoint PPT Presentation

1 / 17

About This Presentation

Title:

LCC 6310 Computation as an Expressive Medium

Description:

Number of Views:32

Avg rating:3.0/5.0

Slides: 18

Provided by: michael74

Category:

Tags: lcc | computation | disturbing | expressive | medium

Transcript and Presenter's Notes

Title: LCC 6310 Computation as an Expressive Medium

1
LCC 6310Computation as an Expressive Medium

2
Parsing HTML

3
Accessing an external class library

Place jars (or class files) within the libraries
folder in processing
For example, if the class library is called
htmlparser, create an htmlparser/library folder
within the libraries folder and add the jar files
Jason will go over how to use external class
libraries in your exported applets
Use the import statement to bring the external
classes into your program
Class libraries live in packages
import ltpackagenamegt. means make all the
classes in ltpackagenamegt available for use in my
program
Package names can be hierarchical (e.g.
org.htmlparser.util).
If you get an error that an htmlparser class can
not be found, look in the documentation to see
what package you need to import into your program
The imported packages in the example code should
get your pretty far

4
The Parser class

Parser is the main class for parsing the html
file pointed to by a URL
What is parsing? To parse an html file means to
turn the raw text of the page into structured
tags that you can process
Look for specific tags
Get attributes from tags
One way to parse an HTML file
Parser parser new Parser(ltURLgt)
NodeList nodeList parser.parse(null)
Parser.parse(NodeFilter filter) returns a
NodeList of all html nodes (tags) that satisfy
the filter
The null filter returns a list containing all the
tags

5
The try/catch block

Exceptions are thrown whenever java encounters an
error situation
Youve probably all run into exceptions, like the
NullPointerException
A bunch of different exceptions are defined by
Java, but programmers can define their own
When an exception is thrown, it travels up the
call stack (the stack of method calls)
The default behavior for exceptions is for them
to travel all the way to the top, where they
terminate your program (and print out the
exception)
Sometimes, however, you want to handle an
exception yourself and keep on going. You do this
with the try ltstatementsgt catch
(ltExceptionClassgt e) ltexception codegt
The idea is that you want your program to keep
goinig, so you write special code to clean up
after the error
You can declare that a method can throw specific
exceptions if you do, any caller of the method
must handle the exception with a try/catch
Parser.parse() does this, so we have to handle
the exception

6
NodeLists

7
Traversing the hierarchical structure

NodeList Node.getChildren()
Get a list of the children nodes of a node
So, to search the nodeList for specific nodes,
you need to search the top level, then search
within the children of the top level, and so
forth
NodeList NodeList.searchFor(Class classType,
boolean recursive)
If the second parameter is true, it will look in
the children lists for you
Need to tell it what class of node youre looking
for
In Java, classes are themselves objects
To get the Class object corresponding to the
class for an object, call getClass() on any
object

8
Example find all the images on a page

First create a throw away instance of the tag
youre looking for just need it to get your
hands on the class
ImageTag tempImageTag new ImageTag()
Use NodeList.searchFor() to create a list of just
the image tags (searching recursively)
NodeList imageList nodeList.searchFor(tempImageT
ag.getClass(), true)

9
Getting tag attributes

10
Example Drawing images you find on a web site

11
Example Only drawing images with the given alt
text

Just like we used getAttribute() to get the image
source, use getAttribute() to get the image text
(the ALT attribute)
Use string methods to see if the alt text
contains the text you care about
String.indexOf(lttestStringgt) if lttestStringgt
appears anywhere within the String, returns the
location within the String, otherwise -1
Lets use this to get only the images we care
about from cnn.com.

12
Following links

13
Non-disturbing code

14
Disturbing code

15
Making our disturbing code work

void method1(int depth)
if (depth lt 0) return
else
println(beginning of method 1)
method1(depth 1)
println(End of method 1)
Good recursive functions have a base case (where
they stop) and a recursive case (where they call
themselves)

16
Example Following links (recursively)