Course Overview - PowerPoint PPT Presentation

1 / 33
About This Presentation
Title:

Course Overview

Description:

... allocated in the 'method area' where also code for methods and representations ... method is declared in an interface (requires a different kind of method lookup) ... – PowerPoint PPT presentation

Number of Views:67
Avg rating:3.0/5.0
Slides: 34
Provided by: csUa
Category:
Tags: area | code | course | lookup | overview

less

Transcript and Presenter's Notes

Title: Course Overview


1
Course Overview
  • PART I overview material
  • 1 Introduction
  • 2 Language processors (tombstone diagrams,
    bootstrapping)
  • 3 Architecture of a compiler
  • PART II inside a compiler
  • 4 Syntax analysis
  • 5 Contextual analysis
  • 6 Runtime organization
  • 7 Code generation
  • PART III conclusion
  • Interpretation
  • 9 Review

Supplementary material Javas runtime
organization and the Java Virtual Machine
2
What This Topic is About
We look at the JVM as an example of a real-world
runtime system for a modern object-oriented
programming language.
JVM is probably the most common and widely used
VM in the world, so youll get a better idea what
a real VM looks like.
  • JVM is an abstract machine.
  • What is the JVM architecture?
  • What is the structure of .class files?
  • How are JVM instructions executed?
  • What is the role of the constant pool in
    dynamic linking?

Also visit this site for more complete
information about the JVM http//java.sun.com/doc
s/books/vmspec/2nd-edition/html/VMSpecTOC.doc.html
3
Recap Interpretive Compilers
  • Why?
  • A tradeoff between fast(er) compilation and a
    reasonable runtime performance.
  • How?
  • Use an intermediate language
  • more high-level than machine code easier to
    compile to
  • more low-level than source language easy to
    implement as an interpreter
  • Example A Java Development Kit for machine M

JavaJVM
JVM M
M
4
Abstract Machines
Abstract machine implements an intermediate
language in between the high-level language (e.g.
Java) and the low-level hardware (e.g. Pentium)
Implemented in Java Machine independent
Java
Java
High level
Java compiler
JVM (.class files)
Java JVM interpreteror JVM JIT compiler
Pentium
Pentium
Low level
5
Abstract Machines
  • An abstract machine is intended specifically as a
    runtime system for a particular (kind of)
    programming language.
  • JVM is a virtual machine for Java programs.
  • It directly supports object-oriented concepts
    such as classes, objects, methods, method
    invocation etc.
  • Easy to compile Java to JVM 1. easy to
    implement compiler 2. fast compilation
  • Another advantage portability

6
Class Files and Class File Format
External representation (platform independent)
JVM
Internal representation (implementation dependent)
.class files
load
classes
primitive types
arrays
objects
strings
methods
The JVM is an abstract machine in the truest
sense of the word. The JVM specification does not
give implementation details (can be dependent on
target OS/platform, performance requirements,
etc.) The JVM specification defines a machine
independent class file format that all JVM
implementations must support.
7
Data Types
  • JVM (and Java) distinguishes between two kinds of
    types
  • Primitive types
  • boolean boolean
  • numeric integral byte, short, int, long, char
  • numeric floating point float, double
  • internal, for exception handling returnAddress
  • Reference types
  • class types
  • array types
  • interface types

Note Primitive types are represented directly,
reference types are represented indirectly (as
pointers to array or class instances).
8
JVM Runtime Data Areas
Besides OO concepts, JVM also supports
multi-threading. Threads are directly supported
by the JVM.
Two kinds of runtime data areas 1. shared
between all threads 2. private to a single
thread
Shared
Thread 1
Thread 2
pc
pc
Garbage Collected Heap
Java Stack
Native Method Stack
Java Stack
Native Method Stack
Method area
9
Java Stacks
  • JVM is a stack based machine, much like TAM.
  • JVM instructions
  • implicitly take arguments from the stack top
  • put their result on the top of the stack
  • The stack is used to
  • pass arguments to methods
  • return a result from a method
  • store intermediate results while evaluating
    expressions
  • store local variables

This works similarly to (but not exactly the same
as) what we previously discussed about
stack-based storage allocation and routines.
10
Stack Frames
The Java stack consists of frames. The JVM
specification does not say exactly how the stack
and frames should be implemented. The JVM
specification specifies that a stack frame has
areas for
Pointer to runtime constant pool
args
A new call frame is created by executing some JVM
instruction for invoking a method (e.g.
invokevirtual, invokenonvirtual, ...)
local vars
operand stack
The operand stack is initially empty, but grows
and shrinks during execution.
11
Stack Frames
The role/purpose of each of the areas in a stack
frame
pointer to constant pool
Used implicitly when executing JVM instructions
that contain entries into the constant pool (more
about this later).
args
Space where the arguments and local variables of
a method are stored. This includes a space for
the receiver (this) at position/offset 0.
local vars
operand stack
Stack for storing intermediate results during the
execution of the method. Initially it is
empty. The maximum depth is known at compile
time.
12
Stack Frames
An implementation using registers such as SB, ST,
and LB and a dynamic link is one possible
implementation.
to previous frame on the stack
SB
LB
dynamic link
to runtime constant pool
JVM instructions store and load (for accessing
args and locals) use addresses which are
numbers from 0 to args locals - 1
args
local vars
operand stack
ST
13
JVM Interpreter
The core of a JVM interpreter is basically
this do byte opcode fetch an opcode
switch (opcode) case opCode1
fetch operands for opCode1 execute
action for opCode1 break case
opCode2 fetch operands for opCode2
execute action for opCode2
break case ... while (more to do)
14
Instruction-set typed instructions!
JVM instructions are explicitly typed different
opCodes for instructions for integers, floats,
arrays, reference types, etc. This is reflected
by a naming convention in the first letter of the
opCode mnemonics
Example different types of load instructions
integer load long load float load double
load reference-type load
iload lload fload dload aload
15
Instruction set kinds of operands
JVM instructions have three kinds of operands
- from the top of the operand stack - from the
bytes following the opCode - part of the opCode
itself Each instruction may have different
forms supporting different kinds of operands.
Example different forms of iload
Assembly code
Binary instruction code layout
iload_0 iload_1 iload_2 iload_3
26 27 28 29
iload n
21
n
wide iload n
196
n
21
16
Instruction-set accessing arguments and locals
arguments and locals area inside a stack frame
0
args indexes 0 .. args - 1
1
2
3
locals indexes args .. args locals - 1
  • A load instruction takes something from the
    args/locals area and pushes it onto the top of
    the operand stack.
  • A store instruction pops something from the top
    of the operand stack and places it in the
    args/locals area.

Instruction examples
iload_1 iload_3 aload 5 aload_0
istore_1 astore_1 fstore_3
17
Instruction-set non-local memory access
In the JVM, the contents of different kinds of
memory can be accessed by different kinds of
instructions.
accessing locals and arguments load and store
instructions
accessing fields in objects getfield, putfield
accessing static fields getstatic, putstatic
Note Static fields are a lot like global
variables. They are allocated in the method
area where also code for methods and
representations for classes (including method
tables) are stored.
Note getfield and putfield access memory in the
heap.
Note JVM doesnt have anything similar to
registers L1, L2, etc.
18
Instruction-set operations on numbers
Arithmetic
add iadd, ladd, fadd, dadd subtract isub,
lsub, fsub, dsub multiply imul, lmul, fmul, dmul
etc.
Conversion
i2l, i2f, i2d, l2f, l2d, f2d, f2i, d2i,
19
Instruction-set
Operand stack manipulation pop, pop2, dup, dup2,
swap, Control transfer Unconditional goto,
jsr, ret, Conditional ifeq, iflt, ifgt,
if_icmpeq,
20
Instruction-set
Method invocation invokevirtual usual
instruction for calling a method on an object.
invokeinterface same as invokevirtual, but used
when the called method is declared in an
interface (requires a different kind of method
lookup) invokespecial for calling things such as
constructors, which are not dynamically
dispatched (this instruction is also known as
invokenonvirtual). invokestatic for calling
methods that have the static modifier (these
methods are sent to a class, not to an
object). Returning from methods return,
ireturn, lreturn, areturn, freturn,
21
Instruction-set Heap Memory Allocation
Create new class instance (object) new Create
new array newarray for creating arrays of
primitive types. anewarray, multianewarray for
arrays of reference types.
22
Instructions and the Constant Pool
Many JVM instructions have operands which are
indexes pointing to an entry in the so-called
constant pool. The constant pool contains all
kinds of entries that represent symbolic
references for linking. This is the way that
instructions refer to things such as classes,
interfaces, fields, methods, and constants such
as string literals and numbers.
These are the kinds of constant pool entries that
exist
  • Integer
  • Float
  • Long
  • Double
  • Name_and_Type_info
  • Utf8_info (Unicode characters)
  • Class_info
  • Fieldref_info
  • Methodref_info
  • InterfaceMethodref_info
  • String

23
Instructions and the Constant Pool
Example We examine the getfield instruction in
detail.
180
indexbyte1
indexbyte2
Format
Utf8Info fully qualified class name
Class_info u1 tag u2 name_index
CONSTANT_Fieldref_info u1 tag u2
class_index u2 name_and_type_index
Utf8Info name of field
CONSTANT_Name_and_Type_info u1 tag u2
name_index u2 descriptor_index
Utf8Info field descriptor
24
Instructions and the Constant Pool
That previous picture is rather complicated,
lets simplify it a little
180
indexbyte1
indexbyte2
Format
Fieldref
Class
Name_and_Type
Utf8Info fully qualified class name
Utf8Info name of field
Utf8Info field descriptor
25
Instructions and the Constant Pool
The constant entries format is part of the Java
class file format.
Luckily, we have a Java assembler that allows us
to write a kind of textual assembly code and that
is then transformed into a binary .class file.
This assembler takes care of creating the
constant pool entries for us. When an instruction
operand expects a constant pool entry the
assembler allows you to enter the entry in
place in an easy syntax. Example getfield
mypackage/Queue i I
26
Instructions and the Constant Pool
Fully qualified class names and descriptors in
constant pool UTF8 entries.
1. Fully qualified class name a package class
name string. Note this uses / instead of . to
separate each level along the path. 2.
Descriptor a string that defines a type for a
method or field.
Java descriptor boolean Z integer I Object Ljava/
lang/Object String Ljava/lang/String int
foo(int,Object) (ILjava/lang/Object)I
27
Linking
In general, linking is the process of resolving
symbolic references in binary files. Most
programming language implementations have what we
call separate compilation. Modules or files
can be compiled separately and transformed into
some binary format. But since these separately
compiled files may have connections to other
files, they have to be linked. The binary
file is not yet executable, because it has some
kind of symbolic links in it that point to
things (classes, methods, functions, variables,
etc.) in other files/modules. Linking is the
process of resolving these symbolic links and
replacing them by real addresses so that the code
can be executed.
28
Loading and Linking in JVM
In JVM, loading and linking of class files
happens at runtime, while the program is
running! Classes are loaded as needed. The
constant pool contains symbolic references that
need to be resolved before a JVM instruction that
uses them can be executed (this is the equivalent
of linking). In JVM a constant pool entry is
resolved the first time it is used by a JVM
instruction.
Example When a getfield is executed for the
first time, the constant pool entry index in the
instruction can be replaced by the offset of the
field.
29
Closing Example
As a closing example on the JVM, we will take a
look at the compiled code of the following simple
Java class declaration.
class Factorial int fac(int n)
int result 1 for (int i2 i result result i
return result
30
Compiling and Disassembling
javac Factorial.java javap -c -verbose
Factorial Compiled from Factorial.java class
Factorial extends java.lang.Object
Factorial() / Stack1, Locals1,
Args_size1 / int fac(int) /
Stack2, Locals4, Args_size2 / Method
Factorial() 0 aload_0 1 invokespecial 1
4 return
31
Compiling and Disassembling ...
// address 0 1 2
3 Method int fac(int) // stack this n result i
0 iconst_1 // stack this n result i 1
1 istore_2 // stack this n result i
2 iconst_2 // stack this n result
i 2 3 istore_3 // stack this n
result i 4 goto 14 7 iload_2 //
stack this n result i result 8 iload_3
// stack this n result i result i 9 imul
// stack this n result i resulti 10
istore_2 // stack this n result i 11
iinc 3 1 // stack this n result i 14
iload_3 // stack this n result i i 15
iload_1 // stack this n result i i n 16
if_icmplt 7 // stack this n result i 19
iload_2 // stack this n result i result
20 ireturn
32
Writing Factorial in jasmin
Jasmin is a Java Assembler Interface. It takes
ASCII descriptions for Java classes, written in a
simple assembler-like syntax and using the Java
Virtual Machine instruction set. It converts
them into binary Java class files suitable for
loading into a JVM implementation.
.class package Factorial .super
java/lang/Object .method package (
)V .limit stack 50 .limit locals
1 aload_0 invokenonvirtual java/lang/Object/
( )V return .end method
33
Writing Factorial in jasmin (continued)
  • .method package fac(I)I
  • .limit stack 50
  • .limit locals 4
  • iconst_1
  • istore 2
  • iconst_2
  • istore 3
  • Label_1
  • iload 3
  • iload 1
  • if_icmplt Label_4
  • iconst_0
  • goto Label_5
  • Label_4
  • iconst_1
  • Label_5
  • ifeq Label_2
  • iload 2
  • iload 3
  • imul
  • dup
  • istore 2
  • pop
  • Label_3
  • iload 3
  • dup
  • iconst_1
  • iadd
  • istore 3
  • pop
  • goto Label_1
  • Label_2
  • iload 2
  • ireturn
  • iconst_0
  • ireturn
Write a Comment
User Comments (0)
About PowerShow.com