IA32 programming for Linux presentation

1
IA32 programming for Linux

Concepts and requirements for writing Linux
assembly language programs for Pentium CPUs

2
A source program's format

3
Statement Layout (1950s)

4
The as program

5
The label field

A label is a symbol followed by a colon (:)
The programmer invents his own symbols
Symbols can use letters and digits, plus a very
small number of special characters ( ., _,
$ )
A symbol is allowed to be of arbitrary length
The Linux assembler (as) was designed for
translating source-text produced by a high-level
language compiler (such as cc)
But humans can also write such files directly

6
The opcode field

Opcodes are predefined symbols that are
recognized by the GNU assembler
There are two categories of opcodes (called
instructions and directives)
Instructions represent operations that the CPU
is able to perform (e.g., add, inc)
Directives are commands that guide the work of
the assembler (e.g., .globl, .int)

7
Instructions vs Directives

Each instruction gets translated by as into a
machine-language statement that will be fetched
and executed by the CPU when the program runs
(i.e., at runtime)
Each directive modifies the behavior of the
assembler (i.e., at assembly time)
With GNU assembly language, they are easy to
distinguish: directives begin with a dot (.)

8
A list of the Pentium opcodes

An official list of the instruction codes can
be found in Intel's programmer manuals
http://developer.intel.com
But it's three volumes, nearly 1000 pages (it
describes everything about Pentiums)
An unofficial list of (most) Intel instruction
codes can fit on one sheet, front and back
http://www.jegerlehner/intel/

9
The AT&T syntax

The GNU assembler uses AT&T syntax (instead of
official Intel/Microsoft syntax) so the opcode
names differ slightly from names that you will
see on those lists
Intel-syntax        AT&T-syntax
---------------     ----------------------
ADD            ->   addb/addw/addl
INC            ->   incb/incw/incl
CMP            ->   cmpb/cmpw/cmpl

10
The UNIX culture

Linux is intended to be a version of UNIX (so
that UNIX-trained users already know Linux)
UNIX was developed at AT&T (in the early 1970s) and
AT&T's computers were built by DEC, thus UNIX
users learned DEC's assembly language
Intel was an early ally of DEC's competitor, IBM,
which deliberately used incompatible designs
Also an East Coast versus West Coast thing
(California, versus New York and New Jersey)

11
Bytes, Words, Longwords

CPU instructions usually operate on data-items
Only certain sizes of data are supported
BYTE: one byte consists of 8 bits
WORD: consists of two bytes (16 bits)
LONGWORD: uses four bytes (32 bits)
With AT&T's syntax, an instruction's name also
incorporates its effective data-size (as a
suffix: b, w, or l)
With Intel syntax, data-size usually isn't
explicit, but is inferred by context (i.e., from
operands)
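
To make the size suffixes concrete, here is a minimal Python sketch (not assembly) that models how the same addition behaves at each data-size. The names WIDTH_BITS and add_with_suffix are hypothetical, invented for this illustration; Python integers merely stand in for CPU operands.

```python
# Hypothetical model: the suffix letter picks how many bits of the result survive.
WIDTH_BITS = {"b": 8, "w": 16, "l": 32}  # byte, word, longword


def add_with_suffix(suffix, a, b):
    """Add a and b, keeping only as many low bits as the suffix allows."""
    mask = (1 << WIDTH_BITS[suffix]) - 1
    return (a + b) & mask


if __name__ == "__main__":
    # The same operands give different results depending on the data-size.
    for suffix in "bwl":
        result = add_with_suffix(suffix, 0xFF, 1)
        print(f"add{suffix}: 0xFF + 1 = {result:#x}")
```

With a byte-sized add the sum 0xFF + 1 wraps around to zero, while the word and longword versions have room for the carry.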

12
The operand field

Operands can be of several types:
-- a CPU register may hold the datum
-- a memory location may hold the datum
-- an instruction can have built-in data
-- frequently there are multiple data-items
-- and sometimes there are no data-items
An instruction's operands usually are explicit,
but in a few cases they also could be implicit

13
Examples of operands

14
The comment field

An assembly language program often can be hard
for a human being to understand
Even a program's author may not be able to recall
his programming idea after a while
So programmer comments can be vital
A comment begins with the # character
The assembler disregards all comments (but they
will appear in program listings)

15
Directives

Sometimes called pseudo-instructions
They tell the assembler what to do
The assembler will recognize them
Their names begin with a dot (.)
Examples: .section, .global, .int
The names of valid directives appear in the
table-of-contents of the GNU manual

16
New program example

Let's look at a demo program (squares.s)
It prints out a mathematical table showing some
numbers and their squares
But it doesn't use any multiplications!
It uses an algorithm based on algebra:
(n+1)^2 - n^2 = n + n + 1
If you already know the square of a given
number n, you can get the square of the
next number n+1 by just doing additions
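
The additions-only idea can be sketched in a few lines of Python. This is only an illustration of the algebra, not the squares.s program itself; the function name squares_table and its limit parameter are hypothetical.

```python
def squares_table(limit):
    """Return (n, n squared) pairs for n = 1..limit, using additions only."""
    rows = []
    n, square = 1, 1                 # start from the known square of 1
    while n <= limit:
        rows.append((n, square))
        square = square + n + n + 1  # (n+1)^2 = n^2 + n + n + 1
        n = n + 1
    return rows


if __name__ == "__main__":
    for n, square in squares_table(10):
        print(f"{n:4d} {square:6d}")
```

Each pass through the loop reuses the previous square, so no multiply operation is ever needed.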

17
A program with a loop

18
Some new directives

19
Some new instructions

inc: adds one to the specified operand
     incl arg
cmp: compares two specified operands
     cmpl $max, arg
jle: jump (to a specified instruction) if the
     condition "less than or equal to" is true
     jle again
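
These three instructions together form a loop whose test comes at the bottom. The Python sketch below models that control flow under the assumption that the compare reads "cmpl $max, arg" (so the jump is taken while arg is less than or equal to max). The function name count_passes and its parameters are hypothetical.

```python
def count_passes(max_value, start=1):
    """Model a bottom-tested loop: body, increment, compare, conditional jump."""
    arg = start
    passes = 0
    while True:               # again:
        passes += 1           #     (loop body runs here)
        arg += 1              #     incl arg
        if arg <= max_value:  #     cmpl $max, arg  /  jle again
            continue
        break                 # fall through when arg > max
    return passes


if __name__ == "__main__":
    print(count_passes(20))   # body runs once for each arg value 1..20
```

Because the test sits after the body, the body always runs at least once, even when the starting value already exceeds the limit.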

20
In-class Exercise

Can you write a program that prints out a table
showing powers of 2 (very useful for computer
science students to have handy)?
Can you see how to do it without using any
multiply operations, just additions?
Hint: study the squares.s source-code
Then write your own powers.s solution
Turn in printouts (source and its output)
