Title: CS 2104 Type checking / Polymorphism
1CS 2104 Type checking / Polymorphism
- Reading Chapters 3.1, 4.3, 4.4.1
- Dr. Abhik Roychoudhury
- Adapted from Goh Aik Huis lecture
notes.
2Overview
- Types
- Motivation for typed languages
- Issues in Type Checking
- How to type check?
- How to cater for Polymorphism
- Type Equivalence
- When to type check?
- Strong and Weak typed languages
31. Motivation for typed languages
- Untyped Languages perform any operation on any
data. - Example Assembly
- movi 5 r0 // Move integer 5 (2s complement) to
r0 - addf 3.6 r0 // Treat bit representation in r0
as a - // Floating point representation and add 3.6
- // to it.
- Result? You can be sure that r0 does not contain
8.6! - () Flexibility I can do anything I want to
and you cant stop me - () Ease of Error Checking. (programs are prone
to errors, especially huge ones). I am human,
my brain is limited, I cant remember and monitor
everything.
41. Motivation for typed languages
- Typed Languages
- A type represents a set of values. Programs /
procedures / operators are functions from an
input type to an output type. - Type Checking is the activity of ensuring that
the operands / arguments of an operator /
procedure are of compatible type - This is done by using a set of rules for
associating a type with every expression in the
language. (These rules are known as the type
system). - A type error results when an operator is applied
to an operand of inappropriate/incompatible type. - Output of a type system
- There are type-errors (wrt type system) gt
Program is NOT type-safe. - There are no type-errors (wrt type system) gt
Program is type-safe.
51. Motivation for typed languages
Program really has errors
Program really does not have errors
TC says that there are type errors
TC errs on the conservative side
Usually true
Possible
TC says that there are no type errors
- Program MAY still have errors
- It may still have type errors due to unsafe
features of a language. This is due to bad type
system design. - It may have logic errors. This serves to show
that type errors is but one of the many errors
you encounter.
61. Motivation for typed languages
Program really has errors
Program really does not have errors
TC says that there are type errors
TC errs on the conservative side
Usually true
Possible
Typed Languages () Error Detection () Program
documentation () Loss of Flexibility (but its
ok, I dont lose much freedom anyway since I
dont usually program in that way in the first
place. I gain more than what I lose).
72. Issues in Type Checking
- How to type-check?
- How to cater for polymorphism?
- What is your definition of compatible type?
- When to perform type checking?
- Is your language strongly or weakly typed?
82.1 How to type-check?
- Definition
- Type statements are of the form
- ltexprgt lttypegt
- meaning that an expression ltexprgt
is-of-the-type (the symbol) lttypegt. - Examples
- 3 int
- 34 int
- 3.14 real
- abc String
- while (x lt 5) x Stmt
92.1 How to type-check?
- Definition
- Type rules are of the form
where each ei ti is a type statement, n ³
0. The rule is interpreted as IF e1 is of type
t1 and and en is of type tn THEN f e1 e2 en is
of type t. This is similar to the rules we saw
when we studied semantics.
102.1 How to type-check?
- Examples of type rules
- Rule for constants
- Rule for boolean comparison
112.1 How to type-check?
- Rule for assignment statement
122.1 How to type-check?
And Given the rules
- Given the program
- int x
-
- x x1
-
A program/expression is type-safe if we can
construct a derivation tree to give a type for
that program/expression.
xx1 Stmt
132.1 How to type-check?
And Given the rules
- Given the program
- int x float y
-
- if (x 3)
- y x
- else
- x x1
-
-
1 int
2 int
3 int
A program/expression is type-safe if we can
construct a derivation tree to give a type for
that program/expression.
Follow the rules! Try to build tree. Cannot
build tree gt Not type safe
if (x3) yx else xx1 Stmt
14Issues in Type Checking
- How to type-check?
- How to cater for polymorphism?
- What is your definition of compatible type?
- When to perform type checking?
- Is your language strongly or weakly typed?
152.2 How to cater for Polymorphism
- Polymorphism poly (many) morph (form)
- Polymorphism is the ability of a data object to
that can take on or assume many different forms. - Polymorphism can be categorised into 2 types
- Ad-hoc Polymorphism ( discussed now)
- Universal Polymorphism
- Parametric (discussed with Functional
Programming) - Inclusion (discussed later in the lecture)
162.2 How to cater for Polymorphism
Cardelli and Wegners classification (1985)
Ad-Hoc polymorphism is obtained when a function
works, or appears to work on several different
types (which may not exhibit a common structure)
and may behave in unrelated ways for each type.
Universal polymorphism is obtained when a
function works uniformly on a range of types
these types normally exhibit some common
structure.
172.2 Polymorphism Coercion
COERCION A coercion is a operation that converts
the type of an expression to another type. It is
done automatically by the language compiler. (If
the programmer manually forces a type conversion,
its called casting)
E int
(Int-Float Coercion)
E float
int x float y ... y x ...
182.2 Polymorphism Coercion
Example of the use of COERCION
- int x float y
-
- if (x 3)
- y x
- else
- x x1
-
-
1 int
2 int
3 int
(Coercion)
y float x float
()
if (x3) yx else xx1 Stmt
192.2 Polymorphism Coercion
Widening coercion converts a value to a type that
can include (at least approximations of) all of
the values of the original type. Widening is safe
most of the time. It can be unsafe in certain
cases.
Narrowing coercion converts a value to a type
that cannot store (even approximations of) all of
the values of the original type. Narrowing is
unsafe. Information is lost during conversion of
type.
202.2 Polymorphism Coercion
- Coercions
- () Increase flexibility in programming
- Example
- float x,y,z
- int a,b,c
- If I have no coercions, and I intend to add y and
a and store in x, then writing - x y ((float) a)
- is too much of a hassle.
- Therefore coercion can be useful.
212.2 Polymorphism Coercion
- Coercions
- () Decrease Reliability (error detection)
- Example
- float x,y,z
- int a,b,c
- If I have coercions and I intend to add x and y
and store in z, but I accidentally write - z x a
- then my error will go undetected because the
compiler will simply coerce the a to a float. - Therefore coercion can lead to problems.
222.2 Polymorphism Overloading
OVERLOADING An overloaded operation has different
meanings, and different types, in different
contexts.
232.2 Polymorphism Overloading
if (x3) xyz else abc Stmt
242.2 Polymorphism Overloading
- Overloading
- () Increase flexibility in programming
- Examples are when user wants to use an operator
to express similar ideas. - Example
- int a,b,c
- int p10, q10, r10
- int x1010, y1010, z1010
- a b c // integer multiplication
- p a q // Scalar multiplication
- x y z // Matrix multiplication
- Therefore overloading is good.
252.2 Polymorphism Overloading
- Overloading
- () Decrease Reliability (error detection)
- Examples are when user intends to use the
operator in one context, but accidentally uses it
in another. - Example
- In many languages, the minus sign is overloaded
to both unary and binary uses. - x zy and x -y
- will both compile. What if I intend to do the
first, but accidentally leave out the z? - Similarly, in C, we can have a situation when
- x zy and x y
- will both compile. Is overloading good?
262.2 Polymorphism Overloading
Overloading () Decrease Reliability (error
detection)
- Even for common operations, overloading may not
be good. - Example
- int sum, count
- float average
- ...
- average sum / count
- Since sum and count are integers, integer
division is performed first before result is
coerced to float. - Thats why Pascal has div for integer division
and / for floating point division.
272.2 How to cater for Polymorphism
Cardelli and Wegners classification (1985)
Ad-Hoc polymorphism is obtained when a function
works, or appears to work on several different
types (which may not exhibit a common structure)
and may behave in unrelated ways for each type.
Universal polymorphism is obtained when a
function works uniformly on a range of types
these types normally exhibit some common
structure.
282.2 Inclusion Polymorphism
- Q Is the subclass regarded as a subtype of the
parent class? - Yes Inclusion Polymorphism (Sub-typing)
- class A
- class B extends A
- Note that B Í A (Inclusion)
- A a new B()
- A a new A()
292.2 Inclusion Polymorphism
- Q Is the subclass regarded as a subtype of the
parent class? - Yes Inclusion Polymorphism (Sub-typing)
- Some people call it the IS-A relationship between
parent and derived class. - class Table extends Furniture
- Table IS-A Furniture.
- Table Í Furniture
302.2 Inclusion Polymorphism
- Variables are polymorphic since they can refer
to the declared class and to subclasses too. - Requirement (Do you know why?)
- Subclass must INHERIT EVERYTHING from the base
class. - Subclass must NOT MODIFY ACCESS CONTROL of the
base class methods/data. - Thats why C Inclusion Polymorphism definition
adds a public to the derived class since a
private derived class modifies access control of
base class methods/data.
31Issues in Type Checking
- How to type-check?
- How to cater for polymorphism?
- What is your definition of compatible type?
- When to perform type checking?
- Is your language strongly or weakly typed?
322.3 Type Equivalence
- type // type definitions
- Q array 1..10 of integer
- S array 1..10 of integer
- T S
- var // variable declarations
- a Q
- b S
- c T
- d array 1..10 of integer
- begin
- a b // Is this allowed?
- // Meaning to say Is a and b
- // the same type?
- a c // Is this allowed?
- a d // Is this allowed?
- b c // Is this allowed?
- type // type definitions
- Queue array 1..10 of integer
- Stack array 1..10 of integer
- Tree Stack
- var // variable declarations
- a Queue
- b Stack
- c Tree
- d array 1..10 of integer
- begin
- a b // Is this allowed?
- // Meaning to say Is a and b
- // the same type?
- a c // Is this allowed?
- a d // Is this allowed?
- b c // Is this allowed?
If you had said yes to most of it, chances are
that you are adopting structural equivalence. If
you had said no most of the time, then it is
likely you are adopting name equivalence.
332.3 Type Equivalence
- Difference between type names and anonymous type
names. - The type of a variable is either described
through - A type name (1) those names defined using a type
definition command. (eg. type for Pascal,
typedef for C.), or (2) the primitive numeric
types (eg. int, float) - Or directly through a type constructor (eg.
array-of, record-of, pointer-to). In this case,
the variable has an anonymous type name.
342.3 Type Equivalence
- type // type definitions
- Q array 1..10 of integer
- S array 1..10 of integer
- T S
- var // variable declarations
- a Q
- b S
- c T
- d array 1..10 of integer
- begin
- a b // Is this allowed?
- // Meaning to say Is a and b
- // the same type?
- a c // Is this allowed?
- a d // Is this allowed?
- b c // Is this allowed?
352.3 Type Equivalence
- When are two types equivalent (º)?
- Rule 1 For any type name T, T º T.
- Rule 2 If C is a type constructor and T1 º T2,
then CT1 º CT2 . - Rule 3 If it is declared that type name T,
then name º T. - Rule 4 (Symmetry) If T1 º T2,then T2 º T1.
- Rule 5 (Transitivity) If T1 º T2 and T2 º T3,
then T1 º T3. - What rules do you want to use?
362.3 Type Equivalence
- When are two types equivalent (º)?
- Rule 1 For any type name T, T º T.
- Rule 2 If C is a type constructor and T1 º T2,
then CT1 º CT2 . - Rule 3 If it is declared that type name T,
then name º T. - Rule 4 (Symmetry) If T1 º T2,then T2 º T1.
- Rule 5 (Transitivity) If T1 º T2 and T2 º T3,
then T1 º T3. - Structural Equivalence will use all the rules to
check for type equivalence.
372.3 Type Equivalence
- When are two types equivalent (º)?
- Rule 1 For any type name T, T º T.
- Rule 2 If C is a type constructor and T1 º T2,
then CT1 º CT2 . - Rule 3 If it is declared that type name T,
then name º T. - Rule 4 (Symmetry) If T1 º T2,then T2 º T1.
- Rule 5 (Transitivity) If T1 º T2 and T2 º T3,
then T1 º T3. - (Pure) Name Equivalence will use only the first
rule. Unless the two variables have the same
type name, they will be treated as different type
?
382.3 Type Equivalence
- When are two types equivalent (º)?
- Rule 1 For any type name T, T º T.
- Rule 2 If C is a type constructor and T1 º T2,
then CT1 º CT2 . - Rule 3 If it is declared that type name T,
then name º T. - Rule 4 (Symmetry) If T1 º T2,then T2 º T1.
- Rule 5 (Transitivity) If T1 º T2 and T2 º T3,
then T1 º T3. - Declarative Equivalence will leave out the second
rule.
392.3 Type Equivalence
R1 For any type name T, T º T. R2 If C is a
type constructor and T1 º T2, then CT1 º CT2
. R3 If it is declared that type name T, then
name º T. R4 (Symmetry) If T1 º T2,then T2 º
T1. R5 (Transitivity) If T1 º T2 and T2 º T3,
then T1 º T3.
- type // type definitions
- Q array 1..10 of integer
- S array 1..10 of integer
- T S
- var // variable declarations
- a,x Q
- b S
- c T
- d array 1..10 of integer
- e array 1..10 of integer
- begin
- a x // Is this allowed?
- // Meaning to say Is a and b
- // the same type?
- a b // Is this allowed?
- a c // Is this allowed?
- a d // Is this allowed?
402.3 Type Equivalence
- Name Equivalence
- Easy to implement checking, since we need only
compare the name. - Very restrictive, inflexible.
- type idxtype 1..100
- var count integer
- index idxtype
- Structure Equivalence
- Harder to implement since entire structures must
be compared. Other issues to consider eg.
arrays with same sizes but different subscripts
are they the same type? (similar for records and
enumerations) - More flexible, yet the flexibility can be bad
too. - type celsius real
- fahrenheit real
- var x celsius
- y fahrenheit
- ...x y // Allowed?
412.3 Type Equivalence
- Different Languages adopt different rules. And
the rules may change for one language (people can
change their minds too!) - Pascal
- Before 1982 unknown.
- ISO1982 Declarative Equivalence.
- ISO1990 Structural Eqivalence.
- C Structural Equivalence, except for structs
and unions, for which C uses declarative
equivalence. If the two structs are in different
files, then C goes back to structural
equivalence. - C Name Equivalence
- Haskell/SML Structural Equivalence.
42Issues in Type Checking
- How to type-check?
- How to cater for polymorphism?
- What is your definition of compatible type?
- When to perform type checking?
- Is your language strongly or weakly typed?
432.4 When to perform Type Checking?
442.4 When to perform Type Checking?
- Static Type Checking done at compile time.
- () Done only once
- () Earlier detection of errors
- () Less Program Flexibility (Fewer shortcuts and
tricks)
452.4 When to perform Type Checking?
- Dynamic Type Checking done at run time.
- () Done many times
- () Late detection of errors
- () More memory needed, since we need to maintain
type information of all the current values in
their respective memory cells. - () Slows down overall execution time, since
extra code is inserted into the program to detect
type error. - () Program Flexibility (Allows you to hack
dirty code.)
46Issues in Type Checking
- How to type-check?
- How to cater for polymorphism?
- What is your definition of compatible type?
- When to perform type checking?
- Is your language strongly or weakly typed?
472.5 Strong Type Systems
- A programming language is defined to be strongly
typed if type errors are always detected
STATICALLY. - A language with a strong-type system only allows
type-safe programs to be successfully compiled
into executables. (Otherwise, language is said to
have a weak type system). - Programs of strong-type systems are guaranteed to
be executed without type-error. (The only error
left to contend with is logic error).
482.5 Strong Type Systems
Fortran
Ada
Modula-3
C, C
Java
Pascal
492.5 Weak-Type Systems Variant Recs
- Variant Records in C (via union keyword)
compromises Type Safety - ...
- typedef union int X
- float Y
- char Z4 B
- ...
- B P
- Variant part all have overlapping (same)
L-value!!! - Problems can occur. What happens to the code
below? - P.X 142
- printf(O\n, P.Z3)
- All 3 data objects have same L-value and occupy
same storage. No enforcement of type checking.
? Poor language and type system design
502.5 Weak-Type Systems Variant Recs
- Variant Records in Pascal tries to overcome Cs
deficiency. They have a tagged union type. - type whichtype (inttype, realtype)
- type uniontype record
- case V whichtype of
- inttype (X integer)
- realtype (Y real)
- end
- But the compiler usually doesnt check the
consistency between the variant and the tag. So
we can subvert the tagged field - var P uniontype
- P.V inttype
- P.X 142
- P.V realtype // type safety compromised