Combinatorics - PowerPoint PPT Presentation

1 / 13
About This Presentation
Title:

Combinatorics

Description:

Combinatorics (Important to algorithm analysis ) Problem I: How many N-bit strings contain at least 1 zero? Problem II: How many N-bit strings contain more than 1 zero? – PowerPoint PPT presentation

Number of Views:89
Avg rating:3.0/5.0
Slides: 14
Provided by: cliffs5
Learn more at: http://courses.cs.vt.edu
Category:

less

Transcript and Presenter's Notes

Title: Combinatorics


1
Combinatorics
  • (Important to algorithm analysis )

Problem I How many N-bit strings contain at
least 1 zero? Problem II How many N-bit
strings contain more than 1 zero?
2
Which areas of CS need all this?
  1. Information Theory
  2. Data Storage, Retrieval
  3. Data Transmission
  4. Encoding

3
Information (Shannon) Entropy
quantifies, in the sense of an expected value,
the information contained in a message. Example
1 A fair coin has an entropy of 1 bit. If the
coin is not fair, then the uncertainty is lower
(if asked to bet on the next outcome, we would
bet preferentially on the most frequent result)
gt the Shannon entropy is lower than 1. Example
2 A long string of repeating characters
S0 Example 3 English text S 0.6 to 1.3
The source coding theorem as the length of a
stream of independent and identically-distribute
d random variable data tends to infinity, it is
impossible to compress the data such that the
code rate (average number of bits per symbol) is
less than the Shannon entropy of the source,
without information loss.
4
Information (Shannon) Entropy Contd
The source coding theorem It is impossible to
compress the data such that the code rate
(average number of bits per symbol) is less
than the Shannon entropy of the source, without
information loss. (works in the limit of large
length of a stream of independent and
identically-distributed random variable data )
5
Combinatorics contd
Problem Random Play on your I-Touch works
like this. When pressed once, it plays a random
song from your library of N songs. When pressed
again, it excludes the song just played from the
library, and draws another song at random from
the remaining N-1 songs. Suppose you have
pressed Random Play k times. What is the
probability you will have heard your one most
favorite song?
6
Combinatorics contd
  • The Random Play problem.
  • Tactics. Get your hands dirty.
  • P(1) 1/N. P(2) ? Need to be careful what if
    the song has not played 1st? What if it has?
    Becomes complicated for large N.
  • Table. Press (k) vs. P.
    1 1/N
    2 1/(N-1) k
    1/(N-k 1)
  • Key Tactics find complimentary probability
    P(not) (1 - 1/N)(1 - 1/(N-1))(1- 1/(N-k 1))
    then P(k) 1 - P(not).
  • Re-arrange. P(not) (N-1)/N (N-2)/ (N-1)
    (N-3)/(N-2)(N-k)/(N-k1) (N-k)/N. Thus, P
    1 - P(not) k/N.
  • If you try to guess the solution, make sure your
    guess works for simple cases when the answer is
    obvious. E.g. k1, kN. Also, P lt 1.
  • The very simple answer suggests that a simpler
    solution may be possible. Can you find it?

7
Data Compression
Lossless Compression 25.888888888 gt
25.98 Lossy Compression 25.888888888 gt
26 Lossless compression exploit statistical
redundancy. For example, In English, letter e
is common, but z is not. And you never have
a q followed by a z. Drawback not
universal. If no pattern - no compression.
Lossy compression (e.g. JPEG images).
8
Combinatorics Contd
Problem DNA sequence contains only 4 letters
(A,T,G and C). Short words made of K
consecutive letters are the genetic code. Each
word (called codon, ) codes for a specific
amino-acid in proteins. There are a total of 20
biologically relevant amino-acids. Prove that
genetic code based on fixed K is degenerate,
that is there are amino-acids which are coded
for by more than one word. It is assumed that
every word codes for an amino-acid
9
Permutations
  • How many three digit numbers are there if you
    cannot use a number more that once?

10
P(n,r)
  • The number of ways a subset of r elements can be
    chosen from a set of n elements is given by

11
Theorem
  • Suppose we have n objects of k different types,
    with nk identical objects of the kth type. Then
    the number of distinct arrangements of those n
    objects is equal to

12
Combinatorics Contd
  • The Mississippi formula.

Example What is the number of letter
permutations of the word BOOBOO?
6!/(2! 4!)
13
Permutations and Combinations with Repetitions
  • How many distinct arrangements are there of the
    letters in the word MISSISSIPPI?
Write a Comment
User Comments (0)
About PowerShow.com