Title: Embedded%20Signal%20Processing%20Laboratory%20at%20UT%20Austin
1Embedded Signal Processing Laboratory at UT Austin
- Prof. Brian L. Evans
- Dept. of Electrical and Computer Eng.
- The University of Texas at Austin
- http//www.ece.utexas.edu/bevans
2Got to Texas as Fast I Could
- BSEE/CS Rose-Hulman 1987
- MSEE Georgia Tech 1988
- PhDEE Georgia Tech 1993
- Post-Doc UC Berkeley 1993-1996
- Very happy to land in Austin in Fall 1996 at
Rambling Wreck
Cal
3Summary of Previous NI Interaction
- NI Support Prior to Fall 2002
- Funding for Babar Ahmed (undergraduate)
- Real-Time DSP Lab alumni who went to NI
- Prethi Gopinath, Newton Petersen, Junichi Suguira
- NI employees in Embedded Software Systems
- Hugo Andrade, Scott Kovner, Sadia Malik,Kurt
Nee, Newton Petersen, Ram Rajagopal,Michael
Schaeffer
4Outline
- Real-Time Digital Signal Processing Lab
- Programmable Digital Signal Processors
- Future Uses of LabVIEW
- Embedded Software Systems graduate course
- Electronic Design Automation Tools
- Interaction with National Instruments
- Research Group (Embedded Signal Proc. Lab)
- Common Themes
- ADSL Transceiver Design
5Real-Time DSP Lab
- Introduced Fall 1997 384 served
- Digital signal processing theory/algorithms
- Digital communication systems
- Digital signal processor architecture
- Deliverable Voiceband modem
- Design of sinusoidal generators, filters, etc.
- Implementation in C/assembly on TI floating-point
TMS320C6700 DSP using Code Composer Studio - Test implementation with spectrum analyzers, etc.
6Digital Signal Processors (DSPs)
- For real time (guaranteed delivery)
- Fixed-point DSPs for high-volume products
- Battery-powered cell phones, dial-up modems,
portable MP3 players, digital still cameras, and
digital video (e.g. TI C5000) - Wall-powered ADSL modems, VDSL modems, cell
phone basestations, modem banks, laser printers,
video conferencing systems (e.g. TI 6200, C6400) - Floating-point DSPs for low-volume products and
feasibility analysis on fixed-point DSPs - TI 45, Agere 25, Mot 10, 8 Analog
7Digital Signal Processor Architecture
- Harvard architecture program/data memory
separated and can be accessed on same cycle - Word size 16, 20, 24, or 32 bits
- Programmer must manage memory
- 32-128 kwords data/program on chip
- On-chip data cache rare (TI C6000)
- No support for virtual memory
- Predictable input/output deterministic interrupt
service routine latency (e.g. 11 cycles on TI
C6000)
8Digital Signal Processor Architecture
- Deterministic, no-overhead looping
- Single instruction cycle multiply unit(s)
- No-overhead addressing modes in hardware
- Modulo addressing for circular buffers, e.g.
filters - Bit-reversed addressing, e.g. fast Fourier
transforms (not available on TI C6000) - Native number formats
- Integer binary point on far right of bit pattern
- Fractional binary point just right of sign bit
- Floating-point could emulate on fixed-point DSPs
9Drawbacks to Programming DSPs
- General drawbacks
- Limited on-chip memory
- Poor C compiler performance
- Fixed-point issues
- Non-standard C extensions for fractional data
- Converting floating-point programs to fixed-point
- Manual tracking of binary point prone to error
- Conventional DSPs
- No byte addressing (needed for image/video)
- Limited addressable memory on fixed-point DSPs
10LabVIEW for Real-Time DSP Lab
- Students use LabVIEW in a pre-requisite
- Fall 2003 System-level representation
- In the first lab, students are given a LabVIEW
simulation of voiceband modem running on PC - In each subsequent lab, students substitute the
subsystem implemented on the DSP in the LabVIEW
simulation to test the design - Future Synthesis vs. handcoding
- In each lab, students use the LabVIEW modem
simulation to synthesize the subsystem being
designed and compare their handcode with it
11Embedded Software Systems
- Introduced in Spring 1997 87 served
- Modern methods for specifying, simulating, and
synthesizing embedded systems - Programming languages Concurrency
- Dataflow models Process network
- Scheduling Software synthesis
- Discrete-event models Cosimulation
- Students evaluate/build system designs in
- Ptolemy from UC Berkeley
- Advanced Design System from Agilent
12Dataflow Models
- Examples in modern design automation tools
13Synchronous Dataflow Lee 1986
- Arcs one-way first-in first-out queues
- A block is enabled for execution when enough
tokens are available on all inputs - Source blocks are always enabled
- When block executes, it always produces and
consumes the same fixed amount of tokens - Consumed data is dequeued from arc
- Flow of data through graph may not depend on
values of data - Delay is a property of an arc
- Delay of n samples means that n tokens are
initially in the queue of that arc
14Synchronous Dataflow
- Systems are determinate
- History of tokens produced on communication
channels do not depend on the execution order - May be executed sequentially or in parallel with
the same outcome - Scheduling
- Load balancing to make sure that all tokens
produced can be consumed linear complexity - Find a periodic schedule
- List scheduling worst-case is exponential
complexity - Heuristics to minimize buffer size cubic
complexity
15Synchronous Datalflow Modeling
- Signal Processing
- Finite impulse response filters
- Infinite impulse response filters
- Fast Fourier transform
- Multirate systems and filter banks
- Communication Systems
- Sinusoidal modulation and demodulation
- Pulse shapers
- Transmission subsystem
- Inappropriate for data-dependent graphs, e.g.
baud rate negotiation at modem startup
16Process Network Kahn 1974
- A set of concurrent processes that communicate
through network of one-way infinite first-in
first-out (FIFO) queues - Reads from queues are blocking
- If the queue is empty, the process will suspend
until there is enough data in the queue. - When a process blocks, the scheduler will not run
the process until enough data becomes available. - Writes to the queues are non-blocking
17Process Network
- A process is either enabled or blocked waiting
for data on only one of its input channels - Systems are determinate
- History of tokens produced on communication
channels do not depend on the execution order - May be executed sequentially or in parallel with
the same outcome - Supports recurrence and recursion
- Formal mathematical representation processes are
functions that map streams into streams
18Process Network
- Turing complete questions of termination and
bounded buffering are undecidable - Undecidable (in finite time) if process network
- Terminates
- Requires bounded memory
- Signal processing run for infinite time
- Scheduler can find a bounded memory solution
using infinite time Parks 1995 - Ptolemy Process Network domain
- UT Austin Computational Process Network framework
in C
http//www.ece.utexas.edu/allen/PNSourceCode/
19NIers in Embedded Software Systems
- Hugo Andrade and Scott Kovner, 1998,
- Software Synthesis from Dataflow Models for
Embedded Software Design in the G Programming
Language and the LabVIEW Development Environment - Kurt Nee (with Chad Roesle), 1999,
- Feasability of Implementating an H.263 Decoder
on a TMS320C6x Digital Signal Processor
20NIers in Embedded Software Systems
- Michael Schaeffer, 1999,
- An Extension to the Foundation Fieldbus Model
for Specifying Process Control Strategies - Sadia Malik and Ram Rajagopal, 2000,
- LabVIEW Based Embedded Design
- Newton Petersen (with Martin Wojcik), 2000,
- Node Prefetch Prediction in Dataflow Graphs
21Prof. Brian L. Evans
http//signal.ece.utexas.edu
Real-Time Imaging
ADSL/VDSL Transceiver Design
Ph.D. graduates Thomas D. Kite (Audio
Precision) Niranjan
Damera-Venkata (HP Labs)Ph.D. students
Gregory E. Allen (UT Applied Research Labs)
Serene BanerjeeMS
graduates Young Cho (UCLA)MS students
Vishal Monga
Ph.D. graduates Güner Arslan (Cicada)
Biao Lu (Schlumberger)Ph.D.
students Dogu Arifler
Ming Ding Milos
Milosevic (Schlumberger)
Wireless Communications
Ph.D. graduates Murat Torlak (UT Dallas)Ph.D.
student Kyungtae Han MS graduates
Srikanth K. Gummadi (TI)
Amey A. Deosthali (TI)MS students
Zukang Shen Ian Wong
Image Analysis
Ph.D. graduates Dong Wei (SBC Research)
K. Clint Slatton (UT Center
for Space Research)
Wade C. Schwartzkopf
Wireless Networking and Comm. Group
http//www.wncg.org
Center for Perceptual Systems http//www.cps.utex
as.edu
22Common Themes
- Find or derive optimal algorithm
- Develop low-complexity algorithms(bottom-up
design) - Keep in mind that these algorithms will
ultimately be realized in real time on a
fixed-point DSP - Algorithms should be statically scheduled
- Evaluate performance-implementation tradeoff
- System-level design (top-down design)
- Dataflow modeling for synthesis
- Simulate system to validate algorithm
- Software releases
23ADSL Transceiver Design
- Asymmetric Digital Subscriber Line modem
- Line driver (single chip)
- Transceiver analog front end digital baseband
- Sampling rate 2.208 Mbps (real time)
- Bit error rate 10-7 (Reed-Solomon codes)
- Symbol rate 4,000 symbols/s
- Frame is symbol plus redundant information
- Single frame transmission (low delay)
- Proper equalizer design can double bit rate
24Digital Subscriber Line (DSL)Broadband Access
Internet
DSLAM
Central Office
DSL modem
DSL modem
Voice Switch
LPF
LPF
Customer Premises
DSLAM - DSL Access Multiplexer LPF Low Pass
Filter
Telephone Network
25Discrete Multitone (DMT) Standards
- ADSL Asymmetric DSL (G.DMT Standard)
- Echo cancelled no longer deployed in central
office - Frequency division multiplexing max. data
rates13.38 Mbps downstream, 1.56 Mbps
upstream - ADSLcable modem 12 in US 51 non-US
- DMT VDSL Very HighRate DSL (Proposed)
- Faster G.DMT ADSL
- Freq. division multiplex
- 2m subcarriers m ? 8, 12
26Multicarrier Modulation
- Divide channel into narrowband subchannels
- No inter-symbol interference(ISI) if constant
gain in everysubchannel and ideal sampling - Discrete multitone modulation
- Based on fast Fourier transform (FFT)
channel
carrier
magnitude
subchannel
frequency
Subchannels are 4.3 kHz wide in ADSL and DMT VDSL
27Discrete Multitone Modulation Symbol
- Subsymbols are complex-valued
- ADSL training uses 4-level QuadratureAmplitude
Modulation (QAM) - ADSL uses QAM of 22, 23, 24, , 215levels during
data transmission
Quadrature
In-phase
QAM
28Discrete Multitone Modulation Frame
- Frame through D/A converter and transmitted
- Frame is the symbol with cyclic prefix prepended
- Cyclic prefix (CP) is last n samples of symbol
- Linear convolution of framew/ channel impulse
response - Is circular convolution if channellength is CP
length plus one or shorter - If circular, frequency equalization in FFT domain
copy
copy
s y m b o l i1
CP
CP
s y m b o l i
N samples
v samples
29Eliminating Inter-Symbol Interference
- Time domain equalizer (TEQ)
- Finite impulse response (FIR) filter
- Effective channel impulse responseconvolution
of TEQ impulse responsewith channel impulse
response - Frequency domain equalizer (FEQ)
- Compensates magnitude and phasedistortion of
channel TEQ by dividingeach FFT coefficient by
complex number - ADSL G.DMT equalizer training
- Reverb same symbol sent 1,024 to 1,536 times
- Medley aperiodic sequence of 16,384 symbols
- At 0.25 s after medley, receiver returns
numberof bits on each subcarrier that can be
supported
30ADSL Transceiver Data Transmission
N/2 subchannels
N real samples
S/P
quadrature amplitude modulation (QAM) encoder
mirror data and N-IFFT
add cyclic prefix
P/S
D/A transmit filter
Bits
00110
TRANSMITTER
channel
RECEIVER
N real samples
N/2 subchannels
P/S
time domain equalizer (FIR filter)
QAM demod decoder
N-FFT and remove mirrored data
S/P
remove cyclic prefix
receive filter A/D
invert channel frequency domain equalizer
31Simulation Results for 17-Tap TEQ
Cyclic prefix length 32 FFT size (N) 512 Coding
gain 4.2 dB Margin 6 dB
Input power 23 dBm Noise power -140
dBm/Hz Crosstalk noise 8 ADSL disturbers POTS
splitter 5th order Chebyshev
32Simulation Results for 3-Tap TEQ
Cyclic prefix length 32 FFT size (N) 512 Coding
gain 4.2 dB Margin 6 dB
Input power 23 dBm Noise power -140
dBm/Hz Crosstalk noise 8 ADSL disturbers POTS
splitter 5th order Chebyshev
33Contributions by Research Group
- New time-domain equalizer design methods
- Maximum Bit Rate method maximizes bit rate (upper
bound) - Minimum Inter-Symbol Interference
method(real-time, fixed-point) - Minimum Inter-Symbol Interference TEQ design
method - Reduces number of TEQ taps by a factor of ten
over Minimum Mean Squared Error method for the
same bit rate in discretized simulation - Implemented in real-time on Motorola 56000, TI
TMS320C6200 and TI TMS320C5000 DSPs
http//www.ece.utexas.edu/bevans/projects/adsl
34Matlab DMT TEQ Design Toolbox 3.1
- FIR, dual-path, per-tone filter bank
equalizers http//www.ece.utexas.edu/bevans/proj
ects/adsl/dmtteq/
default parameters from G.DMT ADSL standard
different graphical views
variousperformance measures
35Future Interaction with NI
- Integrate LabVIEW into Real-Time DSP Lab to
reinforce modem system being designed - Add lecture on LabVIEW computational model in
Embedded Software Systems course - Discuss ideas for extensions to LabVIEW for
synthesis onto programmable DSPs - Evaluate restrictions and extensions to the G
language for synthesis - Investigate methods for conversion of
floating-point source code to fixed-point