An Application of Reinforcement Learning to Aerobatic Helicopter - PowerPoint PPT Presentation

About This Presentation
Title:

An Application of Reinforcement Learning to Aerobatic Helicopter

Description:

... learned based on a model so you would need to create a new ... Videos: http://heli.stanford.edu/ http://www.youtube.com/watch?v=VCdxqn0fcnE. Helicopter ... – PowerPoint PPT presentation

Number of Views:91
Avg rating:3.0/5.0
Slides: 18
Provided by: MohanSr
Category:

less

Transcript and Presenter's Notes

Title: An Application of Reinforcement Learning to Aerobatic Helicopter


1
An Application of Reinforcement Learning to
Aerobatic Helicopter
  • Greg McChesney
  • Texas Tech University
  • Greg.mcchesney_at_ttu.edu

2
Overview
  • Creating a robot that can fly autonomously
  • Software developed at Stanford as part of their
    AI lab
  • This paper is slightly outdated as many new
    maneuvers have been created.

3
Learning Approach
  • Apprenticeship
  • Collect data from human trying maneuver (multiple
    times)
  • Learn a model from the data
  • Find controller than can simulate based on model
  • Test on helicopter (pray it doesnt crash)

4
Helicopters State
  • Position
  • Velocity
  • Angular Velocity
  • Controlled with 4 dimensions
  • Cyclic pitch
  • Tail rotor
  • Take gravity out when calculating the model

5
Controller Design
  • Use a Markov decision process
  • Sextuple (S,A,T,H,s(0),R)
  • S-set of states
  • A-set of actions (inputs)
  • T-dynamic model-set of probability distributions
    for the next state
  • H-horizon or number of time steps of interest
  • s(0)-initial state
  • R-reward function

6
Differential Dynamic Programming(DDP)
  • Compute the linear approximation
  • Compute the optimal solution to the linear
    quadratic regulator
  • Must take into account error state
  • Cost for change in input-needed in real testing

7
DDP-Continued
  • 2 phases
  • DDP to find open loop input sequence
  • Use DDP again refining the inputs as a deviation
    from the nominal open-loop input sequence
  • Integral control-take into account wind and
    errors in the model

8
Rewards
  • 24 features
  • Used inverse reinforcement learning
  • Rewards from inverse reinforcement usually did
    not produce correct result
  • Took inverse results and manually tuned them to
    get good results

9
Helicopter
  • Xcell Tempest
  • 54 long
  • 19 high
  • 13 lbs
  • Two-stroke engine
  • Orientation sensors
  • GPS-doesnt work during flips

10
(No Transcript)
11
Flip
12
Roll
13
Tail-In Funnel
14
Nose-In Funnel
15
Questions
  • Motivations/Who pays for it
  • I can see applications in the defense sector
  • DARPA
  • Could more maneuvers be done just by changing
    some parameters?
  • Probably not because the filter is learned based
    on a model so you would need to create a new model

16
More Questions
  • What's the relationship between reinforcement
    learning and MDP?
  • Not Sure
  • Could a helicopter like this operate in the West
    Texas wind storms?

17
Fun Stuff
  • Videos
  • http//heli.stanford.edu/
  • http//www.youtube.com/watch?vVCdxqn0fcnE
  • Helicopter
  • http//www.miniatureaircraftusa.com/helicopterkits
    /1025_Spectra_G/1025_kit_main.asp
Write a Comment
User Comments (0)
About PowerShow.com