Computing for Bioinformatics Introduction to databases - PowerPoint PPT Presentation

1 / 14
About This Presentation
Title:

Computing for Bioinformatics Introduction to databases

Description:

data collected for one kind of analysis. Why are these things not databases? ... What are they used for? Uses of databases in biodiversity: data about species ... – PowerPoint PPT presentation

Number of Views:33
Avg rating:3.0/5.0
Slides: 15
Provided by: richar487
Category:

less

Transcript and Presenter's Notes

Title: Computing for Bioinformatics Introduction to databases


1
Computing for BioinformaticsIntroduction to
databases
  • What is a database?
  • Database system components
  • Data types
  • DBMS architectures
  • DBMS systems available
  • Microsoft Access

2
What is a database?
  • We meet them everywhere. Everyone has a vague
    idea of what a database is.
  • What they are not
  • spreadsheets
  • What they are
  • What are they used for?

3
What databases are not
  • unstructured piles of data (including heaps of
    web pages)
  • spreadsheets such as Excel tables
  • text files with neatly tabulated data
  • data collected for one kind of analysis
  • Why are these things not databases?

4
Spreadsheets versus databases (1)
  • A spreadsheet is typically viewed as an entire
    table of cells which may contain
  • numbers (data)
  • text (labels)
  • formulae (calculations producing results)
  • A database may be structured in various ways,
    usually so that a small subset of the data is
    presented as the result of a search

5
Spreadsheets - databases (2)
  • Spreadsheets
  • Can be used immediately with little preparation
    (or thought)
  • Data is visible
  • Data entry is simple
  • Databases
  • Require planning
  • Data is hidden
  • May require a program to help you enter or
    retrieve data

6
Spreadsheets - databases (3)
  • Spreadsheets
  • Little checking is carried out
  • Tables and graphs can be produced
  • Single user
  • Databases
  • Extensive integrity checks can be arranged
  • Reports can be programmed
  • Searches can be made
  • Can be multi-user
  • Can be put on the Web

7
What a database is
  • Data is stored separately from any application
    programs which might use it
  • Multiple uses of the data are envisaged
  • Designed for retrieval in various anticipated and
    unanticipated forms

8
What are they used for?
  • Uses of databases in biodiversity
  • data about species
  • data about biological specimens
  • data about areas, places, sampling sites, etc.
    (sometimes stored in Geographical Information
    Systems (GIS)
  • Uses of databases in bioinformatics
  • molecular and sequence databases
  • gene frequencies, gene expression data, etc.

9
DBMS types (database internal structure)
  • What are the main types of database design? (The
    internal mechanics, not the bioinformatics or
    biodiversity information or the appearance of the
    database as seen by the user.)
  • Free text - records not divided into fields
  • Flat-file - records have fields (one table with
    columns like a spreadsheet), common and easy to
    understand, often inefficient
  • Hierarchical, Network - now obsolete
  • Relational - several tables, usually the choice
    of the professional
  • Object-oriented - for the adventurous enthusiast

10
Database system components (1)
  • A database management system (DBMS) has the
    following essential components
  • Data tables (the data itself)
  • Database engine (stores data to and retrieves
    data from the tables)
  • User interface (for humans to enter, view and
    edit data)
  • Some commercial general-purpose DBMSs, such as
    Microsoft Access, make the engine and the
    interface appear as one

11
Stand-alone and client-server systems
  • Some database systems are integrated
    (stand-alone) the engine and the interface
    are combined (MS Access)
  • the data may also be on the same machine
  • Client-server database systems put the data
    tables and the storage engine on a remote
    server computer
  • the user accesses the remote database server
    using a local database client program

12
Accessing the data in the database
  • A user can use a built-in user interface to
    search, edit, etc. (e.g. in Microsoft Access)
  • A user can use a third-party client program,
    especially in the case of client-server systems
    such as MySQL, Oracle, etc.
  • Such clients often use the SQL language
    (pronounced either ess-cue-ell or sequel) as
    a (fairly) standard way to formulate search
    requests, data editing instructions, etc.
  • Special-purpose programs may also be written (in
    Perl, Java, PHP, etc.) to perform such access,
    also using embedded SQL

13
Smallest ever guide to SQL
  • Database table definition column names, data
    types, indexes, etc.
  • Data retrieval is based on the idea of selecting
    columns and rows to obtain a subset of a larger
    stored table, e.g.
  • SELECT name, salary FROM Employee WHERE name LIKE
    Smith
  • Data may be retrieved from linked tables using
    joins
  • Data records may be inserted, altered or deleted

14
Database system components (2)
  • A DBMS usually also includes or requires
  • Database drivers, import export modules, etc.
    (for programs to store, retrieve and alter data)
  • Application programs (using the above to connect
    with the data and do useful things with it,
    sometimes called business logic may be
    general-purpose or specialised)
  • Report writer (a specialised application program)
  • Utilities (for back-ups, integrity checking, etc.)
Write a Comment
User Comments (0)
About PowerShow.com