Title: Software for data management: The contribution of Stata
1Software for data management The contribution of
Stata
- Dr Karen Robson, Senior Research Fellow, The
Geary Institute, University College Dublin,
Ireland
2Getting acquainted with Stata
- StataCorp develops and distributes Stata,
software for statistical analysis. - Stata is available for Windows, Macintosh, and
Unix computers. - Stata is used by medical researchers,
biostatisticians, epidemiologists, economists,
sociologists, political scientists, geographers,
psychologists, social scientists, and other
research professionals needing to analyze data.
Gaining popularity in the social and medical
sciences - Particularly useful for handling large-scale
longitudinal data
3Stata SE (for large data sets)
- can analyze datasets with as many as 32,766
variables, and the only limit on observations is
the amount of RAM on your computer - can handle string variables with a maximum length
of 244 characters - can handle matrices up to 11,000 x 11,000.
- requires at least 512 megabytes of RAM and 80
megabytes of disk space
4Stata/Intercooled (the standard one)
- can analyze datasets with as many as 2,047
variables, and the only limit on observations is
the amount of RAM on your computer - can handle string variables with a maximum length
of 244 characters - can handle matrices up to 800 x 800.
5Small Stata
- A smaller, student version of Stata (for
educational purchases only)
6Stata MP
- The fastest version of Stata (for dual-core and
multicore/multiprocessor computers) - Stata/MP is the fastest and largest version of
Stata.
7Resources
- StataCorp website (www.stata.com)
8(No Transcript)
9Resources
- StataCorp website (www.stata.com)
- Timberlake website (www.timberlake.co.uk)
10(No Transcript)
11Resources
- StataCorp website (www.stata.com)
- Timberlake website (www.timberlake.co.uk)
- UCLA Stata portal (http//www.ats.ucla.edu/stat/
)
12(No Transcript)
13Resources
- StataCorp website (www.stata.com)
- Timberlake website (www.timberlake.co.uk)
- UCLA Stata portal (statcomp.ats.ucla.edu/stata)
- Statalist (www.hsph.harvard.edu/statalist)
14(No Transcript)
15Resources
- StataCorp website (www.stata.com)
- Timberlake website (www.timberlake.co.uk)
- UCLA Stata portal (statcomp.ats.ucla.edu/stata)
- Statalist (www.hsph.harvard.edu/statalist)
- Stata Journal (www.stata-journal.com)
16(No Transcript)
17(No Transcript)
18As well, available Dec 2008
19Launching Stata
- OS contingent
- Default window preferences
- Window preferences fully adjustable
- Auto memory set
20(No Transcript)
21(No Transcript)
22(No Transcript)
23Comparing with SPSS
24(No Transcript)
25(No Transcript)
26Comparing with SPSS
- Start up differences
- With data file open
27(No Transcript)
28(No Transcript)
29Comparing with SPSS
- Start up differences
- With data file open
- Viewing data
- data viewer, data editor
30(No Transcript)
31Comparing with SPSS
- Start up differences
- With data file open
- Viewing data
- data viewer, data editor
- Viewing variables
32(No Transcript)
33Comparing with SPSS
- Start up differences
- With data file open
- Viewing data
- data viewer, data editor
- Viewing variables
- Viewing output/commands
- output window buffer, log files
34(No Transcript)
35(No Transcript)
36Comparing with SPSS
- Start up differences
- With data file open
- Viewing data
- data viewer, data editor
- Viewing variables
- Viewing output/commands
- output window buffer, log files
- Syntax and do files
37(No Transcript)
38Variable window
INPUT Stata command window Do file Pull-down menu
Review window
Computation
RESULTS Output window Log file
39Advantages and disadvantages of Stata
- User driven
- Free STBs
- Dedicated journal
- Web active
- Memory requirements
- Backward compatible
- Change!
- SPSS dominance
- Orientated to writing syntax/code
- Pull-down windows debate! Now in version 8 forward
40Advantages and disadvantages of Stata
- Easier code
- Easier data handling
- Clarity of operations/ feedback
- Results table function
- Before version 8, limited graphics
- Now, complex graphics
- Variable labelling
- Editing of output
41Advantages and disadvantages of Stata
- Nested/master do files
- Flexible terminology
- Setting types of data
- Interactive help
- Switch output (log file) on/off
42(No Transcript)
43Overview of analytic techniques
- Too numerous to mention!
- Comprehensive manuals
- A selection
- All types of regression
- Survey package
- Epidemiological package
- Multilevel modelling
- Time series functions
- Cluster analysis
44Data
- Data files .dta
- Stat/Transfer software
45(No Transcript)
46Stata using wide and long file formats
- Wide file formats (everything you add goes to the
right of the existing data) - Long file formats (everything you add goes
underneath the existing data)
47(No Transcript)
48(No Transcript)