Data analysis using stata

Select an Action

Title:

Author:

Kohler, Ulrich Dr. phil.

ISBN:

9781597181105

Personal Author:

Kohler, Ulrich Dr. phil.

Edition:

3rd ed.

Publication Information:

College Station, Tex. : Stata Press, c2012.

Physical Description:

xxvi, 497 p. : ill. ; 24 cm.

Contents:

Machine generated contents note: 1.The first time -- 1.1.Starting Stata -- 1.2.Setting up your screen -- 1.3.Your first analysis -- 1.3.1.Inputting commands -- 1.3.2.Files and the working memory -- 1.3.3.Loading data -- 1.3.4.Variables and observations -- 1.3.5.Looking at data -- 1.3.6.Interrupting a command and repeating a command -- 1.3.7.The variable list -- 1.3.8.The in qualifier -- 1.3.9.Summary statistics -- 1.3.10.The if qualifier -- 1.3.11.Defining missing values -- 1.3.12.The by prefix -- 1.3.13.Command options -- 1.3.14.Frequency tables -- 1.3.15.Graphs -- 1.3.16.Getting help -- 1.3.17.Recoding variables -- 1.3.18.Variable labels and value labels -- 1.3.19.Linear regression -- 1.4.Do-files -- 1.5.Exiting Stata -- 1.6.Exercises -- 2.Working with do-files -- 2.1.From interactive work to working with a do-file -- 2.1.1.Alternative 1 -- 2.1.2.Alternative 2 -- 2.2.Designing do-files -- 2.2.1.Comments -- 2.2.2.Line breaks -- 2.2.3.Some crucial commands --

Contents note continued: 2.3.Organizing your work -- 2.4.Exercises -- 3.The grammar of Stata -- 3.1.The elements of Stata commands -- 3.1.1.Stata commands -- 3.1.2.The variable list -- List of variables: Required or optional -- Abbreviation rules -- Special listings -- 3.1.3.Options -- 3.1.4.The in qualifier -- 3.1.5.The if qualifier -- 3.1.6.Expressions -- Operators -- Functions -- 3.1.7.Lists of numbers -- 3.1.8.Using filenames -- 3.2.Repeating similar commands -- 3.2.1.The by prefix -- 3.2.2.The {breach loop -- The types of foreach lists -- Several commands within a foreach loop -- 3.2.3.The forvalues loop -- 3.3.Weights -- Frequency weights -- Analytic weights -- Sampling weights -- 3.4.Exercises -- 4.General comments on the statistical commands -- 4.1.Regular statistical commands -- 4.2.Estimation commands -- 4.3.Exercises -- 5.Creating and changing variables -- 5.1.The commands generate and replace -- 5.1.1.Variable names -- 5.1.2.Some examples -- 5.1.3.Useful functions --

Contents note continued: 5.1.4.Changing codes with by, _n, and _N -- 5.1.5.Subscripts -- 5.2.Specialized recoding commands -- 5.2.1.The recode command -- 5.2.2.The egen command -- 5.3.Recoding string variables -- 5.4.Recoding date and time -- 5.4.1.Dates -- 5.4.2.Time -- 5.5.Setting missing values -- 5.5.Labels -- 5.7.Storage types, or the ghost in the machine -- 5.8.Exercises -- 6.Creating and changing graphs -- 6.1.A primer on graph syntax -- 6.2.Graph types -- 6.2.1.Examples -- 6.2.2.Specialized graphs -- 6.3.Graph elements -- 6.3.1.Appearance of data -- Choice of marker -- Marker colors -- Marker size -- Lines -- 6.3.2.Graph and plot regions -- Graph size -- Plot region -- Sealing the axes -- 6.3.3.Information inside the plot region -- Reference lines -- Labeling inside the plot region -- 6.3.4.Information outside the plot region -- Labeling the axes -- Tick lines -- Axis titles -- The legend -- Graph titles -- 6.4.Multiple graphs -- 6.4.1.Overlaying many twoway graphs --

Contents note continued: 6.4.2.Option by() -- 6.4.3.Combining graphs -- 6.5.Saving and printing graphs -- 6.6.Exercises -- 7.Describing and comparing distributions -- 7.1.Categories: Few or many? -- 7.2.Variables with few categories -- 7.2.1.Tables -- Frequency tables -- More than one frequency table -- Comparing distributions -- Summary statistics -- More than one contingency table -- 7.2.2.Graphs -- Histograms -- Bar charts -- Pie charts -- Dot charts -- 7.3.Variables with many categories -- 7.3.1.Frequencies of grouped data -- Some remarks on grouping data -- Special techniques for grouping data -- 7.3.2.Describing data using statistics -- Important summary statistics -- The summarize command -- The tabstat command -- Comparing distributions using statistics -- 7.3.3.Graphs -- Box plots -- Histograms -- Kernel density estimation -- Quantile plot -- Comparing distributions with Q-Q plots -- 7.4.Exercises -- 8.Statistical inference --

Contents note continued: 8.1.Random samples and sampling distributions -- 8.1.1.Random numbers -- 8.1.2.Creating fictitious datasets -- 8.1.3.Drawing random samples -- 8.1.4.The sampling distribution -- 8.2.Descriptive inference -- 8.2.1.Standard errors for simple random samples -- 8.2.2.Standard errors for complex samples -- Typical forms of complex samples -- Sampling distributions for complex samples -- Using Stata's svy commands -- 8.2.3.Standard errors with nonresponse -- Unit; nonresponse and poststratification weights -- Item nonresponse and multiple imputation -- 8.2.4.Uses of standard errors -- Confidence intervals -- Significance tests -- Two-group mean comparison test -- 8.3.Causal inference -- 8.3.1.Basic concepts -- Data-generating processes -- Counterfactual concept of causality -- 8.3.2.The effect of third-class tickets -- 8.3.3.Some problems of causal inference -- 8.4.Exercises -- 9.Introduction to linear regression -- 9.1.Simple linear regression --

Contents note continued: 9.1.1.The basic principle -- 9.1.2.Linear regression using Stata -- The table of coefficients -- The table of ANOVA results -- The model fit table -- 9.2.Multiple regression -- 9.2.1.Multiple regression using Stata -- 9.2.2.More computations -- Adjusted R2 -- Standardized regression coefficients -- 9.2.3.What does "under control" mean? -- 9.3.Regression diagnostics -- 9.3.1.Violation of E(εi) = 0 -- Linearity -- Influential cases -- Omitted variables -- Multicollinearity -- 9.3.2.Violation of Var(εi) = α2 -- 9.3.3.Violation of Cov(εi, εj) = 0, i [≠] j -- 9.4.Model extensions -- 9.4.1.Categorical independent variables -- 9.4.2.Interaction terms -- 9.4.3.Regression models using transformed variables -- Nonlinear relationships -- Eliminating heteroskedasticity -- 9.5.Reporting regression results -- 9.5.1.Tables of similar regression models -- 9.5.2.Plots of coefficients -- 9.5.3.Conditional-effects plots --

Contents note continued: 9.6.Advanced techniques -- 9.6.1.Median regression -- 9.6.2.Regression models for panel data -- From wide to long format -- Fixed-effects models -- 9.6.3.Error-components models -- 9.7.Exercises -- 10.Regression models for categorical dependent variables -- 10.1.The linear probability model -- 10.2.Basic concepts -- 10.2.1.Odds, log odds, and odds ratios -- 10.2.2.Excursion: The maximum likelihood principle -- 10.3.Logistic regression with Stata -- 10.3.1.The coefficient table -- Sign interpretation -- Interpretation with odds ratios -- Probability interpretation -- Average marginal effects -- 10.3.2.The iteration block -- 10.3.3.The model fit block -- Classification tables -- Pearson chi-squared -- 10.4.Logistic regression diagnostics -- 10.4.1.Linearity -- 10.4.2.Influential cases -- 10.5.Likelihood-ratio test -- 10.6.Refined models -- 10.6.1.Nonlinear relationships -- 10.6.2.Interaction effects -- 10.7.Advanced techniques -- 10.7.1.Probit models --

Contents note continued: 10.7.2.Multinomial logistic regression -- 10.7.3.Models for ordinal data -- 10.8.Exercises -- 11.Reading and writing data -- 11.1.The goal: The data matrix -- 11.2.Importing machine-readable data -- 11.2.1.Reading system files from other packages -- Reading Excel files -- Reading SAS transport files -- Reading other system files -- 11.2.2.Reading ASCII text files -- Reading data in spreadsheet format -- Reading data in free format -- Reading data in fixed format -- 11.3.Inputting data -- 11.3.1.Input data using the Data Editor -- 11.3.2.The input command -- 11.4.Combining data -- 11.4.1.The GSOEP database -- 11.4.2.The merge command -- Merge 1:1 matches with rectangular data -- Merge 1:1 matches with nonrectangular data -- Merging more than two files -- Merging m:1 and 1:m matches -- 11.4.3.The append command -- 11.5.Saving and exporting data -- 11.6.Handling large datasets -- 11.6.1.Rules for handling the working memory --

Contents note continued: 11.6.2.Using oversized datasets -- 11.7.Exercises -- 12.Do-files for advanced users and user-written programs -- 12.1.Two examples of usage -- 12.2.Four programming tools -- 12.2.1.Local macros -- Calculating with local macros -- Combining local macros -- Changing local macros -- 12.2.2.Do-files -- 12.2.3.Programs -- The problem of redefinition -- The problem of naming -- The problem of error checking -- 12.2.4.Programs in do-files and ado-files -- 12.3.User-written Stata commands -- 12.3.1.Sketch of the syntax -- 12.3.2.Create a first ado-file -- 12.3.3.Parsing variable lists -- 12.3.4.Parsing options -- 12.3.5.Parsing if and in qualifiers -- 12.3.6.Generating an unknown number of variables -- 12.3.7.Default values -- 12.3.8.Extended macro functions -- 12.3.9.Avoiding changes in the dataset -- 12.3.10.Help files -- 12.4.Exercises -- 13.Around Stata -- 13.1.Resources and information -- 13.2.Taking care of Stata -- 13.3.Additional procedures --

Contents note continued: 13.3.1.Stata Journal ado-files -- 13.3.2.SSC ado-files -- 13.3.3.Other ado-files -- 13.4.Exercises.

Title Subject:

Stata

Subject Term:

Statistics -- Data processing.

Mathematical statistics -- Data processing.

Added Author:

Kreuter, Frauke.

Electronic Access:

Contributor biographical information http://catdir.loc.gov/catdir/enhancements/fy1302/2012934051-b.html

Copies:

Available:*

Library	Material Type	Item Barcode	Shelf Number	Copy	Status
Searching... IIEMSA	General Books	33168025594199	519.20285 K79D 2012	1	Searching... Unknown

Bound With These Titles

On Order

Summary

Data Analysis Using Stata, Third Editionis a comprehensive introduction to both statistical methods and Stata. Beginners will learn the logic of data analysis and interpretation and easily become self-sufficient data analysts. Readers already familiar with Stata will find it an enjoyable resource for picking up new tips and tricks.

The book is written as a self-study tutorial and organized around examples. It interactively introduces statistical techniques such as data exploration, description, and regression techniques for continuous and binary dependent variables. Step by step, readers move through the entire process of data analysis and in doing so learn the principles of Stata, data manipulation, graphical representation, and programs to automate repetitive tasks. This third edition includes advanced topics, such as factor-variables notation, average marginal effects, standard errors in complex survey, and multiple imputation in a way, that beginners of both data analysis and Stata can understand.

Using data from a longitudinal study of private households, the authors provide examples from the social sciences that are relatable to researchers from all disciplines. The examples emphasize good statistical practice and reproducible research. Readers are encouraged to download the companion package of datasets to replicate the examples as they work through the book. Each chapter ends with exercises to consolidate acquired skills.