Navigation Aids

 
 
 
 
 
Click here to IM, text, or chat
Courseware at the University of Pennsylvania
FindIt:

Sidebar

Main Content

Social Science Data Analysis Tools

The GSS (General Social Survey) is an almost annual, "omnibus," personal interview survey of U.S. households conducted by the National Opinion Research Center (NORC). The first survey took place in 1972 and since then more than 38,000 respondents have answered over 3,260 different questions.

General Social Survey [http://hdl.library.upenn.edu/1017/9086]

The special features of the GSS follow from its unique origin as the first, perhaps only, social science data set designed to be analyzed by "users," rather than the PIs and project staff.

Key features of the GSS are its broad coverage, its use of replication, its cross-national perspective, and its attention to data quality.

It covers a broad range of variables from "Abortion, attitudes toward," to "Zodiac, sign of." Topics are chosen, drawing on the advice of a large number of social scientists, to cover variables strategic for social science research. The eclecticism of the GSS is a mirror of the eclectic interests of contemporary social science. For more information on coverage, see the subject index.

ICPSR's collection of SDA-interface data sets. A selection of ICPSR data collections is available for online data analysis. This means that users can perform certain statistical procedures on the data, create custom subsets, or browse the codebook on the Internet, without downloading the entire collection and importing the data into a statistical package.


SDA Homepage from ICSPR [http://hdl.library.upenn.edu/1017/6945/2]

This is the main entry point to ICPSR's online data analysis system. In addition to the main archives listed below, the SDA site provides access to other datasets as well.

Studies are in 5 main categories:

  • Aging
  • Child Care & Early Education
  • Criminal Justice
  • Education
  • Substance Abuse & Mental Health

Substance Abuse & Mental Health [http://hdl.library.upenn.edu/1017/7030]

The goal of the archive is to provide ready access to substance abuse and mental health research data. Topics covered include drugs, tobacco, and alcohol, and their use, insurance coverage, treatment, illegal activities, and relevant demographic, health, and mental health data. Major studies archived include Drug Abuse Warning Network, Monitoring the Future, National Household Survey on Drug Abuse, National Pregnancy and Health Survey, Treatment Episode Data Set, and National Youth Survey.

National Archive of Criminal Justice Data [http://www.icpsr.umich.edu/NACJD/]

The National Archive of Criminal Justice Data (NACJD) preserves and distributes computerized crime and justice data from Federal agencies, state agencies, and investigator initiated research projects to users for secondary statistical analysis.

International Archive of Education Data [http://www.icpsr.umich.edu/IAED]

"The Archive acquires, processes, documents, and disseminates data collected by national, state or provincial, local, and private organizations, pertaining to all levels of education in countries for which data can be made available. Data will encompass the "inputs" to education (funding, personnel, teaching resources, facilities, teacher and student preparation, etc.), the variety of processes by which teaching and learning occur, and the "outputs" of education (graduation and matriculation rates, drop-out rates, test scores, job placements, life histories, life assessments, etc.). The data stored in this new Archive are intended to support a wide variety of comparative and longitudinal research through the preservation and sharing of data resources. The Archive seeks to serve the needs of academics, policymakers, and researchers in the field of education."

National Archive of Computerized Data on Aging (NACDA) [http://www.icpsr.umich.edu/NACDA]

"NACDA's mission is to advance research on aging by helping researchers to profit from the under-exploited potential of a broad range of datasets. NACDA acquires and preserves data relevant to gerontological research, processing as needed to promote effective research use, disseminates them to researchers, and facilitates their use. By preserving and making available the largest library of electronic data on aging in the United States, NACDA offers opportunities for secondary analysis on major issues of scientific and policy relevance."

Spreadsheet-like interface to enormous U.S. federal data sets, users may subset major data sets and download them to their own statistical packages. Users must download and install the DataFerrett application in order to access the data. It is an extremely useful tool with a somewhat clunky interface.

DataFerrett [http://www.thedataweb.org/index.html]

The Data Ferrett provides access to the following datasets.

  • American Community Survey (ACS)
  • American Housing Survey (AHS)
  • Behavioral Risk Factor Surveillance System (BRFSS)
  • Consumer Expenditure Survey (CES)
  • Current Population Survey (CPS)
  • Decennial Census of Population and Housing (Census2000)
  • National Ambulatory Medical Care Survey (NAMCS)
  • National Center for Health Statistics Mortality-Underlying Cause-of-Death (MORT)
  • National Health and Nutrition Examination Survey (HANES)
  • National Health Interview Survey (NHIS)*
  • National Hospital Ambulatory Medical Care Survey (NHAMCS)
  • National Survey of Fishing, Hunting, and Wildlife-Assocated Recreation (FHWAR)
  • Survey of Income and Program Participation (SIPP)
  • Survey of Program Dynamics (SPD)

Sociometrics SSEDL microdata analysis focusing on AIDS/STD, disability in the U.S., the American family, adolescent pregnancy, aging, and other health-related topics.

Multivariate Interactive Data Analysis System (MIDAS) [http://hdl.library.upenn.edu/1017/7042/1]

The datasets included in Midas are:

American Family Data Archive (AFDA) -- The archive contains data sets in family research that have the greatest potential for secondary analysis relating to family structure, marriage, divorce, family formation, child & family well being, child care, and other topics of importance to family research.

AIDS/STD Data Archive (AIDS) -- Included data sets address the following topics: the incidence and prevalence of specific sexual behaviors; contraceptive and STD-preventive behavior; attitudes and beliefs regarding sexual behavior and methods of contraception and STD prophylaxis; AIDS/HIV knowledge, attitudes, behavior, and serostatus; current and past episodes of STDs; and high-risk behavior, including alcohol/drug use and prostitution.

Contextual Data Archive -- Contextual data describe the population, social, and economic characteristics of geographic areas, from census tracts to states, in which people reside or work. The contextual data archive consists of a series of files, each organized around a different geographic unit of analysis (such as census tracts, school districts, counties, states, etc.)

Child Well-Being and Poverty Data Archive -- The collection provides descriptive and comparative data on the structure of child well-being and poverty; factors that contribute to poverty and child well-being; the consequences of child poverty; and evaluations of education, welfare, and other programs related to child well-being and poverty.

Data Archive of Social Research on Aging (DASRA) -- DASRA contains three large national surveys that deal with nearly all components of the aging process.

Data Archive on Adolescent Pregnancy and Pregnancy Prevention (DAAPPP) -- DAAPPP includes social science data on the incidence, prevalence, antecedents and consequences of teenage pregnancy and family planning. It also includes studies that focus more broadly on adolescent sexual health issues, thereby including studies examining behavioral factors related to sexually transmitted diseases (STDs) in addition to pregnancy.

Maternal Drug Abuse Data Archive (MDA) -- The collection includes data on the following topics: the prevalence of drug use among pregnant women and women of childbearing age; demographic characteristics of pregnant drug users; types and patterns of illicit drug use; social, psychological and economic antecedents of pre- and perinatal drug abuse; the effects of pre- and perinatal substance use on pregnancy complications and neonatal status; and the effects of fetal alcohol and drug exposure on children's physical, neurobehavioral, psychological and social development.

Research Archive on Disability in the U.S. (RADIUS) -- The purpose of the project is to facilitate access to the best data sets on the prevalence, incidence, correlates, and consequences of disability in the U.S.

*