SOCI332 Statistics for Social Science
SOCI332 Week 1 Discussion: Choosing a Topic
Description
This discussion supports CO2. Please review the Week 1 Lesson Materials prior to responding.
Choose a topic that you would like to research this term. Use the GSS database to find two variables that would support your topic – one independent variable and one dependent variable. Be sure to include the topic, variable names and description in your post. Then discuss how you hope to explore the variables this term and what you hope to learn.
Example:
Does people’s education level affect their income? Independent variable (IV) is education level – the variable DEGREE. Dependent variable (DV) is income – the variable RINCOM16.
SOCI332 Week 2 Discussion: Frequency Tables and Charts
This is an Individual Discussion. The discussion prompt is a mini-assignment where students post statistical analyses conducted via SPSS. The instructor then gives feedback within the discussion as a response post. Once feedback is given, students are able to make corrections (if needed) within response posts to the instructor’s feedback. If no corrections are needed, students should respond and acknowledge they have read the instructor’s feedback.
Initial Discussion posts are due Wednesday. All interaction and corrections should be completed by Sunday. There is no interaction with peers. The responses are only visible to each individual student and the instructor. Initial posts should be thorough, completing all tasks given in the discussion prompt. All posts should demonstrate college level writing skills.
Prior to completing this week’s discussion, you must have SPSS rented and downloaded. You must also have the GSS dataset downloaded from the SPSS & GSS Overview and saved to your computer. You will first open SPSS and then open the GSS dataset within SPSS. (See Intro to SPSS.)
You are now going to create and post a frequency table, chart, and descriptives table (central tendency/dispersion) of each of your variables.
Complete the following steps:
- Post a brief explanation of your topic. Include your research question and for each variable – the name, survey question or description, answer categories (yes/no, strongly agree, disagree, etc.), and level of measurement (nominal, ordinal, or interval/ratio)
- Include a frequency table for each of your variables. Explain your outputs in no more than 5 sentences for each variable. Cite numbers in the outputs to support your conclusion. When you cite %, use the % reported in the “valid percent” column.
- Create a chart for each variable, which is a graphic representation of your data. The type of chart (pie, bar, or histogram) is based on a variable’s level of measurement. Explain your outputs for each variable. It is OK if your explanation is similar to (but not the same as) the frequency table interpretation since a chart is a different data presentation on the SAME variable. Cite numbers in the outputs to support your conclusion.
- Describe the measures of central tendency (mean, median, mode) and dispersion (variance, standard deviation) for each of your variables. Based on what you have learned in the readings and lessons this week, identify the best measures for each variable and explain what they tell us. What do these measures summarize for us about the variable’s data?
- Prior to completing this week’s discussion, you must have SPSS rented and downloaded. You must also have the GSS dataset downloaded from the SPSS & GSS Overview and saved to your computer. You will first open SPSS and then open the GSS dataset within SPSS. (See Intro to SPSS.)
You are now going to create and post a frequency table, chart, and descriptives table (central tendency/dispersion) of each of your variables.
SOCI332 Week 3 Discussion Crosstabs
Description
In our lessons this week, we discuss the use of crosstabulations (crosstabs) as a preliminary analysis to begin investigating the relationship between the IV and DV. A crosstab creates a “snapshot” of our data. Measures of association help to identify the strength and direction of the potential relationship.
You are now going to create and post a crosstab of your variables and a measures of association table.
Complete the following steps:
- Post a brief explanation of your topic. Include your research question and a broad research hypothesis — that is, the relationship of IV to DV. (For example, educational attainment affects family income in US adults.)
- Run a crosstab on your variables. Be sure to explain your findings, including a description of the table, a calculation of the epsilons, and a discussion of the 10% rule.
- Run the correct measure of association for your variables (Choose one – either Pearson R, Gamma, Phi, Cramer’s V or Lambda). Explain what the output means in terms of strength and direction of the relationship. Interpret Proportional Reduction of Error (PRE) using the following statement: Knowing the IV will reduce error in predicting the DV by *%.
Copy the crosstab and measure of association table into the discussion window or into a document (PDF, MS Word) and attach to discussion. If your table does not fit to the page, choose “copy special” and then “images” or take a screen shot of the table to copy/past into the window.
Special note:
When a variable is continuous (interval/ratio level of measurement), for example age of respondent, we do not run crosstabs directly because it will result in a really spread-out table with lots of zeros and low frequency cells. Such a crosstab does not help us understand the data. The correct way is to reduce the level of measurement to either ordinal level or nominal level (group the numbers into categories) by recoding and then run the crosstab. (Please refer to the Lesson Recoding in SPSS for further information.)
As a reminder, here are the guidelines for choosing your measure of association:
- Both DV and IV are nominal variables: Lambda (when it is not a 2X2 table)
- Both DV and IV are nominal variables and it is a 2X2 table: Phi
- Both DV and IV are ordinal variables: Gamma
- One variable ordinal or interval/ratio AND the other variable dichotomous nominal (like Yes/No, male/female, etc.): Gamma
- One variable ordinal or interval/ratio AND the other variable nominal (not dichotomous, has more than 2 categories): Cramer’s V.
- Both DV and IV are I/R variables: Pearson’s r
SOCI332 Assignment 1: Learning and Applying Tests of Significance
Throughout this assignment you will review six mock studies. Follow the step-by-step instructions:
- Mock Studies 1 – 3 require you to enter data from scratch. You need to create a data set for each of the three mock studies by yourself. (Refresh the data entry skill acquired in Week 1.)
- Mock Studies 4 – 6 require you to use the GSS dataset specified in the course. The variables are given in each Mock Study.
- Go through the five steps of hypothesis testing (below) for EVERY mock study.
- All calculations should be coming from your SPSS. You will need to submit the SPSS output file (.spv) to get credit for this assignment.
The five steps of hypothesis testing when using SPSS are as follows:
- State your research hypothesis (H1) and null hypothesis (H0).
- Identify your significance level (alpha) at .05 or .01, based on the mock study. You only need to use ONE level of significance (either .05 or .01) as specified in the instructions.
- Conduct your analysis using SPSS.
- Look for the valid score for comparison. This score is usually under ‘Sig 2-tail’ or ‘Sig. 2’ or ‘Asymptotic Sig.’ We will call this “p.”
- Compare the two and apply the following rule:
- If “p” is < or = alpha, then you reject the null.
- Please explain what this decision means in regards to this mock study. (Ex: Will you recommend counseling services?)
Please make sure your answers are clearly distinguishable. Perhaps you could bold your font or use a different color.
This assignment is due no later than Sunday of Week 4 by 11:55 pm ET. Save this Word file in the following format: [your last name_SOCI332_A1]. Your spv (SPSS output) file should be labeled [your last name_SOCI332_A1Output].
t-Tests
Mock Study 1: t-Test for Independent Samples (20 points)
- Six months after an industrial accident, a researcher has been asked to compare the job satisfaction of employees who participated in counseling sessions with those who chose not to participate. The job satisfaction scores for both groups are reported in the table below.
Use the five steps of hypothesis testing to determine whether the job satisfaction scores of the group that participated in counseling session are statistically different from the scores of employees who chose not to participate in counseling sessions at .01 level of significance. (Alpha = .01)
Clearly list each step of hypothesis testing. As part of Step 5, indicate whether the researcher should recommend counseling as a method to improve job satisfaction following industrial accidents based on evaluation of the null hypothesis.
Data to be entered in SPSS (instructions below)
PARTICIPATED IN COUNSELING | DID NOT PARTICIPATE IN COUNSELING |
35 | 38 |
39 | 36 |
41 | 36 |
36 | 32 |
37 | 30 |
36 | 39 |
37 | 41 |
39 | 35 |
42 | 33 |
38 | 38 |
Step 1: Data managing
- Open a blank SPSS data file: FileàNewà Data
- In the blank SPSS data file, create your SPSS data set by entering the job satisfaction scores of those who participated/did not participate in the counseling sessions (reported on previous page). Please create two columns. Column one is the test variable, where you enter ALL the 20 scores in the table. Column 2 is the grouping variable, where you use “1” to indicate if a score is from someone who participated in the counseling sessions; and “0” to indicate if a score is from someone who chose not to participate in the counseling sessions. The data set will look like this in SPSS Data View window:
35 1
39 1
……….
38 0
36 0
……….
- After data entry, go to Variable View window, change the name of the first variable (test variable) to “JOBSAT” and the second variable (grouping variable) as “group.” Set decimals for both variables to zero.
Step 2: SPSS execution
- Click: AnalyzeàCompare MeansàIndependent-Samples T Testà use arrow to move JOBSAT to “Test Variable” à use arrow to move “group” to “Grouping Variable” àwhen two (? ?) appear, click Define Groups. On the next pop up window, enter “1” for “Group 1” and “0” to “Group 2.”
- Click
Mock Study 2: t- Test for Dependent Means (15 points)
- Researchers are interested in whether depressed people undergoing group therapy will perform a different number of activities of daily living before and aftergroup therapy. More ADL after therapy is a positive outcome. The researchers randomly selected 10 depressed clients in a 6-week group therapy program.
Use the five steps of hypothesis testing to determine whether the observed differences in the numbers of activities of daily living obtained before and after therapy are statistically significant at .05 level of significance. (Alpha = .05)
Clearly list each step of hypothesis testing. As part of Step 5, indicate whether the researchers should recommend group therapy for all depressed people based on evaluation of the null hypothesis.
Data to be entered in SPSS (instructions below)
CLIENT | BEFORE THERAPY | AFTER THERAPY |
11 | 16 | |
12 | ||
10 | 13 | |
13 | 20 | |
11 | 14 | |
12 | 15 | |
15 | ||
17 | ||
13 | 18 | |
12 |
Step 1: Managing data
- Open a blank SPSS data file: FileàNewàData
- In the blank SPSS data file, create your SPSS data set by entering the number of activities of daily living performed by the depressed clients (see above) in the Data View window. Enter the “before therapy” scores in the first column and the “after therapy” scores in the second column.
- In the Variable View window, change the variable name for the first variable to “ADLPRE” and the second variable to “ADLPOST.” Set the decimals for both variables to zero.
Step 2: SPSS execution
- Click: Analyze àCompare Means àPaired-Samples t-Test à use the arrow to move ADLPRE under “variable 1” inside Paired Variable(s) windowà and then use the arrow to move ADLPOST under “variable 2” inside Paired Variable(s) window.
- Click OK.
Mock Study 3: t-Test for a Single Sample (15 points)
- Researchers are interested in whether depressed people undergoing group therapy will perform a different number of activities of daily living (ADL) after group therapy than the average for depressed people. More ADL is a positive outcome. The researchers randomly selected 20 depressed clients to undergo a 6-week group therapy program.
Use the five steps of hypothesis testing to determine whether the average number of activities of daily living (shown below in the table) obtained after therapy is significantly different from a mean number of activities – 15 – that is typical for depressed people. (Clearly list each step).
Test the difference at both the .05 level of significance. (Alpha = .05)
Clearly list each step of hypothesis testing. As part of Step 5, indicate whether the behavioral scientists should recommend group therapy for all depressed people based on evaluation of the null hypothesis.
Data to be entered in SPSS (instructions below)
CLIENT | AFTER THERAPY ADL |
17 | |
14 | |
11 | |
23 | |
24 | |
17 | |
14 | |
10 | |
21 | |
11 | |
22 | |
19 | |
15 | |
17 | |
23 | |
12 | |
10 | |
15 | |
20 | |
18 |
Step 1: Data managing
- Open a blank SPSS data file: FileàNewà Data
- In the blank SPSS data file, create your SPSS data set by entering the number of activities of daily living performed by the depressed clients (numbers listed under AFTER THERAPY – see above) in the Data View window.
- In the Variable View window, change the variable name to “ADL.” Set the decimals to zero.
Step 2: SPSS execution
- Click: Analyze àCompare Means à One-Sample T test à use the arrow to move “ADL” to the Variable(s) window on the right.
- Enter the population mean (15) in “Test Value”
- Click OK.
ANOVA (15 points)
Mock study 4: One-Way ANOVA
- An advertising firm has been hired to assess whether different demographics have different rates of TV watching to help determine their advertising strategy. Using the GSS 2018 data, determine whether hours of tv watched differs by race.
Use the five steps of hypothesis testing to determine whether the observed differences in the number of hours watching TV across three groups are statistically significant at .05 level of significance. (Alpha = .05)
Clearly list each step of hypothesis testing. As part of Step 5, indicate whether the advertising firm should target each racial group differently (if their habits differ) based on evaluation of the null hypothesis.
Variables from GSS 2018 dataset to be used (instructions below):
RACE – race of respondent
1 = WHITE
2 = BLACK
3 = OTHER
TVHOURS – hours per day watching TV
Step 1: Data managing
- Open a blank SPSS data file: FileàOpen Dataàsav (from wherever you have it saved)
Step 2: SPSS execution
- Click: Analyze àCompare Means à One-Way ANOVA à use arrow to move TVHOURS to “Dependent Variable list” à use arrow to move RACE to “Factor,” which instructs SPSS to conduct the analysis of variance on the number of activities performed by therapy type.
- Click: Options àDescriptive (to obtain descriptive statistics).
- Click: Continue
- Click: OK.
Additional question based on Mock Study 4
- Describethe circumstances under which you should use ANOVA instead of t-Tests. Explain why t-Tests are inappropriate in these circumstances.
Regression (20 points)
Mock study 5: Linear Regression
- Researchers in the field of gerontology are researching the effects of age on mental health. They are using GSS data to gather some preliminary findings.
Following the five steps of hypothesis testing, conduct a linear regression analysis to determine whether age affects number of poor mental health days at the .05 level of significance. (Alpha = .05)
Clearly list each step of hypothesis testing. As part of Step 5, indicate whether there is a significant relationship between age and mental health at the .05 level and what does this mean in regard to this mock study. Should the researchers continue their study?
Variables from GSS 2018 dataset to be used (instructions below):
AGE – age of respondent
MNTLHLTH – Days of poor mental health past 30 days
Step 1: Data managing
- Open a blank SPSS data file: FileàOpen Dataàsav (from wherever you have it saved)
Step 2: SPSS execution
- Click: Analyze àRegression à Linear à use arrow to move MNTLHLTH to “Dependent list” à use arrow to move AGE to “Independent,” which instructs SPSS to conduct the linear regression on the relationship of age to poor mental health.
- Click: OK.
For Mock Study 5, after completing the 5 steps of hypothesis testing, also construct the regression equation for the analysis. What does this tell us?
Chi-Square (15 points)
Mock study 6: Chi-Square Test for Independence
- Researchers are interested in whether US adults have different levels of confidence in Congress (legislative branch of the federal government) in conjunction with how strongly that person identifies with a specific political party. These data are presented below.
Following the five steps of hypothesis testing, conduct chi-square test for independence at the .05 level of significance. (Alpha = .05).
Clearly list each step of hypothesis testing. As part of Step 5, indicate whether the observed frequency is significantly different from the expected frequency, and what that means in regard to this mock study. In other words, does political party affiliation effect one’s confidence in Congress?
Variables from GSS 2018 dataset to be used (instructions below):
CONLEGIS – confidence in congress (legislative branch of government)
1 = A GREAT DEAL
2 = ONLY SOME
3 = HARDLY ANY
PARTYID – political party affiliation
0 = STRONG DEMOCRAT
1 = NOT STR DEMOCRAT
2 = IND NEAR DEMOCRAT
3 = INDEPENDENT
4 = IND NEAR REPUBLICAN
5 = NOT STR REPUBLICAN
6 = STRONG REPUBLICAN
7 = OTHER PARTY
Step 1: Data managing
- Open a blank SPSS data file: FileàOpen Dataàsav (from wherever you have it saved)
Step 2: SPSS execution
- Click: Analyze àDescriptive Statistics à Crosstabs à use arrow to move “PARTYID” to “Column(s)”à use arrow to move “CONLEGIS” to “Row(s).” (Recall in crosstab, DV is always in the row and IV is always in the column.)
- Click: Statistics àcheck “Chi-Square.”
- Click: Continue.
- Click: Cellsàcheck “Expected.”
- Click: Continue.
- Click: OK.
SOCI332 Assignment 1: Beginning a Research Study – Univariate Analysis
Complete the following assignment by filling in all pertinent areas of research. You will need to utilize SPSS and the GSS dataset specified in the class for this assignment. You should complete this assignment using the variables and topic that you have chosen for your Final Portfolio Project. You will then be able to follow this as a guide, as well as a check-point, with your instructor. It is essential that you read through all of the feedback regardless of your score. You will be required to submit:
- This word document with blanks filled and SPSS outputs inserted. Throughout the assignment you will see places where your tables, charts, and graphs can be placed.
- An SPSS output file (spv) with this assignment for credit.
You may need to go back through the document to address formatting issues that shift as you begin to input your data. Points will be deducted for sloppiness. Use a different, but legible, color font for your responses.
This assignment is to completed and submitted no later than the Sunday of Week 3 by 11:55pm ET. This assignment is worth 100 points. Save the word file as follows [your last name_SOCI332_A1] and submit it to Assignments for feedback. Label the SPV file as [your last name_SOCI332_A1output]
(A) My Purpose (research question) (10 pts)
My research question is: _______________________________________________________.
I chose this topic because ____________________________________________________________________________________________________________________________________________________________.
APA citation of an academic resource that relates to your topic:
____________________________________________________________________________________________________________________________________________________________
(B) All About the GSS (10 pts) ***Reference Lesson 1 and http://gss.norc.org/faq***
- Who are the participants? ___________________________________________________________________________
- What population does the sample represent? ___________________________________________________________________________
- Who is funding the research? ___________________________________________________
- When is the data collected? __________________________________________________
- How is the data collected? ___________________________________________________
(C) Variables (You are expected to have only one dependent variable (DV) and one independent variable (IV). (15 pts)
My IV: Provide information for the IV using the format below.
IV Variable name in SPSS: ___________________
IV Question (as asked to the respondent verbatim) __________________________________________
__________________________________________________________________________________________________________________________________________________________________________
IV Answer categories: ___________________________________________________________________
__________________________________________________________________________________________________________________________________________________________________________
IV Level of Measurement (nominal, ordinal, interval/ratio): ___________________
My DV: Provide information for the DV using the format below.
DV variable name in SPSS: ______________________________
DV Question (as asked to the respondent verbatim)- __________________________________________
__________________________________________________________________________________________________________________________________________________________________________
DV Answer categories: _____________________________________________________________________
__________________________________________________________________________________________________________________________________________________________________________
DV Level of Measurement (nominal, ordinal, interval/ratio): __________________________
(D) Frequency Tables (15 pts)
Run frequencies for each variable listed above. Summarize your findings in a paragraph or two below. What do the counts and valid percents tell you about each variable? Cite numbers in the frequency tables to support your conclusion. Be sure to insert your tables (copy and paste from SPSS) into this document.
[Insert SPSS frequency tables here]
(E) Graphs and Charts (10 pts)
Run the appropriate graphs/charts for each of your variables listed above. Summarize your findings briefly in a paragraph or two. Cite numbers in the graph/charts to support your conclusion. How does the visual representation help us understand the data? Include a title on each of your charts and other pertinent labels.
[Insert SPSS graphs/charts here]
(F) Measures of Central Tendency and Dispersion (15 pts)
Run the measures of central tendency (mean, median, mode) and dispersion (variance, standard deviation) for each of your variables. Summarize your findings briefly in a paragraph or two. Which measures are appropriate for nominal, ordinal, or interval/ratio variables? What do these measures tell us about each variable?
[Insert SPSS output here]
(G) Recoding (15 pts)
Choose one of your variables to recode. If you have an interval/ratio variable, you may recode it into an ordinal variable. If you have two nominal/ordinal variables, recode the one with the most categories into fewer categories, or check with your instructor on the best option.
[Insert the following items: SPSS syntax for the recoding process; the frequency table for the original variable; and the frequency table for the recoded variable]
SOCI332 Assignment 2: Tests of Significance
Throughout this assignment you will review six mock studies. Follow the step-by-step instructions:
- Mock Studies 1 – 3 require you to enter data from scratch. You need to create a data set for each of the three mock studies by yourself. (Refresh the data entry skill acquired in Week 1.)
- Mock Studies 4 – 6 require you to use the GSS 2018 dataset. The variables are specified in each Mock Study.
- Go through the five steps of hypothesis testing (below) for EVERY mock study.
- All calculations should be coming from your SPSS. You will need to submit the SPSS output file (.spv) to get credit for this assignment.
The five steps of hypothesis testing when using SPSS are as follows:
- State your research hypothesis (H1) and null hypothesis (H0).
- Identify your significance level (alpha) at .05 or .01, based on the mock study. In Mock Study One, you are required to use BOTH .05 and .01 to test your hypotheses. For the remaining mock studies, you only need to use ONE level of significance (either .05 or .01) as specified in the instructions.
- Conduct your analysis using SPSS.
- Look for the valid score for comparison. This score is usually under ‘Sig 2-tail’ or ‘Sig. 2’ or ‘Asymptotic Sig.’ We will call this “p.”
- Compare the two and apply the following rule:
- If “p” is < or = alpha, then you reject the null.
- Please explain what this decision means in regards to this mock study. (Ex: Will you recommend counseling services?)
Please make sure your answers are clearly distinguishable. Perhaps you could bold your font or use a different color.
Related; PSY-380 Introduction to Probability and Statistics Project 3 – SPSS or Excel Analyses
This assignment is due no later than Sunday of Week 5 by 11:55 pm ET. Save this Word file in the following format: [your last name_SOCI332_A2]. Your spv (SPSS output) file should be labeled [your last name_SOCI332_A2Output].
t-Tests (50 points)
Mock Study 1: t-Test for a Single Sample (20 points)
- Researchers are interested in whether depressed people undergoing group therapy will perform a different number of activities of daily living (ADL) after group therapy than the average for depressed people. More ADL is a positive outcome. The researchers randomly selected 15 depressed clients to undergo a 6-week group therapy program.
Use the five steps of hypothesis testing to determine whether the average number of activities of daily living (shown below in the table) obtained after therapy is significantly different from a mean number of activities of 17 that is typical for depressed people. (Clearly list each step).
Test the difference at both the .05 and .01 levels of significance.
As part of Step 5, indicate whether the behavioral scientists should recommend group therapy for all depressed people based on evaluation of the null hypothesis at both levels of
Reviews
There are no reviews yet.