Unit 4 Self-Check Assignment: Milligan Chapter 10
Welcome to your hands-on activity! This assignment invites you to work with Tableau using a mix of generated and real-world data. You will follow these steps:
- Download the attached spreadsheets, and upload those data sets into Tableau. If you need help loading your data sources, refer to the Unit 1 Self Check Assignment: Milligan Chapter 1
- Go through this document, and use Tableau to answer all the questions listed below. Where applicable, paste screenshots into the template below.
- When you are ready, complete the online quiz, which verifies your homework. Use the answers you found in this document to answer the questions.
- When you have completed the online quiz, submit the Word document.
- Remember, you can always ask your instructor for help if needed.
- If you need to adjust the size of your visualizations to match the options in the questions, use the “Format”-> “Cell size” options. For example, “Ctrl+Shift+B” on a Windows computer will make the visualization bigger, and “Ctrl+Up” will make it taller.
This assignment contains 7 datasets and 6 icon files:
- Pacific West National Park Visitation 2001–2022
- Monthly Personal Budget (two worksheets: July and August)
- Video game sales
- 2022 NYC Restaurant Inspections
- September 2023 Baltimore City Crime Data
- University course completion rates
- University pass rates
- Icon files:
- Biology
- Chemistry
- Computer Programming
- Data Analytics
- Math
- Physics
Question 1: Bump chart
You have been asked to help the Northwest National Park system to find trends. Specifically, what do five-year visitation trends look like for the parks? Load the Pacific West National Park Visitation 2001–2022 into your data sources. Drag “Year” into the columns and create an ad hoc calculation in the row: SUM (Visitor Count). Then drag “Park Type.” Select only the “National Park” park type. Then, drag “Year” into the filters for the years 2018–2022. Keep in mind that the year 2020 will have anomalies due to the COVID-19 pandemic shutdown of many parks. Drag “Park Name” onto “Color” in the “Marks” tab and “Park Name” onto “Label.” Make sure the marks type is set to “Line.” Edit the label so that it shows the “Line Ends.”
You should now have a bump chart that looks like this:
From this chart, we can see that Yosemite, Olympic, and Death Valley all had [missing word], but they took downturns in 2020. They are rising again, but still not seeing the numbers they had previously. Joshua Tree, Mount Rainier, Hawaii Volcanoes, and Haleakala NP have increased visitors in 2022 compared to 2018.
While this is helpful, the leadership has requested to see a chart with 10 years (2013–2022). They also want to see only national parks with at least 1 million visitors annually. Add both filters to your bump chart. Add Visitor Count as a Label.
Question 1A: For the Yosemite NP, was there an increase or decrease between 2013 and 2022?
Question 1A Answer: |
<<insert your screenshot>>
Question 1B: What was the highest visitor count for Yosemite NP between 2013 and 2022?
Question 1B Answer: |
<<insert your screenshot>>
Question 2: Waterfall
Your friend is always complaining to you that they can’t save any money and their bank account is always going down. You ask your friend to write down all the expenses and income in a month, and you record them in the Monthly Personal Budget: July. For each expense, you have tracked the date, type, whether it is income, fixed, or discretionary, and the amount (negative for expenses and positive for income).
Load the dataset into Tableau and choose the July worksheet to start. Create a waterfall chart of the July data by dragging the “Income/Fixed/Discretionary” and “Type” dimensions into the columns. Then drag “Amount” into the rows. Because we want to see the total as the month goes on, we will create a running total calculation using table across. To do this, first, right-click “Amount” and select “Add Table Calculation…”. This should bring up the following window, … in which you should change the Calculation Type and Compute Using to match:
Drag the “Amount” measure onto Color, Size, and Label in the Marks tab.
Change the chart type to “Gantt Bar.”
Double-click on “SUM(Amount)” on Size and edit in the shelf. Add a negative sign before the SUM([Amount]).
Change the “SUM(Amount)” on Label to be a Running Total.
Your waterfall chart should be taking shape now. You should see something similar to the following image:
Now, we can see quickly that your friend spent $794 in the month of July just on discretionary spending. Then by the time they paid their rent, they had spent a total of $4,960 for the month. They make $4,140 from their regular job and another $805 from side gigs. This means that in July, your friend overspent their monthly income by $15 dollars. If they spend like that every month, that means they are overspending $180 a year.
Your friend would really like to start saving money, so you help them go through expenses and try to spend less money. Now, they return with their August expenses (another worksheet in the Monthly Personal Budget), and they want to see the same chart.
Question 2: How much money did your friend save this month? Provide your answer rounded to the nearest dollar (no cents).
Question 2 Answer: |
<<insert your screenshot>>
Question 3: Step lines
For this exercise, you will be using the “Video game sales” data set to answer questions about how certain genres have performed by global sales in the last five years.
Drag “Genre” over to Columns and “Global Sales” into Rows. Add a quick calculation of Running Total to the “Global Sales.”
Drag “Global Sales” onto Color in the Marks tab.
Filter by Years 2011 to 2016.
Change the Marks type to “Line,” and then click on “Path” and choose “Step.”
Click on Ctrl+Shift+B (or CMD+Shift+B if you’re on a Mac) a few times to make the graph wider (this might help to read the bottom axis better).
We can see that shooter games had a significant step within global sales between 2011 and 2016.
However, we can see that action starts around $140. Let’s modify this a bit.
Right-click on “Genre” in the columns area, click on the “Sort” function. We want to sort by Field, Ascending, Field Name-Global Sales, and Aggregation type: Sum.
Now, we can see where the calculation starts in the bottom left corner with Fighting and slowly steps up until we get to Action and then Shooter games, where we see two big jumps. This is a better representation of the step-line chart.
Now, change the years to 2006–2010.
Question 3: What is the top-selling genre? What was the amount of global sales for that genre?
Option A: Role-playing, 316.1
Option B: Shooter, 899.3
Option C: Action, 437.4
Option D: Sports, 209.4
Question 3 Answer: |
<<insert your screenshot>>
Question 4: Sparklines
You are being asked to find trends about the New York City Health Department Restaurant Inspections. There are over 27,0000 restaurants in the five boroughs of New York City. You need to load the data source: 2022 NYC Restaurant Inspections. This is a rather large data source and may take a little longer to load the data.
When restaurants have violations, they receive scores for each violation. Therefore, restaurants want low scores, and high scores are not desirable. Grades A–C are assigned to restaurants based on their scores. Restaurants with scores of 0 to 13 receive an “A.” If they score between 14 and 27, they receive a “B.” A “C” grade is 28 or more. If you would like to read more, see ABCEats-Restaurants.
Drag “Inspection Date” into columns and “Boro” into rows. Now
Reviews
There are no reviews yet.