Overview
Random sampling is used when performing data analysis on a large data set. It allows you to more accurately predict what the data is telling you without looking at the whole large data set. This technique is commonly used in polls. Polling is a survey of a population. With polling, a sample of people are asked the same questions, and the answers are analyzed to generalize the sample results against the entirety of a population. Sampling allows for insight into the larger data from which generalizations about the whole data set are made.
Prompt
Use the Module Six Assignment Data Set to take a sample of around 10 percent. Analyze that smaller sample set to find out the top three most frequent crime types in Miami-Dade county. Next, compare your sample set results with the results from the larger data set. Then, report on your conclusions.
Part 1: Analysis
Using Excel:
- Capture a random selection from the larger data set.
- Take one row every 10 rows.
- Create a new sample data set.
- Create a new sheet in Excel with the sample data.
- Determine the top three most frequent crimes from your sample data.
- Create a pivot table to show the frequency of crimes by type.
- Create a pivot table for the larger data set to determine the top three most frequent crimes across the county.
Part 2: Conclusions
Answer the following in regard to your results:
- List the top three most frequent crimes from both the sample and the larger data set.
- Discuss the similarities or differences in the results of both analyses.
- Describe the purpose of taking a sample for analysis.
What to Submit
Submit your assignment as a Word document of 1 to 2 pages using Times New Roman, one-inch margins, and double spacing. Sources should be cited according to APA style.