Home » Assignment Example • Examples • Statistical Data analysis • Undergraduate » Data Analysis using SPSS

The following report provides a comprehensive data analysis using SPSS software, whereas the targeted area is to determine the significance of training on the accuracy of malaria detection using virtual microscopy. The data comprises 10 participants divided equally into two groups: a control group and the trained group. The initial and final scores for both groups have been assessed to determine the statistical significance of training. To test the hypothesis of the undertaken model, the researcher has applied an independent sample T-test. Moreover, other supplementary tests have also been used, such as the normality test, descriptive statistics, and box plots have been obtained to understand the data more comprehensively.

In this research, the data was obtained from 10 participants bifurcated into two groups: the control group and the trained group. The Control group included the individuals who were not provided training regarding virtual microscopy, and the trained group included participants who were given proper training after their initial score. The following table shows a summarized view of

The above table states the value of the mean and standard deviation for the initial score and final score for the participants of the study. The mean value in the initial score is recorded to be 13.51, which is deviated by 1.64 units. On the other hand, in the case of the final score, the mean score was 56.5, which is higher than the former. It can be stated that overall, for both groups, the final results were significantly better than the initial score. The average final score deviated by 4.26 points.

Following is the hypothesis that is being tested by this report:

- H0 = The mean values of the experimental group and control group after undergoing training regarding virtual microscopy will be equal
- H1 = The mean values of the experimental group and control group after undergoing training regarding virtual microscopy are not equal.

In the process of data analysis using SPSS, the normality tests hold strong significance. Normality tests were conducted on the data to assess whether or not the data is normally distributed (Park, 2015). Moreover, the normality was to be determined to evaluate which type of test was to be conducted. The following table shows the results of the normality test:

The null hypothesis for this test is that the data is normally distributed (Norusis, 2011). The null hypothesis has not been rejected for both the Kolmogorov-Smirnova or Shapiro-Wilk tests. This indicates that the information is normally distributed. Following is the Q-Q plot for the initial score:

The above graph shows the observed values for the initial score plotted against the expected values. The graph shows an upward trend where most of the values are plotted within the line except for an outlier value. This further validates that the data for initial scores are normally distributed. The following graph shows the Q-Q plot for final scores:

**Figure 2: Q-Q Plot for Final Scores**

The above graph shows the observed values for the final score plotted against the expected values. The graph shows an upward trend where most values are plotted within the line with no outliers. This further corroborates that the data for final scores are normally distributed. Moreover, from the normality test results, it can also be said that a parametric test can be applied to the model. Hence, an independent sample T-test will be used to test the main hypothesis of this research.

For the data under consideration, box plots have been used to depict the groups of numerical data based on the quartiles. As Box plots highlights the five statistical values from a data set including maximum, minimum, first quartile, median and third quartile, it is necessary for data analysis using SPPS. The box plots also determine the data's variability and outliers (Kerr, Hall, and Kozub, 2002). The following image shows the box plot for the initial scores of the participants:

The box plot shows the minimum value, i.e. 8.33, and the maximum value, i.e. 15.83. However, there is a presence of an outlier in the data, denoted by a small dot superscripted with a 3. This indicates that the score at the third number is the outlier in the data, i.e. 25.83. It is considered an outlier because it is at an abnormal distance from the average values in the initial score. Moreover, the box plot also indicates that most scores are more than the median. The following image shows the box plot for the final scores of the participants:

In the case of the final scores, the minimum value is 35, and the maximum value is 77.5. It is also apparent from this box plot that there are no outliers in the case of final scores. The median value for the final scores appears to be 60 in the above box plot. At the same time, most participants scored less than the median value of 60. As compared to the box plot for initial scores, it can be stated that their extent of variability is significantly less in the case of the final score.

When two independent groups' populations are compared to see differences or similarities, an independent sample T-test is applied (Allen, Bennett, and Heritage, 2018). In the case of the data set that has been considered, the two independent groups are the control group and the trained group. The following table shows the group statistics of the model:

The mean values of the initial score indicate that the control group had a slightly higher score than the trained group. However, the deviation from the average value was significantly higher in the control group for the initial score. The mean values of the final score depict that the scores were improved majorly for both groups. However, the mean score of the trained group was higher, i.e. 66.5, compared to the mean value of the control group, i.e. 46.5. The deviation in the average value was again more elevated for the control group. Overall, from the group statistic, it can be evaluated that the final score has improved significantly after the intervention applied (training). However, at this stage, the significance of the difference between the mean of the two groups can be determined with the help of the following table:

During data analysis using SPSS software, Leven's test is used to determine the equality of Firstly, in the above table, the sig value for Levene’s test is given, which hypothesizes that the population of variances are equal (Marshall and Boggis, 2016). In the case of the initial scores of the participants, the sig value for this test is 0.273, which is higher than the alpha value at the 95% significance level; hence the null hypothesis is accepted, stating that the population of variances is homogenous or equal. This indicates that to test the equality of means, the sig value for ‘equal variances’ will be undertaken. As per this assumption, the sig value appears to be 0.542, which means that the null hypothesis of equality of means between the control and trained groups cannot be rejected.

On the other hand, in the case of the final scores of the participants, the sig value for Levene’s test is 0.154, which is higher than the alpha value at a 95% of significance level; hence the null hypothesis is accepted, stating that the population of variances are homogenous or equal. This indicates that to test the equality of means, the sig value for ‘equal variances’ will be undertaken. As per this assumption, the sig value appears to be 0.008, which means that the null hypothesis of equality of means between the control and trained groups is rejected. Henceforth, the results indicate that the final scores for the trained and control groups differ significantly.

This report evaluated statistical analysis and interpretation for comparing the quiz results undertaken by two groups, a control group and the trained group, using the independent sample T-test. The intervention used on the participants was providing training for virtual microscopy. The results indicate a lesser difference between the initial scores for the control and trained groups. However, in the final scores, which were recorded after the provision of training, a statistically significant difference was observed for both groups. Conclusively, the results have suggested that training is an efficient intervention in improving the accuracy of malaria detection by using the method of virtual microscopy.

**Review the following:**

**References**

Allen, P., Bennett, K. and Heritage, B., 2018. *SPSS Statistics: A Practical Guide with Student Resource Access 12 Months*. Cengage AU.

Kerr, A.W., Hall, H.K. and Kozub, S.A., 2002. *Doing statistics with SPSS*. Sage.

Norušis, M.J., 2011. *IBM SPSS statistics 19 guide to data analysis*. Upper Saddle River, New Jersey: Prentice Hall.

Park, H.M., 2015. Univariate analysis and normality test using SAS, Stata, and SPSS.

Please fill the free topic form and share your requirements

The writer starts to find a topic for you (based on your requirements)

The writer shared custom topics with you within 24 hours