
COMP 5070 Statistical Programming For Data Science


Questions:

1. Write a short explanation (approximately one paragraph) of the analysis to be performed and an explanation of the data. Include your session IDs for the expert responses, and describe any data manipulation performed prior to analysis, should you do so.

2. Exploratory Factor Analysis: conduct two separate exploratory factor analyses: the first for your selected session IDs for the expert responses, the other for the full set of amateur responses. You may present the analyses side by side or in sequence, however you believe is best.

 

Answer:

Question 1

Initial Data Discussion

The sensometric scores for the fourteen chocolate attributes were analyzed on the basis of their average contribution to product definition. A bar plot and a heat map of the average sensometric scores of the attributes were plotted. For the experts, chocolate aroma, sweetness, and crispy texture were comparatively more important qualities for the choice and ranking of chocolates.

The sensometric scores given by the amateurs for the same fourteen attributes were analyzed in the same way, with a bar plot and a heat map of the average sensometric scores. For the amateurs, chocolate aroma, sweetness, and crispy texture were likewise the important qualities for the choice and ranking of chocolates.

Exploratory Factor Analysis

Cronbach Alpha

The responses of the experts and the amateurs were tested for reliability, ahead of exploratory factor analysis, using Cronbach's alpha. The response matrix of the experts was found to be moderately reliable (α = 0.48), and the responses of the amateurs were found to be comparatively less reliable (α = 0.38). This difference in the reliability statistics suggested that the factor analysis based on the experts' opinions would be more dependable than the one based on the amateurs' opinions.
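The reliability check described above can be illustrated with a minimal sketch of Cronbach's alpha, computed as k/(k-1) multiplied by (1 - sum of item variances / variance of the total score). The function name `cronbach_alpha` and the layout of `ratings` (one row per response, one column per attribute) are assumptions made for illustration; the original analysis does not show its code.

```python
from statistics import variance


def cronbach_alpha(ratings):
    """Cronbach's alpha for a response matrix.

    ratings: list of rows, one per response, with one column per attribute
    (hypothetical layout, assumed for this sketch).
    """
    k = len(ratings[0])
    if k < 2:
        raise ValueError("at least two attributes are needed")
    # Sample variance of each attribute (column).
    item_variances = [variance(column) for column in zip(*ratings)]
    # Sample variance of the row totals.
    total_variance = variance([sum(row) for row in ratings])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Illustrative values only, not the chocolate data.
    demo = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
    print(cronbach_alpha(demo))
```

Perfectly consistent attributes give an alpha of 1, while uncorrelated attributes give an alpha near 0, which is the scale on which the expert and amateur values are compared.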

 

Correlation Output 

The positive and negative correlations between the experts' ratings of the attributes, across the various product ranges of chocolates, were denoted by blue and green circles. Chocolate aroma was significantly and positively correlated with bitterness, astringency, and crispy texture, whereas milk aroma was positively associated with sweetness, caramel flavor, vanilla flavor, and, to a lesser extent, the texture of the chocolates. At this stage, two probable factors were identified: the chocolate attributes and the milk attributes of the chocolates. For the amateurs, the strongest negative correlation was between chocolate flavor and milk flavor (r = -0.96), whereas bitterness and chocolate flavor were strongly positively correlated (r = 0.93).

Determinant test, Bartlett’s test of Sphericity and the KMO Statistic

The determinant of the correlation matrix for the experts was greater than 0.00001, signifying that there were no multicollinearity issues for exploratory factor analysis. A similar result was obtained for the amateurs' responses, where multicollinearity was not a problem for the dataset. Bartlett's test of sphericity was used to test the null hypothesis that the correlation matrix was an identity matrix, that is, that the attributes were uncorrelated and no common factors could be extracted. The hypothesis was rejected for the experts' opinions at the 1% level of significance for the arbitrarily chosen sessions (9 and 5). The amateur dataset also indicated that the correlation matrix was significantly different from an identity matrix at the 1% level of significance. To test the adequacy of the sample data, the Kaiser-Meyer-Olkin (KMO) statistic was used, and the value for the experts was found to be close to 1 (KMO = 0.91). This signified that the sample dataset was adequate for factor extraction. The parallel adequacy check on the amateur data (KMO = 0.83) showed that there was enough data for factor analysis.
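The three checks above can be sketched directly from a correlation matrix. The example below is a minimal illustration using only the Python standard library: the function names `invert_with_det`, `bartlett_sphericity`, and `kmo` are hypothetical, the Bartlett statistic uses the usual chi-square approximation -(n - 1 - (2p + 5)/6) ln|R| with p(p - 1)/2 degrees of freedom, and the p-value lookup is left out because it needs a chi-square distribution function.

```python
import math


def invert_with_det(matrix):
    """Gauss-Jordan elimination with partial pivoting.

    Returns (inverse, determinant) of a square matrix given as nested lists.
    """
    n = len(matrix)
    # Work on a copy augmented with the identity matrix.
    aug = [list(map(float, row)) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular or nearly singular")
        if pivot != col:
            aug[col], aug[pivot] = aug[pivot], aug[col]
            det = -det  # a row swap flips the sign of the determinant
        scale = aug[col][col]
        det *= scale
        aug[col] = [value / scale for value in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0.0:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug], det


def bartlett_sphericity(corr, n_obs):
    """Bartlett's chi-square statistic and degrees of freedom."""
    p = len(corr)
    _, det = invert_with_det(corr)
    statistic = -(n_obs - 1 - (2 * p + 5) / 6) * math.log(det)
    return statistic, p * (p - 1) // 2


def kmo(corr):
    """Overall Kaiser-Meyer-Olkin measure of sampling adequacy."""
    p = len(corr)
    inverse, _ = invert_with_det(corr)
    sum_r2 = sum_partial2 = 0.0
    for i in range(p):
        for j in range(p):
            if i != j:
                # Partial correlation obtained from the inverse matrix.
                partial = -inverse[i][j] / math.sqrt(inverse[i][i] * inverse[j][j])
                sum_r2 += corr[i][j] ** 2
                sum_partial2 += partial ** 2
    return sum_r2 / (sum_r2 + sum_partial2)


if __name__ == "__main__":
    demo = [[1.0, 0.5], [0.5, 1.0]]  # illustrative matrix, not the study data
    print(invert_with_det(demo)[1], bartlett_sphericity(demo, 30), kmo(demo))
```

A determinant above 0.00001, a large Bartlett statistic relative to its degrees of freedom, and a KMO value close to 1 correspond to the conditions reported in the text.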

Number of Factors to Estimate

The scree plot identified two components with eigenvalues greater than 1 in the expert reviews, so the output suggested extracting two factors. Extraction of two factors was likewise proposed for the amateurs' responses.

Final Factor Solution
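The "eigenvalues greater than 1" rule used with the scree plot can be sketched as follows. This is an illustrative standard-library implementation that finds the eigenvalues of a symmetric correlation matrix with the classical Jacobi rotation method; in practice a numerical library's eigenvalue routine would normally be used. The names `jacobi_eigenvalues` and `kaiser_factor_count` are hypothetical.

```python
import math


def jacobi_eigenvalues(sym, tol=1e-12, max_sweeps=100):
    """Eigenvalues of a symmetric matrix (nested lists), largest first."""
    n = len(sym)
    a = [list(map(float, row)) for row in sym]
    for _ in range(max_sweeps):
        off_diagonal = sum(a[i][j] ** 2
                           for i in range(n) for j in range(n) if i != j)
        if off_diagonal < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                # Rotation angle that zeroes the (p, q) entry.
                theta = 0.5 * math.atan2(2 * a[p][q], a[q][q] - a[p][p])
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # rotate columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p] = c * akp - s * akq
                    a[k][q] = s * akp + c * akq
                for k in range(n):  # rotate rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k] = c * apk - s * aqk
                    a[q][k] = s * apk + c * aqk
    return sorted((a[i][i] for i in range(n)), reverse=True)


def kaiser_factor_count(corr):
    """Number of eigenvalues of the correlation matrix that exceed 1."""
    return sum(1 for value in jacobi_eigenvalues(corr) if value > 1.0)


if __name__ == "__main__":
    # Illustrative matrix with two correlated pairs, not the study data.
    demo = [[1.0, 0.8, 0.0, 0.0],
            [0.8, 1.0, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.8],
            [0.0, 0.0, 0.8, 1.0]]
    print(jacobi_eigenvalues(demo), kaiser_factor_count(demo))
```

In the illustrative matrix, each correlated pair of variables produces one eigenvalue above 1, so the rule suggests two factors, mirroring the two-factor outcome described for both groups.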

Among the PCA, ML, and PA methods of extraction, the principal axis (PA) method was able to load all the components on two factors. Milk flavor, caramel flavor, milk aroma, vanilla flavor, sticky texture, sweetness, and melting texture loaded on factor 1. Chocolate flavor, astringency, bitterness, chocolate aroma, acidity, granular texture, and crispy texture loaded on factor 2. All the components showed a statistically significant association with their factors. Based on the components' features, the first factor was identified as the Milk Characteristic and the second factor as the Chocolate Characteristic.

In the amateur dataset, all the components loaded cleanly on two factors under Principal Components Analysis (PCA). Milk flavor, chocolate flavor, bitterness, sweetness, caramel flavor, chocolate aroma, vanilla flavor, crispy texture, milk aroma, and melting texture loaded on factor 1, whereas sticky texture, acidity, astringency, and granular texture loaded on factor 2. The pattern of factor loadings indicated confusion in the amateurs' judgments and preferences. The two factors could be identified as the taste and the feel of the chocolates.

Differences between the Expert and Amateur Sensometric Ratings

The experts were precise in identifying the two sensometric factors of the analysis, based on the components of the chocolates. Milk and chocolate are the two primary components of a chocolate, and the experts aligned the attributes with them correctly. The amateurs, on the other hand, were strongly inclined towards the taste factor: they ranked chocolates based on their taste and feel. The difference in ranking reflects the difference in expertise and in knowledge about the details of chocolates.

Conclusions

Reliability of the responses for factor analysis was greater for the experts' opinions than for the amateurs'. Item-wise reliability analysis of the experts' views revealed that excluding milk aroma and milk flavor increased Cronbach's alpha from 0.48 to 0.49. A very high positive correlation was observed between these two components, whereas chocolate and milk flavors were almost perfectly negatively associated, with a correlation coefficient of -0.97. There was a significant negative relation between bitterness and milk flavor, which made them load on two different factors. The sample was found to be adequate, and the correlation matrix considerably different from an identity matrix, for accurate factor extraction. Individual KMO statistics were high; the minimum value of 0.834 was noted for milk flavor. The sample size was found to be sufficiently large for a proper EFA.

Reliability for the amateurs was found to be 0.38, which increased to 0.40 on removal of chocolate and milk flavor from the dataset. The most important aspects were identified as the sticky texture and the astringency of the chocolates. Caramel flavor contributed most to reliability. Here, chocolate flavor and bitterness had a highly positive correlation, whereas the relations between chocolate and milk flavor, and between sweetness and bitterness, were highly negative. The sample was found to be adequate, and the correlation matrix considerably different from an identity matrix, for accurate factor extraction. Individual KMO statistics were high; the minimum value of 0.834 was noted for milk flavor. The sample size was found to be large enough for EFA. The preference for ranking the chocolates was based solely on the taste and feel of the chocolate. From the correlation between the factors, it was noted that no oblique rotation was required for EFA (Hanna, de Araújo, Vilarino, & Mayhew, 2016).

Question 2

Initial Data Discussion

For the distance matrix data for the Asian continent, hierarchical clustering and partitional clustering were performed to identify the zones in which the twenty-five cities in the dataset are located. The distance matrix was explored using a heat map and a spring map. The red cells indicated distances that pointed towards the closeness of another city. The spring map was drawn to identify the proximity of pairs of cities. The bold lines signified those cities that can easily be isolated in a cluster. The dataset was scaled by shifting the origin to the median and rescaling by the absolute deviation from the median.
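The scaling step can be sketched as a robust standardization: subtract the median and divide by the absolute deviation from the median. The text does not say which absolute deviation was used, so this sketch assumes the median absolute deviation (MAD); the function name `robust_scale` is hypothetical.

```python
from statistics import median


def robust_scale(values):
    """Center values at their median and scale by the median absolute deviation.

    Assumption: "absolute deviation from median" is taken to mean the median
    of the absolute deviations, which the source does not confirm.
    """
    center = median(values)
    mad = median(abs(v - center) for v in values)
    if mad == 0:
        raise ValueError("median absolute deviation is zero; cannot scale")
    return [(v - center) / mad for v in values]


if __name__ == "__main__":
    # Illustrative distances only, not the city data.
    print(robust_scale([1, 2, 3, 4, 100]))
```

Because both the center and the scale are medians, a single very large distance has little influence on how the remaining values are scaled.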

Hierarchical clustering

The default linkage for hierarchical clustering is the "complete" method. In this study, AGNES-based methods were used alongside Ward's method for comparison. The resulting dendrograms from the four methods were used for cluster identification. With every method, 5 clusters were identified from visual inspection of the dendrograms.

From the hanging tree it was easy to locate the five clusters, or zones, of cities. The city-wise picture is provided in the following matrix. Although Karachi and Madras are far apart, they still fall into the same cluster.

The Pearson's and Spearman's correlation matrices were plotted graphically for all the methods of hierarchical clustering. All the methods were found to yield almost similar results, with the average method performing best. The Spearman's correlation plot in the subsequent figure confirmed the efficiency of the average method for hierarchical clustering in this study.
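The agglomerative procedure behind these dendrograms can be sketched as repeated merging of the two closest clusters until the requested number remains. The example below is a naive illustration working directly on a distance matrix; the function name `agglomerative` and its parameters are hypothetical, only the complete, single, and average linkages are included, and Ward's method is left out.

```python
def agglomerative(dist, n_clusters, linkage="complete"):
    """Naive agglomerative clustering on a symmetric distance matrix.

    Returns a list of clusters, each a sorted list of item indices.
    """
    def between(a, b):
        # Distance between two clusters under the chosen linkage.
        pairwise = [dist[i][j] for i in a for j in b]
        if linkage == "complete":
            return max(pairwise)
        if linkage == "single":
            return min(pairwise)
        if linkage == "average":
            return sum(pairwise) / len(pairwise)
        raise ValueError("unknown linkage: " + linkage)

    clusters = [[i] for i in range(len(dist))]  # start with singletons
    while len(clusters) > n_clusters:
        # Find the closest pair of clusters and merge them.
        _, i, j = min((between(clusters[i], clusters[j]), i, j)
                      for i in range(len(clusters))
                      for j in range(i + 1, len(clusters)))
        clusters[i] = sorted(clusters[i] + clusters[j])
        del clusters[j]
    return clusters


if __name__ == "__main__":
    # Four illustrative points on a line at 0, 1, 10 and 11.
    positions = [0, 1, 10, 11]
    demo = [[abs(a - b) for b in positions] for a in positions]
    print(agglomerative(demo, 2))
```

Cutting the merge sequence at a chosen number of clusters corresponds to cutting the dendrogram at a given height, which is how the five zones were read off.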

Partition clustering 

The 3D map of the two-dimensional distance matrix indicated five separate zones for clustering. The clusters were later identified using the Elbow Method. Considering the minimization of the total within-cluster sum of squares, 3 clusters with 7, 5, and 13 cities were identified. The number of clusters was later changed to 5 for proper partitioning of the cities.

3d Map of 2d Distance Matrix

With the number of clusters set to 5, the cluster plot yielded five clusters with 6, 2, 3, 9, and 5 cities. The cluster with 9 cities was located near the Bangkok and Singapore region; these nine cities clustered together because of their proximity in that region.

Discussion

The partitional clustering was based on k-means, that is, on the choice of k cluster centers. Initial processing suggested 3 clusters of cities, with the minimum total within-cluster sum of squares at 75.4%. Later, the appropriate number of clusters was decided on the basis of the Elbow method, taking the previous methods of cluster analysis into account. The 3D plot was an indicative figure in this case. Five zones were identified: i) near the Bangkok region, ii) near the Delhi region, iii) near the Yokohama region, iv) near the Bangalore region, and v) near the Istanbul region.
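The k-means and Elbow steps discussed above can be sketched as follows. This is a minimal standard-library illustration of Lloyd's algorithm with a deterministic start (the first k points are used as initial centers, an assumption made so the sketch is reproducible), followed by the total within-cluster sum of squares for each candidate k. The names `kmeans` and `elbow_curve` are hypothetical, and the points are illustrative coordinates rather than the city data.

```python
def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, n_iter=100):
    """Lloyd's algorithm; returns (labels, centers, total within-cluster SS)."""
    centers = [list(p) for p in points[:k]]  # assumption: first k points
    labels = []
    for _ in range(n_iter):
        # Assign each point to its nearest center.
        labels = [min(range(k), key=lambda c: squared_distance(p, centers[c]))
                  for p in points]
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append([sum(col) / len(members)
                                    for col in zip(*members)])
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break  # converged
        centers = new_centers
    wss = sum(squared_distance(p, centers[lab])
              for p, lab in zip(points, labels))
    return labels, centers, wss


def elbow_curve(points, max_k):
    """Total within-cluster sum of squares for k = 1 .. max_k."""
    return [kmeans(points, k)[2] for k in range(1, max_k + 1)]


if __name__ == "__main__":
    demo = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    print(kmeans(demo, 2)[0])
    print(elbow_curve(demo, 2))
```

The Elbow method looks for the value of k after which the curve stops dropping sharply. As the text notes, that numerical optimum may still be overridden when a different number of clusters is easier to interpret in practice.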

Validation

No outlier distance was identified in the matrix, and the proper number of zones, or clusters, of cities was identified to be 5. The initial clustering was able to reduce the total sum of squares, but it formed clusters containing far-off cities. The solution with k = 5 partitions was found to be appropriate from the point of view of practical significance.

Conclusions

Both hierarchical clustering and partitional clustering were efficient clustering techniques. Considering the choice of clusters, however, hierarchical clustering was easier to interpret because of the clear picture of cluster membership in the dendrograms. The k-means clustering had the power to generate the optimal partitioning of the data points with the minimum total within-cluster sum of squares.

In the present study, hierarchical clustering was more efficient than partitional clustering in deciding the number of clusters. In partitional clustering, mutually exclusive, spherical clusters were obtained. Hierarchical clustering can follow an agglomerative or a divisive approach; in the agglomerative approach, the cities were initially treated as individual clusters and then merged from the bottom to the top of the tree (Yates et al., 2015).

 

References 

Hanna, L. M. O., de Araújo, R. J. G., Vilarino, E. F. A., & Mayhew, A. S. B. (2016). The caries experience and dentistry following evaluation of children submitted to antineoplastic therapy. Journal of Research in Dentistry, 4(2), 45-50.

Yates, L. R., Gerstung, M., Knappskog, S., Desmedt, C., Gundem, G., Van Loo, P., ... & Li, Y. (2015). Subclonal diversification of primary breast cancer revealed by multiregion sequencing. Nature Medicine, 21(7), 751.

