Using the data provided to you (on Excel file), build a computational statistics data pipeline that automates the cleaning, transformation and initial analysis of the data. As part of this, you are expected to employ some dimension reduction and/or feature extraction techniques to reduce the dimensionality of the data for future analysis.

Please ensure that the code produced is fully commented and documented, and also include a brief explanation of your approach and efforts to ensure computational efficiency of your methods.

Sample Solution

This question has been answered.

Get Answer