
COMP20008
Elements of data processing
Semester 2 2021
Lecture 2: Visualisation – Part III

Basic Visualisation
✓ Line plots
✓ Boxplots
✓ Histograms
✓ Bar charts
• Scatter plots
• Heatmaps
• Parallel coordinate plots

Scatter plots
Two numeric variables
https://www.data-to-viz.com/graph/scatter.html

Scatter plots
• X-axis: one numeric variable
• Y-axis: the other numeric variable
• Each dot is one data point; its two values give the x and y coordinates.
https://datavizcatalogue.com/methods/scatterplot.html
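
A minimal sketch of a basic scatter plot with matplotlib. The data are synthetic, and the variable names (`height_cm`, `weight_kg`) and the output filename are assumptions made for illustration only.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt
import numpy as np

# Synthetic data (assumed for illustration): two numeric variables.
rng = np.random.default_rng(0)
height_cm = rng.normal(170, 10, size=100)                        # x variable
weight_kg = 0.9 * height_cm - 90 + rng.normal(0, 8, size=100)    # y variable

fig, ax = plt.subplots()
points = ax.scatter(height_cm, weight_kg)  # one dot per (x, y) data point
ax.set_xlabel("Height (cm)")
ax.set_ylabel("Weight (kg)")
ax.set_title("Scatter plot of two numeric variables")
fig.savefig("scatter_basic.png")
```

Each row of the data becomes one dot, so the plot contains exactly as many dots as there are data points.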

Scatter plots – patterns
Relationship between two variables
https://datavizcatalogue.com/methods/scatterplot.html
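
The kinds of pattern a scatter plot can reveal can be sketched with synthetic data. The three cases below (positive, negative, no clear relationship) are assumptions chosen for illustration; the Pearson correlation from `np.corrcoef` is shown in each panel title as one numeric summary of a linear relationship.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(1)
n = 200
x = rng.normal(0, 1, size=n)

# Three synthetic relationships (assumed for illustration).
patterns = {
    "positive": x + rng.normal(0, 0.5, size=n),
    "negative": -x + rng.normal(0, 0.5, size=n),
    "none": rng.normal(0, 1, size=n),
}

fig, axes = plt.subplots(1, 3, figsize=(12, 4))
correlations = {}
for ax, (name, y) in zip(axes, patterns.items()):
    r = np.corrcoef(x, y)[0, 1]  # Pearson correlation between x and y
    correlations[name] = r
    ax.scatter(x, y, s=10)
    ax.set_title(f"{name} (r = {r:.2f})")
fig.tight_layout()
fig.savefig("scatter_patterns.png")
```

A correlation near zero only rules out a linear relationship; a curved pattern can still be visible in the plot.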

Outliers and scatter plots

Outliers detection with PLS regression for NIR spectroscopy in Python
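
A rough sketch of how outliers stand out in a scatter plot. This is not the PLS-based method of the article named above: it fits a straight line with `np.polyfit` and flags points whose residual is far from the median residual. The synthetic data, the two injected outliers, and the 3.5 cut-off are all assumptions for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(0, 10, 50)
y = 2.0 * x + 1.0 + rng.normal(0, 1, size=x.size)

# Inject two artificial outliers far above the trend (synthetic).
x = np.append(x, [2.0, 8.0])
y = np.append(y, [35.0, 47.0])

# Fit a straight line and measure each point's distance from it.
slope, intercept = np.polyfit(x, y, deg=1)
residuals = y - (slope * x + intercept)

# Robust score: distance from the median residual, in units of scaled MAD.
median = np.median(residuals)
mad = np.median(np.abs(residuals - median))
robust_z = np.abs(residuals - median) / (1.4826 * mad)
flagged = robust_z > 3.5  # assumed cut-off, a rule of thumb only

fig, ax = plt.subplots()
ax.scatter(x[~flagged], y[~flagged], label="typical points")
ax.scatter(x[flagged], y[flagged], color="red", label="flagged as outliers")
ax.plot(x[:50], slope * x[:50] + intercept, color="grey", label="fitted line")
ax.legend()
fig.savefig("scatter_outliers.png")
```

Whether a flagged point is an error or a genuinely unusual observation still has to be judged from the context of the data.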

More than 2 features with scatter plots
1. Bubble plots
• A special scatter plot representing 3-dimensional data
• Size of circle around a point indicates the value of the 3rd dimension.
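
A minimal bubble plot sketch using the `s` argument of matplotlib's `scatter`. The data and the scaling of the third variable to marker sizes are assumptions for illustration. Note that `s` is the marker area in points squared, not the radius.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt
import numpy as np

# Synthetic 3-dimensional data (assumed for illustration).
rng = np.random.default_rng(2)
x = rng.uniform(0, 10, size=40)
y = rng.uniform(0, 10, size=40)
third = rng.uniform(1, 100, size=40)  # value of the 3rd dimension

# Rescale the 3rd dimension to marker areas between 20 and 320 points^2.
sizes = 20 + 300 * (third - third.min()) / (third.max() - third.min())

fig, ax = plt.subplots()
bubbles = ax.scatter(x, y, s=sizes, alpha=0.5)  # alpha keeps overlapping bubbles visible
ax.set_xlabel("x")
ax.set_ylabel("y")
ax.set_title("Bubble plot: circle size encodes a third variable")
fig.savefig("bubble_plot.png")
```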

2. Enhanced scatter plots
• Use colours for the values of the 3rd dimension.
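
A minimal sketch of encoding a third variable with colour, using the `c` and `cmap` arguments of matplotlib's `scatter`. The data and the choice of the `viridis` colour map are assumptions for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt
import numpy as np

# Synthetic data (assumed for illustration).
rng = np.random.default_rng(3)
x = rng.uniform(0, 10, size=80)
y = rng.uniform(0, 10, size=80)
third = x + y + rng.normal(0, 1, size=80)  # value of the 3rd dimension

fig, ax = plt.subplots()
sc = ax.scatter(x, y, c=third, cmap="viridis")  # colour encodes the 3rd dimension
fig.colorbar(sc, ax=ax, label="Third variable")  # legend for the colour scale
ax.set_xlabel("x")
ax.set_ylabel("y")
fig.savefig("scatter_colour.png")
```

For a categorical third variable, one distinct colour per category would take the place of the continuous colour scale.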

3. Scatterplot matrix
• A matrix of scatter plots of all pairs of dimensions (variables)
• Inspect many relationships simultaneously.
• Convenient for spotting correlations between variables
• Also helps with spotting outliers
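
A minimal scatterplot matrix sketch using `pandas.plotting.scatter_matrix`. The four synthetic columns `a` to `d` are assumptions for illustration; `df.corr()` is printed alongside as a numeric companion to the plots.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import numpy as np
import pandas as pd
from pandas.plotting import scatter_matrix

# Synthetic data with known relationships (assumed for illustration).
rng = np.random.default_rng(4)
n = 150
a = rng.normal(0, 1, size=n)
df = pd.DataFrame({
    "a": a,
    "b": 2 * a + rng.normal(0, 0.5, size=n),  # strongly related to a
    "c": -a + rng.normal(0, 1.0, size=n),     # weaker negative relation to a
    "d": rng.normal(0, 1, size=n),            # unrelated to the others
})

# One scatter plot per pair of columns, histograms on the diagonal.
axes = scatter_matrix(df, figsize=(8, 8), diagonal="hist")
axes[0, 0].figure.savefig("scatter_matrix.png")

# Pairwise Pearson correlations for the same columns.
print(df.corr().round(2))
```

With k variables the matrix has k × k panels, so it becomes hard to read once there are more than a handful of variables.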

‘Overplotting’ in scatter plots
When there are many data points, dots tend to overlap. Possible remedies:
– Reduce dot size
– Sampling
– Jitter (for moderate overplotting)
– Use other plots
See https://python-graph-gallery.com/134-how-to-avoid-overplotting-with-python/
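
A sketch of three of the remedies listed above: smaller dots, random sampling, and jitter. The data are synthetic, and the dot sizes, the sample of 500 points, and the ±0.2 jitter width are assumptions for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(5)

# Large continuous data set: many dots overlap at the default size.
n = 20_000
x = rng.normal(0, 1, size=n)
y = 0.6 * x + rng.normal(0, 1, size=n)

# Small discrete data set: many points share exactly the same coordinates.
ratings_a = rng.integers(1, 6, size=300)  # integers 1..5
ratings_b = np.clip(ratings_a + rng.integers(-1, 2, size=300), 1, 5)

fig, axes = plt.subplots(1, 3, figsize=(12, 4))

# 1. Reduce dot size.
axes[0].scatter(x, y, s=1)
axes[0].set_title("Smaller dots")

# 2. Sampling: plot a random subset drawn without replacement.
idx = rng.choice(n, size=500, replace=False)
axes[1].scatter(x[idx], y[idx], s=10)
axes[1].set_title("Random sample of 500 points")

# 3. Jitter: add small random noise so identical points separate.
jitter_a = ratings_a + rng.uniform(-0.2, 0.2, size=300)
jitter_b = ratings_b + rng.uniform(-0.2, 0.2, size=300)
axes[2].scatter(jitter_a, jitter_b, s=10)
axes[2].set_title("Jittered discrete values")

fig.tight_layout()
fig.savefig("overplotting.png")
```

Jitter changes the plotted positions slightly, so it suits display only; any calculations should use the original values.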
