Python, NumPy and MatPlotLib 2D Line Plot and 2D Scatter

This page is obsolete, I wrote it while I was just starting to learn python. I have wrote a much better set of notes on pandas available here:

An Introduction to Scientific Programming with Python

Table of contents

Tutorial Video

Perquisites
Configuring the Layout of Figures
Data Points
1. NumPy Vectors
2. NumPy Matrices
3. Pandas DataFrame

Line Colour (color – US Spelling)
Line Style (linestyle)
Line Width (linewidth)

Line Marker (marker)
Marker Size (markersize)
Marker Edge Width (markeredgewidth)

Marker Edge Colour (markeredgecolor – US Spelling)
Marker Face Colour (markerfacecolor – US Spelling)
Marker Fill Style (fillstyle)

Marker Face Colour Alternative (markerfacecoloralt – US Spelling)
Legend and Plot Labels
Multiple Lines

Labels (xlabel, ylabel, title)
Limits
Scales

Grid Lines
2D Line Plot Code
2D Scatter Plot

2D Scatter Plot Code

Tutorial Video

Perquisites

We will need the numpy and matplotlib.pylot libraries:

import numpy as np
import scipy as sp
import matplotlib.pyplot as plt
import pandas as pd

Configuring the Layout of Figures

Before creating any figures, you should adjust your preferences for how you wish to display figures. The default option is inline which means all figures will be printed to the Console as shown:

If instead you want the Figures to be shown as a separate Window, you can change the setting to Automatic. To do this go to Tools → Preferences:

To do this go to Tools → Preferences:

Next on the left hand menu select iPython console:

Select Graphics:

Change the setting from Inline to Automatic:

Select Apply:

Now go to Consoles and Restart the Kernal:

When rerunning your code, your figure will be in a separate window opposed to being inline within the Console:

Note Spyder Version 3.3 may give a stream of errors instead of making a plot. If you have this version (installed by default with the Anaconda March 2019 installer) you should close down Spyder and then update both Anaconda and Spyder. To do this open the Anaconda PowerShell Prompt and type in:

conda update anaconda
conda update spyder

Note it is also possible to toggle between the two settings without restarting the Kernal using the following commands:

%matplotlib auto
%matplotlib inline

In these guides, the setting automatic will be applied and the figures will all be shown as separate windows.

To create a new figure we can use the following function. Leaving the input argument empty will create a new figure:

plt.figure()

To view the figure in some other Python IDEs, you may need to show it, this can be done using the command:

plt.show

This is not needed when Spyder is used with an Automatic Backline for Interactive figures.

If no figures are open this will be "Figure 1". We can also specify the figure number using:

plt.figure(1000)
plt.show

Now that we have Figure 1000, if we once again type in:

plt.figure()
plt.show

We will get Figure 1000 +1 i.e. Figure 1001

The figures can be closes using the x on the top right corner or by using the command close with the input argument being the figure number in our case 1, 1000 and 1001:

plt.close(1)
plt.close(1000)
plt.close(1001)

The command:

plt.close('all')

Will close all open figures.

Data Points

The first two inputs into the plot command used for making 2D line plots are the xdata and ydata respectively. The plot command has to be called from plt (MatPlotLib.pyplot).

plt.plot(xdata,ydata)

Supposing we have recorded the speed of a rocket at the following 6 time points:

$\displaystyle \begin{array}{*{20}{c}} {\text{t (s)}} & {\text{v (m/s)}} \\ 0 & 0 \\ {10} & {227.04} \\ {15} & {362.78} \\ {20} & {517.35} \\ {22.5} & {602.97} \\ {30} & {901.67} \end{array}$

NumPy Vectors

We can create 2 single NumPy vectors (lists) from this data:

t=np.array([0,10,15,20,22.5,30])
v=np.array([0,227.04,362.78,517.35,602.97,901.67])

We can plot this using the variable names of both vectors. The first vector will be the x data on the figure and the second vector will be the y data on the figure.

plt.figure(1)
plt.show
plt.plot(t,v)

NumPy Matrices

We could also create a single NumPy array with two columns.

tv=np.array([[0,0],
             [10,227.04],
             [15,362.78],
             [20,517.35],
             [22.5,602.97],
             [30,901.67]])

To plot from this we will need to index into this using the Row and Column Indexes. Recall that we use square brackets to index into a NumPy array. We also use zero order indexing so start at 0 and go up in steps of 1. When indexing the rows are selected before the columns using a comma as a delimiter. And finally in order to select all elements of a row or column we use the semicolon :

We can open tv in variable explorer and see that for the x data, the rows we want are all of them and the column we want is 0:

tv[:,0]

We can open tv in variable explorer and see that for the y data, the rows we want are all of them and the column we want is 1:

tv[:,1]

In order to plot we use these as the x and y inputs into the plot command.

plt.figure(2)
plt.show
plt.plot(tv[:,0],tv[:,1])

Recall that we use zero order indexing. In some cases we may not want to plot all the data. Say we want to plot all the data in all rows except for the last

plt.figure(3)
plt.show
plt.plot(tv[0:5,0],tv[0:5,1])

Recall that we can set the lower bound before the colon and the upper bound after the colon. Recall with zero order indexing we go up to the last value but never reach it stopping at the value before. So this will plot row0, row1, row2, row3 and row4. Compare the differences in the axes on Figure 2 and Figure 3 and you'll notice that the data on the last row, row5 is missing.

For practice with indexing try to plot the data from row1, row2 and row3 only. Try to plot all the data using a colon with an upper and lower bound. Repeat for the individual vectors t and v.

# Row1, Row2 and Row3 Column 0
tv[1:4,0]
# Row1, Row2 and Row3 Column 1
tv[1:4,1]
# All rows Column 0
tv[0:6,0]
# All rows Column 1
tv[0:6,1]
# Row1, Row2 and Row3 t
t[1:4]
# Row1, Row2 and Row3 v
v[1:4]
# All rows t
t[0:6]
# All rows v
v[0:6]

Pandas DataFrame

tvdf=pd.DataFrame([[0,0],
                   [10,227.04],
                   [15,362.78],
                   [20,517.35],
                   [22.5,602.97],
                   [30,901.67]],
                  index=['R0',
                         'R1',
                         'R2',
                         'R3',
                         'R4',
                         'R5'],
                  columns=['t','v'])

Recall that one can index into a dataframe using dot indexing with the column name.

tvdf.t
tvdf.v

To perform the line plot from the data in the dataframe one can use the plot command as before:

plt.figure(4)
plt.show
plt.plot(tvdf.t,tvdf.v)

It is also possible to index a selection using dot indexing with the row names or using the row numbers for instance the column t is indexed using the row names and the column v is indexed using the row numbers.

plt.figure(5)
plt.show
plt.plot([tvdf.t.R0,tvdf.t.R1,tvdf.t.R2],tvdf.v[[0,1,2]])

This last line is equivalent to:

plt.plot([tvdf.t.R0,tvdf.t.R1,tvdf.t.R2],tvdf.v[0:3])

Note the additional set of square brackets used with indexing a list of specified points opposed to indexing a using a colon or colon with upper and lower bound (this also applies when referencing data within a NumPy matrix). It is advisable for the reader to create their own dataset with four columns and practice that they can plot each column on a separate figure and that they can plot a selection of data using a colon, a colon with an upper and lower bound, a selection using the row names and a selection using the row numbers to ensure they are up to scratch with indexing.

Line Colour (color – US Spelling)

For convenience we will return to plotting data using two single vectors and we will now look at changing properties of the plot using additional input arguments. Let's start with the colour of the line. Note Python uses the US spelling of colour which does not have the u, color.

plt.close('all')
plt.figure(1)
plt.plot(t,v,color='r')

Primary and Secondary colours aswell as Black and White can be encoded using a single letter string and also a string of the full name of the colour.

Single Letter String	Full String	RGB	RGB	Hex
r	red	[1,0,0]	[255/255, 0/255,0/255]	#ff0000
g	green	[0,1,0]	[0/255,255/255,0/255]	#00ff00
b	blue	[0,0,1]	[0/255,0/255,0/255]	#0000ff
y	yellow	[1,1,0]	[255/255,255/255,0/255]	#ffff00
c	cyan	[0,1,1]	[0/255,255/255,255/255]	#00ffff
m	magenta	[1,0,1]	[255/255,0/255,255/255]	#ff00ff
k	black	[0,0,0]	[0/255,0/255,0/255]	#000000
w	white	[1,1,1]	[255/255,255/255,255/255]	#ffffff

For more fine tuning colours can be specified as a vector of [r,g,b] values. Many programs list this vector of [r,g,b] values between 0 and 255 but Python recognises these are normalised values between 0 and 1. For instance the standard colours in Microsoft Word are as follows. Using these colours may be useful if you want to keep consistency with plots and a Word Document for instance.

Microsoft Word RGB	Hex
[192/255,0/255,0/255]	#c00000
[255/255,0/255,0/255]	#ff0000
[255/255,192/255,0/255]	#ffbf00
[255/255,255/255,0/255]	#ffff00
[146/255,208/255,80/255]	#92d050
[0/255,176/255,80/255]	#00b050
[0/255,176/255,240/255]	#00b0f0
[0/255,112/255,96/255]	#007060
[0/255,32/255,96/255]	#002060
[112/255,48/255,160/255]	#6f30a0

plt.close('all')
plt.figure(1)
plt.plot(t,v,color=[0/255,176/255,80/255])

Colours may also be listed as hex values.

plt.close('all')
t=np.array([0,10,15,20,22.5,30])
v=np.array([0,227.04,362.78,517.35,602.97,901.67])
plt.figure(1)
plt.plot(t,v,color='#6f30a0')

For more details about the colour scheme see:

Python and MatPlotLib: Understanding How we Encode Colour

In general one uses a lookup table to select the [R,G,B] or Hex value desired however it is useful to keep a list together of your favourite colours.

Line Style (linestyle)

The line style can be selected using the additional input argument linestyle. linestyle can have the following string arguments.

Line Style	String
None (Default)	None
Solid	–
Dashed	—
Dash Dot	-.
Dotted	:

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle='--')

Line Width (linewidth)

The linewidth can also be changed using the additional input argument linewidth, this is set to a numeric value.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3)

Line Marker (marker)

It is possible to add a Marker at every data point using the argument marker and setting it to a string or number.

Marker String	Description
.	point
,	pixel
o	circle
v	triangle_down
^	triangle_up
<	triangle_left
>	triangle_right
1	tri_down
2	tri_up
3	tri_left
4	tri_right
8	octagon
s	square
p	pentagon
P	plus
*	star
h	hexagon1
H	hexagon2
+	plus
x	x
X	X
D	diamond
d	thin_diamond
\|	vline
_	hline
None	None
Marker Numeric
0	tickleft
1	tickright
2	tickup
3	tickdown
4	caretleft
5	caretright
6	caretup
7	caretdown
8	caretleft
9	caretright
10	caretup
11	caretdown

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,marker='o')

Marker Size (markersize)

The marker size can also be modified using the additional input argument markersize, this is set to a numeric value.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,marker='o',markersize=15)

Marker Edge Width (markeredgewidth)

The marker edge width (thickness of the marker outside line) can also be modified using the additional input argument markeredgewidth, this is set to a numeric value.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3)

Marker Edge Colour (markeredgecolor – US Spelling)

The marker edge colour can be modified using the input argument markeredgecolor and setting it to a colour string, [R,G,B,] or hex value as discussed when looking at the input argument color.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255])

Marker Face Colour (markerfacecolor – US Spelling)

The marker face colour can be modified using the input argument markerfacecolor and setting it to a colour string, [R,G,B,] or hex value as discussed when looking at the input argument color.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255],
         markerfacecolor=[255/255,192/255,0/255])

Marker Fill Style (fillstyle)

The marker fillstyle can be modified using the input argument fillstyle and setting it to a string.

Fill Style String

none

top

bottom

left

right

full

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255],
         markerfacecolor=[255/255,192/255,0/255],
         fillstyle='left')

Marker Face Colour Alternative (markerfacecoloralt – US Spelling)

If the fillstyle is set to a direction (top, bottom, left, right), then the marker face has both a colour and alternative colour. This can be modified using the input argument markerfacecoloralt and setting it to a colour string, [R,G,B,] or hex value as discussed when looking at the input argument color.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255],
         markerfacecolor=[255/255,192/255,0/255],
         fillstyle='left',
         markerfacecoloralt=[0/255,112/255,96/255])

Legend and Plot Labels

The plot can be assigned a label set as a string. This will show up on the chart if the command legend() is typed.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255],
         markerfacecolor=[255/255,192/255,0/255],
         fillstyle='left',
         markerfacecoloralt=[0/255,112/255,96/255],
         label='Rocket 1')
plt.legend()

The location of the legend can be set using the input argument loc and assigning it to a string or a number. Note once again that English US is used for center opposed to the English UK version centre. Unfortunately with the numerical input, it is implemented in the following way:

Location String	Location Integer
best	0
upper right	1
upper left	2
lower left	3
lower right	4
right	5
center left	6
center right	7
lower center	8
upper center	9

Opposed to using the shape of the number square which would have made much more sense.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255],
         markerfacecolor=[255/255,192/255,0/255],
         fillstyle='left',
         markerfacecoloralt=[0/255,112/255,96/255],
         label='Rocket 1')
plt.legend(loc='lower right')

Multiple Lines

Unless a new figure is selected or the figure is closed, then additional plots will be added on top of the existing plot, this will allow multiple lines to be plotted.

plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255],
         markerfacecolor=[255/255,192/255,0/255],
         fillstyle='left',
         markerfacecoloralt=[0/255,112/255,96/255],
         label='Rocket 1')
plt.plot(t,2*v,color=[255/255,0/255,0/255],linestyle='--',linewidth=3,
         marker='*',markersize=15,markeredgewidth=3,
         markeredgecolor=[0/255,176/255,80/255],
         markerfacecolor=[112/255,48/255,160/255],
         fillstyle='left',
         markerfacecoloralt=[0/255,112/255,96/255],
         label='Rocket 2')
plt.legend(loc='lower right')

Labels (xlabel, ylabel, title)

The axis labels and figure title can be input using the commands xlabel, ylabel and title with input arguments as strings.

plt.xlabel('time (s)')
plt.ylabel('velocity (m/s)')
plt.title('Rocket Projectile')

Limits

It is possible to adjust the limits of the xaxis and yaxis using the function xlim and ylim. These functions have 2 value vectors as input arguments.

plt.xlim([-10,100])
plt.ylim([1,10000])

Scales

The functions xscale and yscale can be used to convert an axis from linear (default) to logarithmic using the strings linear and log respectively:

plt.xscale('linear')
plt.yscale('log')

Grid Lines

Gridlines may also be selected usign the function grid. This like the line plot has multiple input arguments such as color, linestyle and linewidth which have all been seen before. There are two additional input arguments, axis and which, which can have the following string input arguments.

axis	which
x	major
y	minor
both	both

For example:

plt.grid(axis='x',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)

By default the minor gridlines are not enabled. These need to be enabled using the function minor_ticks_on which should be called with an empty argument or True

plt.grid(axis='x',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='x',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)

We can now repeat the code with the y axis or alternatively in this case, alter the code to set the axis to both:

plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)

2D Line Plot Code

It is worthwhile putting all the code used to create the plot above together. The user is advised to modify it and use it with some of their own test data.

import numpy as np
import scipy as sp
import matplotlib.pyplot as plt
import pandas as pd
plt.close('all')
t=np.array([0,10,15,20,22.5,30])
v=np.array([0,227.04,362.78,517.35,602.97,901.67])
plt.plot(t,v,color=[0/255,176/255,80/255],linestyle=':',linewidth=3,
         marker='o',markersize=15,markeredgewidth=3,
         markeredgecolor=[112/255,48/255,160/255],
         markerfacecolor=[255/255,192/255,0/255],
         fillstyle='left',
         markerfacecoloralt=[0/255,112/255,96/255],
         label='Rocket 1')
plt.plot(t,2*v,color=[255/255,0/255,0/255],linestyle='--',linewidth=3,
         marker='*',markersize=15,markeredgewidth=3,
         markeredgecolor=[0/255,176/255,80/255],
         markerfacecolor=[112/255,48/255,160/255],
         fillstyle='left',
         markerfacecoloralt=[0/255,112/255,96/255],
         label='Rocket 2')
plt.legend(loc='lower right')
plt.xlabel('time (s)')
plt.ylabel('velocity (m/s)')
plt.title('Rocket Projectile')
plt.xlim([-10,100])
plt.ylim([1,10000])
plt.xscale('linear')
plt.yscale('log')
plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)

2D Scatter Plot

The 2D scatter plot is very similar to the 2D line plot. However the former places an emphasis on the line and the individual datapoints denoted by markers are secondary in nature, the latter places an emphasis solely on the individual datapoints.

Like the line plot, the first two input arguments are the x-data and the y-data:

plt.figure(2)
plt.scatter(t,v)

The scatter plot has a third input argument which selects the marker size. This can be a scalar where all markers will be the same size or it can be a vector of the same length as the x and y data. First let's use a scalar:

plt.scatter(t,v,50)

Now let's try a vector. In this case let's relate the size of the points to the data v, because v[0]=0, we will assign an offset of 5 so we can see v[0]

plt.scatter(t,v,v+5)

Many of the additional input arguments have the same name however, many of the properties such as color, linewidth correspond directly to the marker. The input argument color is used alone and applies to a filled marker. It is also possible to specify an edgecolor and facecolor separately, note how marker is docked from these in the scatter plot as all the defaults apply to the marker.

plt.scatter(t,v,v+5,color=[0/255,176/255,80/255],
         linewidth=5,
         marker='o',
         edgecolor=[112/255,48/255,160/255],
         facecolor=[255/255,192/255,0/255],
         label='Rocket 1')

The plot label, legend, axes labels, title, limits, scale and gridlines work identically to a 2D line plot.

2D Scatter Plot Code

import numpy as np
import scipy as sp
import matplotlib.pyplot as plt
import pandas as pd
plt.close('all')
t=np.array([0,10,15,20,22.5,30])
v=np.array([0,227.04,362.78,517.35,602.97,901.67])
plt.figure(2)
plt.scatter(t,v,v+5,color=[0/255,176/255,80/255],
         linewidth=5,marker='o',
         edgecolor=[112/255,48/255,160/255],
         facecolor=[255/255,192/255,0/255],
         label='Rocket 1')
plt.scatter(t,2*v,2*v+5,color=[255/255,0/255,0/255],
         linewidth=5,marker='*',
         edgecolor=[0/255,176/255,80/255],
         facecolor=[112/255,48/255,160/255],
         label='Rocket 2')
plt.legend(loc='lower right')
plt.xlabel('time (s)')
plt.ylabel('velocity (m/s)')
plt.title('Rocket Projectile')
plt.xlim([-10,100])
plt.ylim([1,10000])
plt.xscale('linear')
plt.yscale('log')
plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)