Python and MatPlotLib: Introduction to Image Processing

Table of contents

Example Picture
Prerequisites

Loading a Picture as float32 array using the function imread
A Picture as a 3D Array
Resolution of the Image

Viewing the 3D Array by Other Axes
Viewing the Picture
Turning Axes On or Off

Plotting a Grid
Compressing and Saving
Rotating an Image

Splitting Image into Primary RGBA Channels
Looking at Secondary Channels
Enhancing or Reducing One of the Channels

Converting to Greyscale
Colourmaps for Greyscale Data
Enhancing Greyscale Data with Colourmap

Adding Noise
Adding a Bright Pixel
Applying Filters to Improve Image Quality

Inversion

Example Picture

In this example, I will use the following picture. If you are working through this guide you can use the same picture, or you can replace it with one of your own. Save this picture by right clicking it and selecting save as… and call it LondonPNG.png. This picture should be in the same folder as your Python script.

Prerequisites

We require the following libraries

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage

We will also close all existing plots:

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')

Loading a Picture as float32 array using the function imread

The function imread found in the matplotlib.image library can be used to read in a picture from a file.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')

A Picture as a 3D Array

The picture, now stored as a float32 3D array, can be found in the variable explorer.

Opening it up, we can see the data as viewed via axis0.

Note that at the bottom we are viewing the array along axis 0 and the shape is 720×960×4. This is rows×columns×pages, with rows selected. In other words we are looking only at row 0. In this view (axis=0) the data shown is for each of the 960 columns of row 0. Each column is listed in this view as a row which has 4 values (the four pages). The four pages correspond to the colour channels. Recall that all colours are made up of three primary colours, red, green and blue; these are the 0th, 1st and 2nd pages (or the 0th, 1st and 2nd columns in the view of axis=0). The fourth page (page 3, recall we use zero-based indexing) is the alpha channel representing the transparency (in this case the image is not transparent, so every value in the 4th page is 1).

Let's look at the first row that we see in the view where axis=0. This set of four values corresponds to the colour of the 0th row and 0th column, otherwise known as pixel 0,0. We can index this value using:

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
colourpixel0=img1[0,0,:]
print(colourpixel0)

[0.23921569 0.46666667 0.7882353  1.        ]

This is the 0th row, as viewed from axis0.

Recall that imread returns the colours of a PNG as floats normalised between 0 and 1, whereas Microsoft Office displays them as values from 0 to 255. We can get the values Microsoft Office would specify by using:

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Get Colour of Pixel 0,0
colourpixel_0_0=img1[0,0,:]
rgb_colourpixel_0_0=np.round(colourpixel_0_0*255,0)
print('r=',rgb_colourpixel_0_0[0])
print('g=',rgb_colourpixel_0_0[1])
print('b=',rgb_colourpixel_0_0[2])

r= 61.0
g= 119.0
b= 201.0

We can take these into the colour picker in Microsoft Office and we get the colour of the sky (as expected with this image).
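The conversion above can be wrapped in a small helper. This is only an illustrative sketch: the name to_uint8 is made up here, and the pixel values are typed in by hand from the printed output so that the block runs without the image file.

```python
import numpy as np

def to_uint8(pixel):
    """Convert floats in the range 0-1 to integers in the range 0-255."""
    # Scale, round to the nearest whole number and store as 8-bit integers
    return np.round(np.asarray(pixel) * 255).astype(np.uint8)

# Pixel 0,0 as printed above (r, g, b, alpha)
colourpixel_0_0 = np.array([0.23921569, 0.46666667, 0.7882353, 1.0],
                           dtype=np.float32)
rgba_0_0 = to_uint8(colourpixel_0_0)
print(rgba_0_0)
```

Storing the result as integers avoids the trailing .0 seen in the printed r, g and b values.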

Resolution of the Image

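We can use the function np.shape to look at the dimensions of the array. To get the number of pixels we multiply the rows and columns together. Here we divide by 1024 to get kilopixels and by 1024 again to get megapixels. Note that camera manufacturers normally take a megapixel to be 1,000,000 pixels, which would give 0.69 rather than 0.66 for this image.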

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Get Dimensions of Array
[rows,cols,pages]=np.shape(img1)
print('rows=',rows)
print('cols=',cols)
print('pages=',pages)
# Get Resolution of Image
img1_resolution_pixels=rows*cols
img1_resolution_Mpixels=img1_resolution_pixels/(1024*1024)
print('resolution=',np.round(img1_resolution_Mpixels,2),'Mpixels')


rows= 720
cols= 960
pages= 4
resolution= 0.66 Mpixels
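The arithmetic can be checked without loading the image. The sketch below is an illustration under stated assumptions: the function name megapixels and its binary flag are invented for this example, and the decimal option reflects the usual definition of a megapixel as one million pixels.

```python
import numpy as np

def megapixels(rows, cols, binary=True):
    """Return the pixel count in megapixels."""
    # binary=True divides by 1024*1024, as the script above does
    # binary=False divides by 1,000,000, the usual camera convention
    divisor = 1024 * 1024 if binary else 1_000_000
    return rows * cols / divisor

rows, cols = 720, 960
print('pixels=', rows * cols)
print('binary=', np.round(megapixels(rows, cols), 2), 'Mpixels')
print('decimal=', np.round(megapixels(rows, cols, binary=False), 2), 'Mpixels')
```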

Viewing the 3D Array by Other Axes

In the variable explorer we can change the axes to view the array from. Switching from axis 0 to axis 1 will set the view to put the rows as the rows and the colours as the columns:

And to axis 2 will set the rows to display as the rows and the columns to display as the columns. In other words it will show page 0 as the numeric float of the red channel:

Page 1 as the numeric float of the green channel:

And page 2 as the numeric float of the blue channel.

In such a small region of the picture, 14 rows and 3 columns, there is very little change. Here is the image; recall that only 14/720 rows and 4/960 columns are displayed in the variable editor. So it will all be within a tiny patch of sky blue in the top left corner of the image.
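The three views correspond to fixing one index of the array and keeping the other two. A minimal sketch, assuming a blank array of the same shape as a stand-in for img1 (the names row0, col0 and red are chosen here for illustration):

```python
import numpy as np

# Stand-in for img1 with the same shape: rows x columns x pages
img = np.zeros((720, 960, 4), dtype=np.float32)
img[:, :, 3] = 1.0  # fully opaque alpha channel

row0 = img[0, :, :]   # axis 0 fixed: every column of row 0, 4 values each
col0 = img[:, 0, :]   # axis 1 fixed: every row of column 0, 4 values each
red = img[:, :, 0]    # axis 2 fixed: page 0, the red channel

print(row0.shape)
print(col0.shape)
print(red.shape)
```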

Viewing the Picture

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Create a Figure and plot img1
plt.figure(1)
plt.imshow(img1)

We can select a point on the picture using the mouse cursor. Let us go to the White Ensign and select different colours.

Let's once again index and select the colours from:

row=185, col=571

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Create a Figure and plot img1
plt.figure(1)
plt.imshow(img1)
plt.savefig('fig1')
# Get Colour of Pixel row (y)=185, col (x)=571
colourpixel_185_571=img1[185,571,:]
print(colourpixel_185_571)
rgb_colourpixel_185_571=np.round(colourpixel_185_571*255,0)
print('r=',rgb_colourpixel_185_571[0])
print('g=',rgb_colourpixel_185_571[1])
print('b=',rgb_colourpixel_185_571[2])

[0.5176471  0.17254902 0.21960784 1.        ]
r= 132.0
g= 44.0
b= 56.0

row=203, col=605

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Create a Figure and plot img1
plt.figure(1)
plt.imshow(img1)
# Get Colour of Pixel row (y)=203, col (x)=605
colourpixel_203_605=img1[203,605,:]
print(colourpixel_203_605)
rgb_colourpixel_203_605=np.round(colourpixel_203_605*255,0)
print('r=',rgb_colourpixel_203_605[0])
print('g=',rgb_colourpixel_203_605[1])
print('b=',rgb_colourpixel_203_605[2])

[0.47843137 0.5019608  0.5647059  1.        ]
r= 122.0
g= 128.0
b= 144.0

row=167 col=617

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Create a Figure and plot img1
plt.figure(1)
plt.imshow(img1)
# Get Colour of Pixel row (y)=167, col (x)=617
colourpixel_167_617=img1[167,617,:]
print(colourpixel_167_617)
rgb_colourpixel_167_617=np.round(colourpixel_167_617*255,0)
print('r=',rgb_colourpixel_167_617[0])
print('g=',rgb_colourpixel_167_617[1])
print('b=',rgb_colourpixel_167_617[2])

[0.         0.07058824 0.23921569 1.        ]
r= 0.0
g= 18.0
b= 61.0

row=304, col=102

Turning Axes On or Off

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
[rows,cols,pages]=np.shape(img1)
img1_resolution_pixels=rows*cols
img1_resolution_Mpixels=img1_resolution_pixels/(1024*1024)
# Create a Figure and plot img1
plt.figure(2)
plt.imshow(img1)
plt.axis('off')
plt.savefig('fig2')

Plotting a Grid

A grid can be added to the figure, just as it would be to any ordinary figure.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Create a Figure and plot img1
plt.figure(3)
plt.imshow(img1)
plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)
plt.savefig('fig3')

Compressing and Saving

Recall that the image consists of 720 rows×960 columns×4 pages. To reduce the file size we can take every nth row and nth column. For example, to reduce the number of pixels by a factor of 4 we take every 2nd row and every 2nd column.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Select every 2nd row and column of img1 and assign to img2
img2=img1[::2,::2,:]
[rows,cols,pages]=np.shape(img2)
img2_resolution_Mpixels=rows*cols/(1024*1024)
print('img2 resolution=',np.round(img2_resolution_Mpixels,2),'Mpixels')
plt.imsave('LondonPNG_compress1.png',img2)
plt.figure(4)
plt.imshow(img2)
plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)
plt.savefig('fig4')

Let's compare this with the earlier figure 3…

img1 resolution= 0.66 Mpixels

As can be seen, the number of pixels along both the x and y axes has halved.

img2 resolution= 0.16 Mpixels

The file size in Windows Explorer is seen to be about a quarter of the original (as expected with a compression ratio of 4):

Note the difference between plt.imsave and plt.savefig in the script above: plt.imsave saves the compressed 3D array to a new png file, whereas plt.savefig saves figure 4, which contains the image together with its axes and grid, as a file.

We can repeat this, compressing by 16 fold, 64 fold, 256 fold and 1024 fold. For convenience this will be done using a for loop.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')

index=np.arange(start=0,stop=6,step=1)
compressionratio=[1,4,16,64,256,1024]
for i in index: 
    # add 3 to i to get the figure number
    j=i+3
    imagename='img'+str(j)
    # Select Every n pixels of img1 and assign to imagename
    # Every n pixels is the sqrt of the compression ratio
    # This needs to be an int to slice
    sqrtindex=int(np.sqrt(compressionratio[i]))
    imagename=img1[::sqrtindex,::sqrtindex,:]
    # Calculate the Resolution of the new image
    [rows,cols,pages]=np.shape(imagename)
    imageresolution=rows*cols/(1024*1024)
    imageresolutionrounded=np.round(imageresolution,6)
    print('img'+str(j)+'_resolution='+str(imageresolutionrounded)+' Mpixels')
    # Save the new filename
    filename='LondonPNG_compress'+str(int(compressionratio[i]))+'.png'
    plt.imsave(filename,imagename)
    # The figure number should be j
    plt.figure(j)
    # Plot the data
    plt.imshow(imagename)
    # Add the Grid
    plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], 
             linestyle='-', linewidth=2)
    plt.minorticks_on()
    plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], 
             linestyle=':', linewidth=1)
    # Save the figure
    figname='fig'+str(j)
    plt.savefig(figname)

img3_resolution=0.65918 Mpixels
img4_resolution=0.164795 Mpixels
img5_resolution=0.041199 Mpixels
img6_resolution=0.0103 Mpixels
img7_resolution=0.002575 Mpixels
img8_resolution=0.000658 Mpixels
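The row and column counts behind these resolutions can be checked without the image file. The sketch below is illustrative only: downsample is a name made up here, and a blank array of the same shape stands in for img1.

```python
import numpy as np

def downsample(img, step):
    """Keep every step-th row and column; all pages are kept."""
    return img[::step, ::step, :]

img = np.zeros((720, 960, 4), dtype=np.float32)  # stand-in for img1
shapes = {}
for step in [1, 2, 4, 8, 16, 32]:
    small = downsample(img, step)
    rows, cols, pages = small.shape
    shapes[step] = (rows, cols)
    # step squared is the nominal compression ratio
    print('ratio=', step**2, 'rows=', rows, 'cols=', cols, 'pixels=', rows * cols)
```

Slicing rounds up when the size is not an exact multiple of the step, which is why the 1024-fold case keeps 23 rows (720/32 = 22.5) rather than 22.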

Here you can see how heavier compression ruins the quality of the picture. The file size in Windows Explorer:

Rotating an Image

We can use the function rotate from scipy.ndimage (imported as ndimage) to rotate the image by, for example, 45 degrees (line 13).

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')

# Rotate an Image
img7=ndimage.rotate(img1, 45)
plt.figure(9)
plt.imshow(img7)
plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)

The rotated image has a large proportion of white space. It is also possible to crop the image by selecting a subset of the array, for instance rows 400-800 and columns 400-800.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')

# Rotate an Image
img7=ndimage.rotate(img1, 45)
plt.figure(9)
plt.imshow(img7)
plt.grid(axis='both',which='major',color=[166/255,166/255,166/255], linestyle='-', linewidth=2)
plt.minorticks_on()
plt.grid(axis='both',which='minor',color=[166/255,166/255,166/255], linestyle=':', linewidth=1)

# Crop an Image
img8=img7[400:800,400:800,:]
plt.figure(10)
plt.imshow(img8)
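For rotations that are a multiple of 90 degrees, np.rot90 is a swapped-in alternative to ndimage.rotate: it only rearranges pixels, so there is no interpolation and no padding to crop away. A minimal sketch, assuming a blank stand-in array for img1 with one marked pixel (the crop limits are arbitrary and chosen only to fit the rotated shape):

```python
import numpy as np

img = np.zeros((720, 960, 4), dtype=np.float32)  # stand-in for img1
img[0, 0, :] = [1.0, 0.0, 0.0, 1.0]              # mark the top-left pixel red

# Rotate 90 degrees anticlockwise in the row/column plane; pages are untouched
rotated = np.rot90(img, k=1, axes=(0, 1))
print(rotated.shape)

# Crop by slicing rows and columns, in the same way as img8 above
cropped = rotated[400:800, 400:700, :]
print(cropped.shape)
```

After an anticlockwise quarter turn the marked top-left pixel ends up in the bottom-left corner, at rotated[-1, 0].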

Splitting Image into Primary RGBA Channels

Now let's look at img1 and get its dimensions (line 12). Let's select each individual page: page 0 is the red floats per pixel, page 1 is the green floats per pixel, page 2 is the blue floats per pixel and page 3 is the alpha or transparency floats per pixel. One can index into img1 by selecting all rows, all columns and the page number 0, 1, 2 or 3, assigning these to the individual variables r, g, b and alpha respectively (lines 14-17). The function zeros can be used with the array dimensions (line 12) to create an empty array (line 19). Page 3 of this empty array can be assigned the alpha values (line 20). The empty array can be copied to make a new array r2 (line 22), which can be modified by adding only the red values to page 0 (line 23). This r2 now contains only the red values (page 0) and the alpha values (page 3). The green values (page 1) and blue values (page 2) are left as zeros. The procedure can be repeated for the green and blue channels. These can be plotted as subplots alongside img1.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create an empty array where rgb=0,a=alpha
im1empty=np.zeros([rows,cols,pages])
im1empty[:,:,3]=alpha
# Create an array with r=r, g=0, b=0, a=1
r2=np.copy(im1empty)
r2[:,:,0]=r2[:,:,0]+r
# Create an array with r=0, g=g, b=0, a=1
g2=np.copy(im1empty)
g2[:,:,1]=g2[:,:,1]+g
# Create an array with r=0, g=0, b=b, a=1
b2=np.copy(im1empty)
b2[:,:,2]=b2[:,:,2]+b
# Create a Figure
plt.figure(11)
# Plot img1 as subplot 1
plt.subplot(2,2,1)
plt.imshow(img1)
# Plot Red only as subplot 2
plt.subplot(2,2,2)
plt.imshow(r2)
# Plot Green only as subplot 3
plt.subplot(2,2,3)
plt.imshow(g2)
# Plot Blue only as subplot 4
plt.subplot(2,2,4)
plt.imshow(b2)
# Save Figure
plt.savefig('fig11')
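The copy-and-fill steps above repeat for each channel, so they can be gathered into one helper. This is a sketch rather than part of the original script: keep_channels is a name invented here, and a two-pixel test image stands in for img1 so that the block runs without the file.

```python
import numpy as np

def keep_channels(img, channels):
    """Return a copy of an RGBA image with only the listed colour pages kept."""
    out = np.zeros_like(img)
    out[:, :, 3] = img[:, :, 3]        # always keep the alpha page
    for c in channels:
        out[:, :, c] = img[:, :, c]    # copy the requested colour page
    return out

# A 1x2 test image: one sky-blue pixel and one white pixel
img = np.array([[[0.25, 0.5, 0.75, 1.0],
                 [1.0, 1.0, 1.0, 1.0]]], dtype=np.float32)

r2 = keep_channels(img, [0])   # red only
g2 = keep_channels(img, [1])   # green only
b2 = keep_channels(img, [2])   # blue only
print(r2[0, 1], g2[0, 1], b2[0, 1])
```

Passing two page numbers, for example keep_channels(img, [0, 2]), keeps two primary colours at once.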

Looking at Secondary Channels

Now that we have looked at the image in terms of its primary colours we can also look at it in terms of its secondary colours. Recall the secondary colours are made up of two primary colours:

Secondary Colour	Primary Colour 1	Primary Colour 2
magenta	red	blue
cyan	green	blue
yellow	red	green

We can then plot the primary colours alongside the secondary colours using gridspec to orientate the subplots with the secondary colours around those of the primary colours.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create an empty array where rgb=0,a=1
im1empty=np.zeros([rows,cols,pages])
im1empty[:,:,3]=alpha
# Create an array with r=r, g=0, b=0, a=1
r2=np.copy(im1empty)
r2[:,:,0]=r2[:,:,0]+r
# Create an array with r=0, g=g, b=0, a=1 
g2=np.copy(im1empty)
g2[:,:,1]=g2[:,:,1]+g
# Create an array with r=0, g=0, b=b, a=1
b2=np.copy(im1empty)
b2[:,:,2]=b2[:,:,2]+b
# Combine Red and Blue
rb2=np.copy(im1empty)
rb2[:,:,0]=rb2[:,:,0]+r
rb2[:,:,2]=rb2[:,:,2]+b
# Combine Green and Blue
gb2=np.copy(im1empty)
gb2[:,:,1]=gb2[:,:,1]+g
gb2[:,:,2]=gb2[:,:,2]+b
# Combine Red and Green
rg2=np.copy(im1empty)
rg2[:,:,0]=rg2[:,:,0]+r
rg2[:,:,1]=rg2[:,:,1]+g
# Plot Primary and Secondary Colours
plt.figure(12)
grid=plt.GridSpec(6, 6, wspace=0.1, hspace=0.1)
# Plot Red only
plt.subplot(grid[0:2, 2:4])
plt.imshow(r2)
plt.axis('off')
# Plot Green only
plt.subplot(grid[2:4, 0:2])
plt.imshow(g2)
plt.axis('off')
# Plot Blue only
plt.subplot(grid[2:4, 2:4])
plt.imshow(b2)
plt.axis('off')
# Plot Red and Blue
plt.subplot(grid[1:3, 4:])
plt.imshow(rb2)
plt.axis('off')
# Plot Green and Blue
plt.subplot(grid[4:, 1:3])
plt.imshow(gb2)
plt.axis('off')
# Plot Red and Green
plt.subplot(grid[0:2, 0:2])
plt.imshow(rg2)
plt.axis('off')
plt.savefig('fig12')

Enhancing or Reducing One of the Channels

Now that we understand the principles behind an image file, we can numerically perform some basic image editing. To recap, the image is a 3D array which has rows, columns and pages, where the 0th page is a matrix of numeric floats for the red channel, the 1st page is a matrix of numeric floats for the green channel, the 2nd page is a matrix of numeric floats for the blue channel and the 3rd page is a matrix of numeric floats for the alpha channel. These floats are normalised between 0 and 1.

Let us create a 2x red filter where we double the intensity of the red channel (line 31-33) and then set any floats greater than the maximum value of 1 to equal 1 (line 35). We can then create a new image (line 37-39) and plot these as subplots.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create an empty array where rgb=0,a=1
im1empty=np.zeros([rows,cols,pages])
im1empty[:,:,3]=alpha
# Create an array with r=r, g=0, b=0, a=1 
r2=np.copy(im1empty)
r2[:,:,0]=r2[:,:,0]+r
# Create an array with r=0, g=g, b=0, a=1 
g2=np.copy(im1empty)
g2[:,:,1]=g2[:,:,1]+g
# Create an array with r=0, g=0, b=b, a=1 
b2=np.copy(im1empty)
b2[:,:,2]=b2[:,:,2]+b
# Enhance red filter 2x
r3=np.copy(r2)
# Double the value of red
r3[:,:,0]=2*r3[:,:,0]
# Ensure each value is 1 or below
r3[r3>1]=1
# Create a new image
img8=np.copy(r3)
img8[:,:,1]=img8[:,:,1]+g
img8[:,:,2]=img8[:,:,2]+b
# Plot old and new image and red channel as subplots
plt.figure(13)
plt.subplot(2,2,1)
plt.imshow(img1)
plt.subplot(2,2,2)
plt.imshow(r2)
plt.subplot(2,2,3)
plt.imshow(img8)
plt.subplot(2,2,4)
plt.imshow(r3)
plt.savefig('fig13')

Now let's rewrite this as a for loop and look at the influence of a 2x, 4x, 8x, 0.5x and 0.1x filter.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create an empty array where rgb=0,a=1
im1empty=np.zeros([rows,cols,pages])
im1empty[:,:,3]=alpha
# Create an array with r=r, g=0, b=0, a=1 
r2=np.copy(im1empty)
r2[:,:,0]=r2[:,:,0]+r
# Create an array with r=0, g=g, b=0, a=1
g2=np.copy(im1empty)
g2[:,:,1]=g2[:,:,1]+g
# Create an array with r=0, g=0, b=b, a=1 
b2=np.copy(im1empty)
b2[:,:,2]=b2[:,:,2]+b

red_filter=[2,4,8,0.5,0.1]
index=np.arange(start=0,stop=len(red_filter),step=1)
for i in index:
    # Multiply Filter by Enhancement Factor
    rnew=np.copy(r2)
    rnew[:,:,0]=red_filter[i]*rnew[:,:,0]
    # Ensure each value is 1 or below
    rnew[rnew>1]=1
    # Create a new image
    img=np.copy(rnew)
    img[:,:,1]=img[:,:,1]+g
    img[:,:,2]=img[:,:,2]+b
    # Plot old and new image and red channel as subplots
    # Want to start at figure 13
    plt.figure(i+13)
    plt.subplot(2,2,1)
    plt.imshow(img1)
    plt.subplot(2,2,2)
    plt.imshow(r2)
    plt.subplot(2,2,3)
    plt.imshow(img)
    plt.subplot(2,2,4)
    plt.imshow(rnew)
    # Save
    figname='fig'+str(i+13)
    plt.savefig(figname)
    imgname='LondonPNG_r_x'+str(red_filter[i])+'.png'
    plt.imsave(imgname,img)

As we can see, with the 2x red filter the red channel is enhanced and the picture has a red tinge.

With the 4x red filter, we see that many of the pixels in the red channel get saturated and the picture has a stronger red tinge.

With the 8x red filter, the red channel becomes hard to resolve as most of it is saturated. Once again the red tinge is stronger.

With the 0.5x red filter, we see the intensity in the red channel is a lot lower. This gives a cyan tinge as blue and green combine to make cyan.

With a 0.1x red filter, we see the intensity in the red channel is very low and is very hard to resolve above the background. The image is dominated by the blue and green channels and has a stronger cyan tinge.

As practice you can experiment with the other two channels and perhaps create a different custom filter for the three channels.
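As a starting point for that practice, the multiply-and-clip step can be written as a function that works on any channel. This is a hedged sketch, not the script used above: scale_channel is a made-up name, np.clip replaces the boolean-mask assignment, and a two-pixel array stands in for img1.

```python
import numpy as np

def scale_channel(img, channel, factor):
    """Multiply one colour page by factor and clip the result to 0-1."""
    out = img.astype(np.float64)   # astype returns a new array by default
    out[:, :, channel] = np.clip(out[:, :, channel] * factor, 0.0, 1.0)
    return out

img = np.array([[[0.25, 0.5, 0.75, 1.0],
                 [0.75, 0.5, 0.25, 1.0]]])

red_x2 = scale_channel(img, 0, 2)      # red doubled, second pixel saturates
red_x05 = scale_channel(img, 0, 0.5)   # red halved
print(red_x2[:, :, 0])
print(red_x05[:, :, 0])
```

Changing the channel argument to 1 or 2 applies the same filter to the green or blue page.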

Converting to Greyscale

The data shown so far has been colour data, which has four channels (rgba). Greyscale data, on the other hand, has only a single channel. The data can be collapsed into a single channel using the average value of the three colour channels:

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Even Ratio)
greyscale=r/3+g/3+b/3
plt.figure(18)
plt.imshow(greyscale)
plt.savefig('fig18')

The variable greyscale created is a matrix of 720 rows by 960 columns. When plotted however a colourmap is applied by default giving it 'false colour'.

It was calculated using ratios of 1/3 for each channel. However, to compensate for our eyes being more sensitive to green than to red or blue, a weighting factor may be applied to each channel (line 19).

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Ratio Eye Skewed to Sensitivity)
greyscale2=0.2989*r+0.5870*g+0.1140*b
greyscale2[greyscale2>1]=1
plt.figure(19)
plt.imshow(greyscale2)
plt.savefig('fig19')
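The effect of the two sets of ratios is easiest to see on pure colours. The sketch below is an illustration only: to_greyscale is a name invented here, and a row of three pure-colour pixels stands in for img1.

```python
import numpy as np

def to_greyscale(img, weights=(0.2989, 0.5870, 0.1140)):
    """Collapse the r, g and b pages into one page using the given weights."""
    wr, wg, wb = weights
    grey = wr * img[:, :, 0] + wg * img[:, :, 1] + wb * img[:, :, 2]
    return np.clip(grey, 0.0, 1.0)

# Three pure-colour pixels: red, green and blue
img = np.array([[[1.0, 0.0, 0.0, 1.0],
                 [0.0, 1.0, 0.0, 1.0],
                 [0.0, 0.0, 1.0, 1.0]]])

even = to_greyscale(img, weights=(1/3, 1/3, 1/3))
weighted = to_greyscale(img)
print(even)
print(weighted)
```

With even ratios the three pixels share one grey level, whereas with the weighted ratios the green pixel comes out brightest and the blue pixel darkest.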

The default colourmap is viridis. We can add a colorbar to see how the colours correspond to the value of each pixel.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Ratio Eye Skewed to Sensitivity)
greyscale2=0.2989*r+0.5870*g+0.1140*b
greyscale2[greyscale2>1]=1
plt.figure(20)
plt.imshow(greyscale2)
plt.colorbar()
plt.savefig('fig20')

Colourmaps for Greyscale Data

We can change this to other colourmaps, for example bone, jet and hot which are commonly used with grey scale data.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Ratio Eye Skewed to Sensitivity)
greyscale2=0.2989*r+0.5870*g+0.1140*b
greyscale2[greyscale2>1]=1
plt.figure(21)
plt.imshow(greyscale2,cmap='bone')
plt.colorbar()
plt.savefig('fig21')
plt.figure(22)
plt.imshow(greyscale2,cmap='jet')
plt.colorbar()
plt.savefig('fig22')
plt.figure(23)
plt.imshow(greyscale2,cmap='hot')
plt.colorbar()
plt.savefig('fig23')

Enhancing Greyscale Data with Colourmap

Once again we can attempt to enhance the brightness using a multiplication factor and cut off any data which exceeds 1.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Ratio Eye Skewed to Sensitivity)
greyscale2=0.2989*r+0.5870*g+0.1140*b
greyscale2[greyscale2>1]=1
# Plot unmodified data
plt.figure(23)
plt.imshow(greyscale2,cmap='hot')
plt.colorbar()
plt.savefig('fig23')

# Hot Colourmap - Data 2x enhancement
greyscale3=2*greyscale2
greyscale3[greyscale3>1]=1
plt.figure(24)
plt.imshow(greyscale3,cmap='hot')
plt.colorbar()
plt.savefig('fig24')

# Hot Colourmap - Data 4x enhancement
greyscale4=4*greyscale2
greyscale4[greyscale4>1]=1
plt.figure(25)
plt.imshow(greyscale4,cmap='hot')
plt.colorbar()
plt.savefig('fig25')

Adding Noise

This image was taken in daylight and as a consequence a large number of photons were available, leading to a very good signal to noise ratio. In many scientific applications, for instance microscopy, the signal to noise ratio may be poorer, and we may mimic this case by applying a level of random noise to our image.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Ratio Eye Skewed to Sensitivity)
greyscale2=0.2989*r+0.5870*g+0.1140*b
greyscale2[greyscale2>1]=1
# Plot unmodified data
plt.figure(23)
plt.imshow(greyscale2,cmap='hot')
plt.savefig('fig23')

# Create Random Noise
noise=1/10*np.random.rand(rows,cols)
greyscale5=greyscale2+noise
greyscale5[greyscale5>1]=1
# Hot Colourmap - Introduce Random Noise
plt.figure(26)
plt.imshow(greyscale5,cmap='hot')
plt.savefig('fig26')

We can modify the code and look at the use of a for loop to look at the image when the signal is 1/4, 1/8, 1/16 and 1/32 of the original value and compare this with the original.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Ratio Eye Skewed to Sensitivity)
greyscale2=0.2989*r+0.5870*g+0.1140*b
greyscale2[greyscale2>1]=1
# Plot unmodified data
plt.figure(23)
plt.imshow(greyscale2,cmap='hot')
plt.savefig('fig23')
# Create Random Noise
noise=1/10*np.random.rand(rows,cols)
# Signal Ratio from Original
signalratio=np.array([1/4,1/8,1/16,1/32])
index=np.arange(start=0,stop=len(signalratio),step=1)
for i in index:
    # new data - attenuated signal, constant noise
    greyscaletemp=signalratio[i]*greyscale2+noise
    # Set any saturated pixels to maximum value
    greyscaletemp[greyscaletemp>1]=1
    # Create figure as hot colourmap 
    plt.figure(26+i)
    plt.imshow(greyscaletemp,cmap='hot')
    figname='fig'+str(26+i)
    plt.savefig(figname)
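The loop above can be put on a numerical footing by estimating a signal to noise ratio for each attenuation. The sketch below rests on assumptions that are not in the original script: a flat array of 0.5 stands in for greyscale2, the ratio is taken as mean signal divided by the standard deviation of the noise, and the generator is seeded so that the run is repeatable.

```python
import numpy as np

rng = np.random.default_rng(0)             # seeded so the run is repeatable
signal = np.full((720, 960), 0.5)          # flat stand-in for greyscale2
noise = 1/10 * rng.random((720, 960))      # uniform noise between 0 and 0.1

signalratio = [1, 1/4, 1/8, 1/16, 1/32]
snrs = []
for ratio in signalratio:
    # mean signal level divided by the standard deviation of the noise
    snr = (ratio * signal).mean() / noise.std()
    snrs.append(snr)
    print('ratio=', ratio, 'snr=', np.round(snr, 2))
```

Because the noise is held constant, each halving of the signal halves this estimate.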

As we artificially decrease the intensity of the image with respect to the noise, we can see that the image becomes harder and harder to view due to the poor signal to noise ratio:

Adding a Bright Pixel

Let's now take one of our noisy images, figure 28, find the maximum pixel value and then introduce a pixel of maximum brightness (along with a second pixel of half that brightness). This emulates, for instance, an over-sensitive pixel in a camera.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create Greyscale Image (Ratio Eye Skewed to Sensitivity)
greyscale2=0.2989*r+0.5870*g+0.1140*b
greyscale2[greyscale2>1]=1

# Create Random Noise
noise=1/10*np.random.rand(rows,cols)

# Hot Colourmap - Data 16x decrease
greyscale6=(1/16)*greyscale2+noise
greyscale6[greyscale6>1]=1
plt.figure(28)
plt.imshow(greyscale6,cmap='hot')
plt.savefig('fig28')
plt.colorbar()

# Copy greyscale6 to greyscale6a, find its max and introduce a bright pixel
greyscale6a=np.copy(greyscale6)
greyscale6max=np.max(greyscale6[:])
print(greyscale6max)
greyscale6a[0,0]=1
greyscale6a[0,1]=0.5

# Hot Colourmap - Data 16x decrease + bright pixel
plt.figure(29)
plt.imshow(greyscale6a,cmap='hot')
plt.savefig('fig29')
plt.colorbar()

0.1624901883818796

As you can see this new image is now very hard to see…

Applying Filters to Improve Image Quality

We have this very poor quality image, can we make it any better?

Let's try an upper threshold:

# Script Adding a Bright Pixel
.
.
.
greyscale6b=np.copy(greyscale6a)
greyscale6b[greyscale6b>0.8]=0
plt.figure(30)
plt.imshow(greyscale6b,cmap='hot')
plt.colorbar()

This has removed the brightest artificial pixel and has restored some image quality.

Let's try lowering the upper threshold:

# Script Adding a Bright Pixel
.
.
.
greyscale6b=np.copy(greyscale6a)
greyscale6b[greyscale6b>0.4]=0
plt.figure(31)
plt.imshow(greyscale6b,cmap='hot')
plt.colorbar()

This has also removed the dimmer artificial pixel (value 0.5) and has restored some more image quality.

We can try adding in a lower threshold also.

# Script Adding a Bright Pixel
.
.
.
greyscale6b=np.copy(greyscale6a)
greyscale6b[greyscale6b>0.4]=0
greyscale6b[greyscale6b<0.1]=0
plt.figure(32)
plt.imshow(greyscale6b,cmap='hot')
plt.colorbar()

Now we can attempt to multiply the data by a constant factor and once again threshold any value out higher than 1.

# Script Adding a Bright Pixel
.
.
.
greyscale6b=np.copy(greyscale6a)
greyscale6b[greyscale6b>0.4]=0
greyscale6b[greyscale6b<0.1]=0
greyscale6b=greyscale6b*4
greyscale6b[greyscale6b>1]=1
plt.figure(33)
plt.imshow(greyscale6b,cmap='hot')
plt.colorbar()
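The thresholding steps above can be collected into one function. This is a sketch under stated assumptions: band_threshold is a name made up here, np.where and np.clip replace the boolean-mask assignments, and a small hand-written array stands in for greyscale6a.

```python
import numpy as np

def band_threshold(data, lower, upper, gain=1.0):
    """Zero values outside lower..upper, then apply a gain and clip to 0-1."""
    out = np.where((data < lower) | (data > upper), 0.0, data)
    return np.clip(out * gain, 0.0, 1.0)

# Small stand-in for greyscale6a: background values plus two bright pixels
data = np.array([[1.0, 0.5, 0.12],
                 [0.05, 0.15, 0.3]])
result = band_threshold(data, lower=0.1, upper=0.4, gain=4)
print(result)
```

Unlike the in-place assignments in the script, the function leaves its input untouched, so different thresholds can be tried on the same data.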

Image processing by thresholding alone is quite limited. We can instead use other functions to work on the data, such as a median filter:

# Script Adding a Bright Pixel
.
.
.
plt.figure(34)
greyscale6c=np.copy(greyscale6)
greyscale6c=ndimage.median_filter(greyscale6c, size=10)
plt.imshow(greyscale6c,cmap='hot')
plt.colorbar()

Or a Gaussian filter with a standard deviation (sigma) of 3 pixels, which replaces each pixel with a Gaussian-weighted average of its neighbours and so smooths the image.

# Script Adding a Bright Pixel
.
.
.
plt.figure(35)
greyscale6d=np.copy(greyscale6)
greyscale6d=ndimage.gaussian_filter(greyscale6d, 3)
plt.imshow(greyscale6d,cmap='hot')
plt.colorbar()
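The difference between the two filters shows up clearly on a single bright pixel. A minimal sketch, assuming a flat background of 0.1 in place of the noisy image and a 3 by 3 median window (smaller than the size=10 used above, chosen here to keep the example easy to follow):

```python
import numpy as np
import scipy.ndimage as ndimage

# Flat background of 0.1 with a single bright pixel in the middle
data = np.full((21, 21), 0.1)
data[10, 10] = 1.0

median = ndimage.median_filter(data, size=3)     # median of each 3x3 window
gauss = ndimage.gaussian_filter(data, sigma=3)   # Gaussian kernel, sigma = 3

print('median max=', median.max())
print('gaussian max=', gauss.max())
```

The median filter discards the outlier completely, because one bright value can never be the median of nine. The Gaussian filter spreads the bright pixel over its neighbours, so a small bump remains.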

Inversion

Let's return to the original image and look at inverting it. We know that the maximum value of each pixel in each channel is 1. We can invert the data by taking the current data away from 1.

# Prerequisites
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import pandas as pd
import scipy.ndimage as ndimage
# Close All Plots
plt.close('all')
# Load img1 from file
img1=mpimg.imread('LondonPNG.png')
# Dimensions of img1
[rows,cols,pages]=np.shape(img1)
# Split image "pages" into separate r g b a channels
r=img1[:,:,0]
g=img1[:,:,1]
b=img1[:,:,2]
alpha=img1[:,:,3]
# Create an empty array where rgb=0,a=alpha
im1empty=np.zeros([rows,cols,pages])
im1empty[:,:,3]=alpha
# Create an array with r=1-r, g=0, b=0, a=1
rinv=np.copy(im1empty)
rinv[:,:,0]=1-r
# Create an array with r=0, g=1-g, b=0, a=1
ginv=np.copy(im1empty)
ginv[:,:,1]=1-g
# Create an array with r=0, g=0, b=1-b, a=1
binv=np.copy(im1empty)
binv[:,:,2]=1-b
# Combine to make img7 - the inverted image
img7=np.copy(im1empty)
img7[:,:,0]=1-r
img7[:,:,1]=1-g
img7[:,:,2]=1-b
# Create a Figure
plt.figure(35)
# Plot img7 as subplot 1
plt.subplot(2,2,1)
plt.imshow(img7)
# Plot Red Inverse only as subplot 2
plt.subplot(2,2,2)
plt.imshow(rinv)
# Plot Green Inverse only as subplot 3
plt.subplot(2,2,3)
plt.imshow(ginv)
# Plot Blue Inverse only as subplot 4
plt.subplot(2,2,4)
plt.imshow(binv)
# Save Figure
plt.savefig('fig35')
mpimg.imsave('LondonPNG_inv.png',img7)
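The inversion can also be written as a single slice over the three colour pages. The sketch below is an illustration rather than the script above: invert is a made-up name and a one-pixel array stands in for img1.

```python
import numpy as np

def invert(img):
    """Invert the r, g and b pages of an RGBA image; alpha is left alone."""
    out = np.copy(img)
    out[:, :, :3] = 1 - img[:, :, :3]
    return out

img = np.array([[[0.25, 0.5, 0.75, 1.0]]])
inverted = invert(img)
print(inverted)
# Inverting twice should give back the original image
print(np.allclose(invert(inverted), img))
```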