2.6. Image manipulation and processing using NumPy and SciPy

Authors: Emmanuelle Gouillart, Gaël Varoquaux

This section addresses basic image manipulation and processing using the core scientific modules NumPy and SciPy. Some of the operations covered by this tutorial are also useful for processing other kinds of multidimensional arrays, not only images. In particular, the submodule scipy.ndimage provides functions operating on n-dimensional NumPy arrays.

See also

For more advanced image processing and image-specific routines, see the tutorial Scikit-image: image processing, dedicated to the skimage module.

Image = 2-D numerical array

(or 3-D: CT, MRI, 2D + time; 4-D, …)

Here, image == NumPy array (np.ndarray)

Tools used in this tutorial:

numpy: basic array manipulation
scipy: scipy.ndimage submodule dedicated to image processing (n-dimensional images). See the documentation:
```
>>> from scipy import ndimage
```

Common tasks in image processing:

Input/Output, displaying images
Basic manipulations: cropping, flipping, rotating, …
Image filtering: denoising, sharpening
Image segmentation: labeling pixels corresponding to different objects
Classification
Feature extraction
Registration
…

Chapter contents

Opening and writing to image files
Displaying images
Basic manipulations
- Statistical information
- Geometrical transformations
Image filtering
Feature extraction
- Edge detection
- Segmentation
Measuring objects properties: ndimage.measurements
Full code examples
Examples for the image processing chapter

2.6.1. Opening and writing to image files

Writing an array to a file:

from scipy import misc
import imageio
f = misc.face()
imageio.imsave('face.png', f)  # uses the Image module (PIL)
import matplotlib.pyplot as plt
plt.imshow(f)
plt.show()

Creating a numpy array from an image file:

>>> from scipy import misc
>>> import imageio
>>> face = misc.face()
>>> imageio.imsave('face.png', face)  # First we need to create the PNG file
>>> face = imageio.imread('face.png')
>>> type(face)
<class 'imageio.core.util.Array'>
>>> face.shape, face.dtype
((768, 1024, 3), dtype('uint8'))

The dtype is uint8 for 8-bit images (values from 0 to 255).
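One practical consequence of the 8-bit dtype is that arithmetic on such an image wraps around modulo 256 instead of saturating. The following is a minimal sketch on a small hand-made array (the values are illustrative and are not taken from the face image); it contrasts the wrap-around with a clipped computation done in a wider integer type.

```python
import numpy as np

# Stand-in for a tiny 8-bit grayscale image (illustrative values)
img = np.array([[0, 100], [200, 255]], dtype=np.uint8)

# uint8 + uint8 stays uint8, so results above 255 wrap around modulo 256
wrapped = img + np.uint8(100)

# Widening first, then clipping, gives a saturating "brighten" instead
brighter = np.clip(img.astype(np.int16) + 100, 0, 255).astype(np.uint8)

print(wrapped)
print(brighter)
```

Converting to a wider or floating-point type before doing arithmetic is a common way to avoid this kind of silent overflow.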

Opening raw files (camera, 3-D images)

>>> import numpy as np
>>> face.tofile('face.raw')  # Create raw file
>>> face_from_raw = np.fromfile('face.raw', dtype=np.uint8)
>>> face_from_raw.shape
(2359296,)
>>> face_from_raw.shape = (768, 1024, 3)

You need to know the shape and dtype of the image in advance: a raw file contains only the data bytes, with no information on how to split them into elements.
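To illustrate why both pieces of information are needed, here is a small sketch that writes a synthetic array to a temporary raw file and reads it back. The array, its shape and the file name are made up for the example; reading with the wrong dtype does not fail, it just groups the bytes differently.

```python
import os
import tempfile

import numpy as np

# Small synthetic RGB image standing in for the face image
img = np.arange(4 * 5 * 3, dtype=np.uint8).reshape((4, 5, 3))

with tempfile.TemporaryDirectory() as tmpdir:
    path = os.path.join(tmpdir, "img.raw")
    img.tofile(path)  # writes only the bytes: no shape, no dtype

    flat = np.fromfile(path, dtype=np.uint8)    # correct dtype, flat array
    restored = flat.reshape((4, 5, 3))          # shape supplied by hand
    wrong = np.fromfile(path, dtype=np.uint16)  # wrong dtype: bytes read in pairs

print(flat.shape, wrong.shape)
print(np.array_equal(restored, img))
```

With the wrong dtype the element count changes (here it is halved), which is often the first visible symptom of a mismatched raw-file description.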

For large data, use np.memmap for memory mapping:

>>> face_memmap = np.memmap('face.raw', dtype=np.uint8, shape=(768, 1024, 3))

(the data are read from the file on access, rather than being loaded into memory all at once)
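As a sketch of how a memory-mapped image is typically used, the example below maps a synthetic raw file read-only and copies just a small crop into an ordinary array. The shape, crop indices and file name are assumptions chosen for illustration.

```python
import os
import tempfile

import numpy as np

shape = (64, 48, 3)
img = np.random.default_rng(0).integers(0, 256, size=shape, dtype=np.uint8)

with tempfile.TemporaryDirectory() as tmpdir:
    path = os.path.join(tmpdir, "img.raw")
    img.tofile(path)

    # mode="r" maps the file read-only; the OS pages data in when it is accessed
    mm = np.memmap(path, dtype=np.uint8, mode="r", shape=shape)
    crop = np.array(mm[10:20, 5:15, :])  # copy only this region into memory
    del mm  # drop the mapping before the temporary file is removed

print(crop.shape)
```

Wrapping the slice in `np.array` makes an independent in-memory copy, so `crop` stays valid after the mapping is released.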

Working on a list of image files

>>> for i in range(10):
...     im = np.random.randint(0, 256, 10000, dtype=np.uint8).reshape((100, 100))
...     imageio.imsave('random_%02d.png' % i, im)
>>> from glob import glob
>>> filelist = glob('random*.png')
>>> filelist.sort()
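The zero-padded `%02d` in the file names matters because the list is sorted as text, not as numbers. The sketch below uses empty placeholder files with hypothetical names (`padded_*` and `plain_*`) in a temporary directory to show the difference; only the names matter here, not the file contents.

```python
import tempfile
from glob import glob
from pathlib import Path

with tempfile.TemporaryDirectory() as tmpdir:
    for i in (1, 2, 10):
        # Empty placeholder files; only the names matter for this sketch
        Path(tmpdir, "padded_%02d.png" % i).touch()
        Path(tmpdir, "plain_%d.png" % i).touch()

    padded = sorted(Path(p).name for p in glob(str(Path(tmpdir, "padded_*.png"))))
    plain = sorted(Path(p).name for p in glob(str(Path(tmpdir, "plain_*.png"))))

print(padded)  # numeric order is preserved
print(plain)   # "plain_10.png" sorts before "plain_2.png"
```

Note also that `glob` does not guarantee any particular order, which is why the explicit sort is needed in the first place.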

2.6.2. Displaying images

Use matplotlib and imshow to display an image inside a matplotlib figure:

>>> f = misc.face(gray=True)  # retrieve a grayscale image
>>> import matplotlib.pyplot as plt
>>> plt.imshow(f, cmap=plt.cm.gray)
<matplotlib.image.AxesImage object at 0x...>

Increase contrast by setting min and max values:

>>> plt.imshow(f, cmap=plt.cm.gray, vmin=30, vmax=200)
<matplotlib.image.AxesImage object at 0x...>
>>> # Remove axes and ticks
>>> plt.axis('off')
(-0.5, 1023.5, 767.5, -0.5)
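As a rough model of what `vmin` and `vmax` do to the displayed gray levels, the values between them are stretched linearly over the full colormap, and everything outside is shown as the darkest or brightest color. The NumPy-only sketch below mimics that mapping; `rescale` is a hypothetical helper written for this illustration, not a Matplotlib function, and it is not Matplotlib's actual implementation.

```python
import numpy as np


def rescale(image, vmin, vmax):
    """Map [vmin, vmax] linearly onto [0, 1], saturating outside that range."""
    scaled = (image.astype(float) - vmin) / (vmax - vmin)
    return np.clip(scaled, 0.0, 1.0)


# Tiny stand-in image with values below, at, inside and above the range
f_small = np.array([[0, 30], [115, 255]], dtype=np.uint8)
out = rescale(f_small, vmin=30, vmax=200)
print(out)
```

Narrowing the interval between `vmin` and `vmax` therefore increases the displayed contrast of the mid-range values at the cost of saturating the extremes.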

Draw contour lines:

>>> plt.contour(f, [50, 200])
<matplotlib.contour.QuadContourSet ...>