Automated species identification

Automated species identification is a method of making the expertise of taxonomists available to ecologists, parataxonomists and others via digital technology and artificial intelligence.

Despite much preliminary work in the 1950s and '60s, progress in designing and implementing practical systems for fully automated object biological identification has proven frustratingly slow.

The authors found that "a small but growing number of studies sought to develop automated species identification systems based on morphological characters".

These studies propose a wealth of computer vision approaches, i.e., features reducing the high-dimensionality of the pixel-based image data while preserving the characteristic information as well as classification methods.

By training classifiers on a few hundred images it correctly identified fruit-flies, and can be used for continuous monitoring aimed at detecting species invasion or pest outbreak.

Primarily, using e-traps provide a standardized setting, which means that even though they are deployed in different countries and regions, the visual variability, in terms of size view angle and illumination are controlled.

"This expertise deficiency cuts as deeply into those commercial industries that rely on accurate identifications (e.g., agriculture, biostratigraphy) as it does into a wide range of pure and applied research programmes (e.g., conservation, biological oceanography, climatology, ecology).

Peer review only weeds out the most obvious errors of commission or omission in this area, and then only when an author provides adequate representations (e.g., illustrations, recordings, and gene sequences) of the specimens in question.

Properly designed, flexible, and robust, automated identification systems, organized around distributed computing architectures and referenced to authoritatively identified collections of training set data (e.g., images, and gene sequences) can, in principle, provide all systematists with access to the electronic data archives and the necessary analytic tools to handle routine identifications of common taxa.

Properly designed systems can also recognize when their algorithms cannot make a reliable identification and refer that image to a specialist (whose address can be accessed from another database).

DFE - the graphical interface of the Daisy system. The image is the wing of a biting midge Culicoides sp., some species of which are vectors of Bluetongue . Others may also be vectors of Schmallenberg virus an emerging disease of livestock, especially sheep.
(Credit: Mark A. O'Neill )
SDS protein gel images of sphinx moth caterpillars. It can be used in a similar way to DNA fingerprinting