2008年7月20日星期日

License Plate Recognition Algorithms and Technology

License Plate Recognition Algorithms and Technology

Automatic license plate recognition has two essential technological issues:
· the quality of the license plate recognition software with its applied recognition algorithms, and
· the quality of the image acquisition technology, the camera and the illumination.

The very key factor is the license plate recognition software. The sophistication of the recognition software, the intelligence and quality of the applied license plate recognition algorithms, the mathematical knowledge and the years of experience behind determines the capabilities of the recognition software. The better the algorithms are, the highest the quality of the recognition software is:
· the highest recognition accuracy it has,
· the fastest processing speed it has,
· the most type of plates it can handle,
· the widest range of picture quality it can handle,
· the most tolerant against distortions of input data it is.

In early years of LPR available software were bound to specific countries. One software could read - for example - Spanish plates only, other could read plates from Hong Kong only, etc. This was not accidental: the geometrical structure of the plate as well as its syntax were essential parts of the plate reader software. Without the presumption of a fixed plate geometry (character ratios, character distribution, font type, plate colour, etc.) and a well defined syntax the algorithm may not even found the plate on the picture.


Plate geometry and basic syntax

The most advanced algorithms today read plates without such presumptions. For example, a good algorithm should read all plates from Europe with the same level of quality. There are indeed a wide variety of plate types in Europe:
· black (dark) characters on white (light) plate,
· white characters on black plates,
· one-row plates,
· two-row plates,
· plates with different character-size,
· latin and cyrillic fonts,
· plates with our without region's shield or special mark, etc.


Reading plates of different type is a measure of technology level

If a license plate recognition algorithm can not utilise such additional information like the prior knowledge of the plate structure or plate syntax, it looses a very helpful part of its input data. This loss results in reduction of maybe the most important quality measure, the plate recognition accuracy.

Without using additional information about the plate the remaining part of the recognition algorithm should be significantly better than it was before, when the additional information could be used. Otherwise it would not be possible to gain back the same recognition accuracy.

We believe that there are two key technological parts of a license plate recognition algorithm that basically determines its quality level:
· a robust, very high accuracy and intelligent optical character recognition technology, and
· a technology that allows intelligent structural analysis of complex higher structures.

The robust, very high accuracy Optical Character Recognition (OCR) technology is a very essential requirement.


The OCR task

To get better perception of the nature of recognition accuracy, consider the following example:

Assume the plates have an average of 7 (seven) characters as license plate number. If the overall plate recognition accuracy is required to be above 96%, than the recognition accuracy of the individual characters should be at least 99.5%. Out of 1000 characters not more than 5 could be misread/misrecognised:

(99.5%)7 = 0.9957 = 0.995 · 0.995 · 0.995 · 0.995 · 0.995 · 0.995 · 0.995 = 0.9655 = 96.5%

If someone speaks about 99% overall recognition accuracy, than the recognition rate of the individual characters has to be at least 99.85%.

But the above calculation is only a very simple estimation of the maximum acceptable OCR error rate: it is not the real error rate the OCR can have! The real OCR error rate has to be much lower than the one given by the above estimation, as there are several other parts of the entire algorithm than can make mistake. And the overall recognition accuracy is the multiplication of the accuracy of the individual (and independent) sub-algorithms.

For example, suppose that there are three additional sub-algorithms before the OCR:
· a plate localisation sub-algorithm, responsible for finding the plate on the picture, having 98.7% accuracy,
· a contrast/brightness normalisation sub-algorithm, responsible to equalise the plate picture, having 99.2% accuracy, and
· a character segmentation sub-algorithm, responsible for finding and cutting out the individual characters on the plates and pass them to the OCR, having 99.6% accuracy.
The OCR has a 99.5% recognition accuracy on the individual characters. The overall license plate recognition accuracy is then only 94.2%, as:

0.987 · 0.992 · 0.996 · 0.9957 = 94.2%

The image acquisition technology determines the average image quality the license plate recognition algorithm has to work on. Needless to say that the better the quality of the input images are, the better conditions the license plate recognition algorithm has, and thus the higher license plate recognition accuracy can be expected to be achieved.

What does good image quality mean?

In order to expect reasonable results from a plate recognition algorithm, the processed images should contain a plate
· with reasonable good spatial resolution,
· with reasonable good sharpness,
· with reasonable high contrast,
· under reasonable good lighting conditions,
· in a reasonable good position and angle of view.

Indeed, 'reasonable' is not an exact definition, still it has a well understandable meaning. Here are some problematic images:


Low spatial resolution (too small characters on the plate)

Blurred image

Low contrast

Overexposure



Bad lighting conditions (shadow and strong light)


High distortion

An image acquisition system is considered to be good if it provides a stable, balanced, reasonable good image quality under all of its working conditions. If an LPR system has to work outdoor 24h/day, 7days/week in Middle-Europe, than it has to handle quite a wide range of lighting and weather conditions. Under the chapter Technical and Quality Issues of License Plate Recognition we provide further technical details regarding image quality.

There are some interesting sites where you can find samples of plates all around the World:
· Automobile License Plate Collectors Association
· Eugen Winklharrer collector's page
· Alessandro Libanore collector's page
· PL8S the license plate collectors website

Keywords: license plate recognition algorithm, license plate recognition technology, recognition accuracy, license plate recognition accuracy, LPR technology, car number ocr, car plate ocr, license plate structure, license plate syntax, license plate localisation, license plate localization, character segmentation, contrast brightness normalisation, contrast brightness normalization, license plate recognition

没有评论:

发表评论

欢迎访问、交流!对本博客有何建议,请
来信告知!
本博内容来源于网络,如有不当或侵犯权益,请来信告知,将及时撤除!
如引用博客内容、论文,请注明原作者!

Google一下本博客

  • 《Getting Things Done》读书笔记 - 本文来自 inertial 原创投稿。 我第一次听说《Getting Things Done》这本书的时候误以为它和世面上的那些成功学书籍没什么区别,后来在不少书中看到了这个名字,也看见了很多人的推荐,由此产生了很大的兴趣。上个月正好有不少空闲,就抽时间把这本书读完了。 本来打算读英文原版,但是原版的生...
    5 年前
  • [原]Linux下编译使用boost库 - Boost库是一个可移植、提供源代码的C++库,作为标准库的后备,是C++标准化进程的开发引擎之一。 Boost库由C++标准委员会库工作组成员发起,其中有些内容有望成为下一代C++标准库内容。在C++社区中影响甚大,是不折不扣的“准”标准库。Boost由于其对跨平台的强调,对标准C++的强调,与...
    6 年前
  • [原]猎头、培训与咨询的价值(2)【补1】——北漂18年(93) - 【上期用手机写的,同时用语音输入转化成文字,错字较多,经好友霍师傅提醒本期重写,并增加一部分新内容】 简单谈下我对猎头、培训与咨询的看法。三样都干过,算是有些浅见。 猎头 简单的说就是人才中介。虽然在公司看来是可以直接解决现有企业问题的一个直接方法,但很多时候都不太管用。 猎头费一般是人才的一个月月...
    7 年前
  • OpenCV統計應用-Mahalanobis距離 - Mahalanobis距離是一個可以準確找出資料分布上面極端值(Outliers)的統計方法,使用線性迴歸的概念,也就是說他使用的是共變數矩陣以及該資料分布的平均數來找尋極端值的產生,而可以讓一群資料系統具有穩健性(Robust),去除不必要的雜訊訊息,這邊拿前面共變數矩陣的資料為例,並且新增了兩個點座標向量來做...
    15 年前
  • 努力推进模式识别实际产品的开发与应用 - Salu 无论是手写体识别、文档处理、人脸识别、基于内容的图片搜索、嵌入人工智能的搜索技术、虚拟网络社区、还是其它相关新科技下的信息整合领域,现在都在努力实用化。 前两年、即使现在还有很多人在抱怨说人脸的方法都不能用,但是就今年出现的和正在做的有关人脸识别实际应用的各种形式的产品可以说如雨后春笋。这是一个趋...
    16 年前