## OpenCV - Detect skew angle

### OpenCV

Lately I have been interested in the library OpenCV. This library, initially developed by Intel, is dedicated to computer vision applications and is known to be one of the fastest (and maybe the fastest ?) library available for real-time computer vision.

And yet, if you think the library is not fast enough you can install Intel Integrated Performance Primitives (IPP) on your system and benefit from heavily optimized routines. If you have several cores available on your computer, you can also install Intel Threading Building Blocks (TBB). And if it's still not fast enough, you can run some operations on your GPU using CUDA !

All these features looked really exciting, so I decided to experiment a little bit with OpenCV in order to test its interface and expressive power. In this article I present a simple and very short project that detects the skew angle of a digitized document. My goal is not to present an error-proof program but rather to show the ease of use of OpenCV.

### Skew angle

The detection of the skew angle of a document is a common preprocessing step in document analysis.

The document might have been slightly rotated during its digitization. It is therefore necessary to compute the skew angle and to rotate the text before going further in the processing pipeline (i.e. words/letters segmentation and later OCR).

We will work on 5 different images of the same short (but famous) text.

The naming convention for those images is rather simple, the first letter stands for the sign of the angle (p for plus, m for minus) and the following number is the value of the angle.
m8.jpg has therefore been rotated by an angle of -8 degrees.

We assume here that the noise introduced by digitization has been removed by a previous preprocessing stage. We also assume that the text has been isolated: no images, horizontal or vertical separators, etc.

### Implementation with OpenCV

First, let's declare a function compute_skew, it takes a path to an image as input and outputs the detected angle to standard output.
First we load the image and stores its size in a variable, very simple.

 void compute_skew(const char* filename) { // Load in grayscale. cv::Mat src = cv::imread(filename, 0); cv::Size size = src.size();

In image processing, objects are white and the background is black, we have the opposite, we need to invert the colors of our image:

 cv::bitwise_not(src, src);

And here is the result:

In order to compute the skew we must find straight lines in the text. In a text line, we have several letters side by side, lines should therefore be formed by finding long lines of white pixels in the image. Here is an example below:

Of course as characters have an height, we find several lines for each actual line from the text. By fine tuning the parameters used later or by using preprocessing we can decrease the number of lines.

So, how do we find lines in the image ? We use a powerful mathematical tool called the Hough transform. I won't dig into mathematical details, but the main idea of the Hough transform is to use a 2D accumulator in order to count how many times a given line has been found in the image, the whole image is scanned and by a voting system the "best" lines are identified.
We used a more efficient variant of the Standard Hough Transform (SHT) called the Probabilistic Hough Transform (PHT). In OpenCV the PHT is implemented under the name HoughLinesP.

In addition to the standard parameters of the Hough Transform, we have two additional parameters:

• minLineLength – The minimum line length. Line segments shorter than that will be rejected. This is a great tool in order to prune out small residual lines.
• maxLineGap – The maximum allowed gap between points on the same line to link them.
This could be interesting for a multi-columns text, for example we could choose to not link lines from different text columns.

Back to C++ now, in OpenCV the PHT stores the end points of the line whereas the SHT stores the line in polar coordinates (relative to the origin). We need a vector in order to store all the end points:

 std::vector lines;

We are ready for the Hough transform now:

 cv::HoughLinesP(src, lines, 1, CV_PI/180, 100, size.width / 2.f, 20);

We use a step size of 1 for $\rho$ and $\pi/180$ for $\theta$, the threshold (the minimum number of votes) is 100.
minLineLength is width/2, this is not an unreasonable assumption if the text is well isolated.
maxLineGap is 20, it seems a sound value for a gap.

In the remaining of the code we simply calculate the angle between each line and the horizontal line using the atan2 math function and we compute the mean angle of all the lines.
For debugging purposes we also draw all the lines in a new image called disp_lines and we display this image in a new window.

 cv::Mat disp_lines(size, CV_8UC1, cv::Scalar(0, 0, 0)); double angle = 0.; unsigned nb_lines = lines.size(); for (unsigned i = 0; i < nb_lines; ++i) { cv::line(disp_lines, cv::Point(lines[i][0], lines[i][1]), cv::Point(lines[i][2], lines[i][3]), cv::Scalar(255, 0 ,0)); angle += atan2((double)lines[i][3] - lines[i][1], (double)lines[i][2] - lines[i][0]); } angle /= nb_lines; // mean angle, in radians.   std::cout << "File " << filename << ": " << angle * 180 / CV_PI << std::endl;   cv::imshow(filename, disp_lines); cv::waitKey(0); cv::destroyWindow(filename); }

We just need a main function in order to call compute_skew on several images:

 const char* files[] = { "m8.jpg", "m20.jpg", "p3.jpg", "p16.jpg", "p24.jpg"};   int main() { unsigned nb_files = sizeof(files) / sizeof(const char*); for (unsigned i = 0; i < nb_files; ++i) compute_skew(files[i]); }

That's all, here are the skew angles we obtain for each image, we have a pretty good accuracy:

• Janelle

Hi!
The syntax of cv:: (cv::HoughLinesP, etc.) does not seem to work on my VS2008. However cv functions (cvHoughLinesP, etc.) work just fine. I have opencv 2.1. Have I installed opencv to VS wrong?

• Anonymous

Hello Janelle,
I think you have included the wrong headers in your file, so what you have is the C interface instead of the C++ interface.
For instance you may have added:
#include "cv.h"
#include "highgui.h"
...

But in order to use the C++ interface you need to use other header files, here you need:
#include
#include

#include

Is it working now ?

• Anonymous

Hello Janelle,
I think you have included the wrong headers in your file, so what you have is the C interface instead of the C++ interface.
For instance you may have added:
#include "cv.h"
#include "highgui.h"
...

But in order to use the C++ interface you need to use other header files, here you need:

 #include <opencv2/core/core.hpp> #include <opencv2/highgui/highgui.hpp> #include <opencv2/imgproc/imgproc.hpp>

Is it working now ?

• there are no compile errors but it is showing undefined reference to cv::imread() when i try to build and run the program.

• i felt that it is a problem in linking. plz help me. I couldn't get it. I am using code blocks compiler on windows.

• FelixAbecassis

Hi,
I've never used CodeBlocks but you should try to read and adapt this tutorial by Philipp Wagner: http://bytefish.de/blog/opencv_visual_studio_and_libfacerec
It looks like you have set your include path correctly but your library path is not set.
You should be able to adapt the above tutorial to solve your problem with CodeBlocks.

• k thanks. can you please elaborate the working of the parameters passed to cv::HoughLinesP . Actually I am working on rotating barcodes. so i need to know more about HoughLinesP. Any link to it is also ok.

• Pingback: propecia()

• Vhevhang Vhakhikhang

I tried to run the code but I keep getting the error

"opencv error bad flag parameter or structure field unrecognized or unsopported array type in unknown function, file ......srcopencvmodukescoresrcaaray.cpp, line 2482"

Can anyone help me with this??

I really have to get this working..

• danizavtz

thanks for posting this code, i based my solution on this code.
I wish to calculate the skew of roughlinesp(). Based on your example i could calculate the skew of each line. 🙂 Thank you very much!
It worked perfectly on Opencv 2.4.8

• Chamalee De Silva

It is really helpful. Thanks Felix..!!

• Atul Rai

My output looks as follows , can please help me i just followed the code you have given

• Paulo Miguel Almeida

If you're using the line.jpg provided on this article, you may need to convert it to grayscale and invert colors before executing the code above. So your image should look like that:

• Pingback: orientation correction before ocr | 我爱源码网()

• Pronoy Barua

How do we obtain text lines from the image? That is, how to get lines.jpg from inverted.jpg?

• elham

i work on text line segmentation(code with matlab). i need to convert code above to matlab and use matlab code. send matlab code above for me.thanks

• Robel Alazar

Please, I need the c# code the above sample

• Danilo Djordjevic