Welcome to LEAD Support Forum Login | Register | Faq  

    LEAD Support Forum
  Resource to find answers and post technical questions about LEAD products.
Search    
   

OCR Table Header
Started by bsuresh@ains-india.stph.net at 04-14-2008 16:27. Topic has 7 replies.

Print Search « Previous Thread Next Thread »
  04-14-2008, 16:27
bsuresh@ains-india.stph.net is not online. Last active: 10/22/2008 8:42:48 PM bsuresh@ains-india.stph.net

Top 25 Posts
Joined on 06-24-2003
Posts 79
OCR Table Header
Reply Quote
Hi Adnan,

We have a table on a page and table's column header is spread into three lines. For example, "Last Action Date" is the column header and due to lack of room, the column header is wrapped (in the same cell) into 3 lines as below:
Last
Action
Date

How can we make sure that these three words are returned in the same order when I perform ocr on the page? We are not getting these three words in same order. We are getting the text from other column headers in between ..

Thanks
   Report 
  04-14-2008, 16:30
bsuresh@ains-india.stph.net is not online. Last active: 10/22/2008 8:42:48 PM bsuresh@ains-india.stph.net

Top 25 Posts
Joined on 06-24-2003
Posts 79
Re: OCR auto detected

Attachment: ocr - words - order - table.JPG
Reply Quote
Please check attachment.

   Report 
  04-15-2008, 8:30
Adnan Ismail is not online. Last active: 11/20/2008 4:22:06 PM Adnan Ismail



Top 10 Posts
Joined on 07-31-2006
Posts 1,271
Re: OCR auto detected
Reply Quote
You can define a Zone on each column header, and perform OCR on that Zone. The details to do this depends on what version of LEADTOOLS are you using and which LEADTOOLS programming interfaces you are using (API, COM Objects, or .NET Class library) to develop your application.


Adnan Ismail
LEADTOOLS Technical Support

   Report 
  04-15-2008, 12:04
bsuresh@ains-india.stph.net is not online. Last active: 10/22/2008 8:42:48 PM bsuresh@ains-india.stph.net

Top 25 Posts
Joined on 06-24-2003
Posts 79
Re: OCR auto detected
Reply Quote
I am using 15 SDK with C#.

Unfortunately I cannot know whether a document will contain table or other data. All I want to acheive is to be able to get the list of words on the page in the correct order. This is the requirement. So for some pages that contain the table, the order of words is not corred (as in the above case, though the text belongs to the same column, they are not being reported together because the text is wrapped within the column header). How can we get this.
   Report 
  04-17-2008, 7:50
Qasem Lubani is not online. Last active: 11/9/2008 3:40:25 PM Qasem Lubani



Top 10 Posts
Joined on 08-13-2006
Posts 1,135
Re: OCR auto detected
Reply Quote

I tested on your image, and did not define my own zones. Instead, I used the default zones that the engine finds for itself. The result was that the 3 words "Last Action Date" were automatically grouped into one zone, and when I displayed the list of recognized words, they were listed in this exact order.

Can you post or send us the actual full image you're trying to OCR instead of the partial screen capture.


Qasem Al-Lubani
LEAD Technical Support
www.leadtools.com
   Report 
  04-17-2008, 13:34
bsuresh@ains-india.stph.net is not online. Last active: 10/22/2008 8:42:48 PM bsuresh@ains-india.stph.net

Top 25 Posts
Joined on 06-24-2003
Posts 79
Re: OCR auto detected

Attachment: Note0001.tif
Reply Quote
Qasem,

Thanks for the response. Here I am attaching the original page file (TIF).

Thanks,
Suresh
   Report 
  04-19-2008, 15:33
bsuresh@ains-india.stph.net is not online. Last active: 10/22/2008 8:42:48 PM bsuresh@ains-india.stph.net

Top 25 Posts
Joined on 06-24-2003
Posts 79
Re: OCR auto detected
Reply Quote
Hi Qasem,

Any update on this please.
   Report 
  04-20-2008, 11:21
Qasem Lubani is not online. Last active: 11/9/2008 3:40:25 PM Qasem Lubani



Top 10 Posts
Joined on 08-13-2006
Posts 1,135
Re: OCR auto detected
Reply Quote
I have tested here with the full file and I got the same results you did. You can try to achieve this by checking the recognized words X coordinates and comparing them with each other, if they are close then the worlds belong to the same column and you can arrange them accordingly.
Qasem Al-Lubani
LEAD Technical Support
www.leadtools.com
   Report 
Post
LEAD Support Fo... » Developer » OCR » OCR Table Header

Powered by Community Server, by Telligent Systems