Customizable OCR (AI) Prompts for Enhanced Recognition

Dear Awesome Encodian Team,

I've been working with your OCR AI and, while it's performing well, I believe there's potential for enhancement tailored to specific needs. Here’s a suggestion that could further improve its accuracy, particularly with handwriting and specific document types.

Enhancement Proposal:

  • Custom Prompts: Allow users to provide custom "prompts" or "tips" to guide OCR towards organization- or country-specific formats.

  • Use Case Example: Canadian postal codes match the regex ^[A-Za-z]\d[A-Za-z] \d[A-Za-z]\d$, yet the OCR often misreads them on handwritten forms, e.g., interpreting "V9K 1S7" as "VAK 1S7".

  • Suggested Prompt: Users could input specific instructions such as: "You are an OCR expert for Canadian local government documents in British Columbia. Recognize postal codes following this regex...".
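Until something like this exists, the postal code misread above can be partly caught after OCR. The following is a minimal sketch in plain Python, not an Encodian feature: the confusion tables and the `repair_postal_code` helper are hypothetical, and the only grounded mapping is "A" read in place of "9", taken from the example above. It swaps look-alike characters only in positions where the regex demands the other character class.

```python
import re
from typing import Optional

# Regex from the post: letter, digit, letter, space, digit, letter, digit.
POSTAL_CODE = re.compile(r"^[A-Za-z]\d[A-Za-z] \d[A-Za-z]\d$")

# Hypothetical confusion tables; tune them to the misreads you actually observe.
# "A" -> "9" comes from the "VAK 1S7" example; the rest are assumptions.
TO_DIGIT = {"A": "9", "O": "0", "I": "1", "S": "5", "B": "8", "Z": "2"}
TO_LETTER = {"5": "S", "8": "B", "2": "Z"}

# Expected character class per position: L = letter, D = digit, space = space.
SHAPE = "LDL DLD"


def repair_postal_code(raw: str) -> Optional[str]:
    """Return a corrected postal code, or None if it cannot be repaired."""
    text = raw.strip().upper()
    if len(text) == 6:  # tolerate a missing space
        text = text[:3] + " " + text[3:]
    if len(text) != len(SHAPE):
        return None
    fixed = []
    for ch, kind in zip(text, SHAPE):
        if kind == "D" and not ch.isdigit():
            ch = TO_DIGIT.get(ch, ch)   # letter found where a digit belongs
        elif kind == "L" and not ch.isalpha():
            ch = TO_LETTER.get(ch, ch)  # digit found where a letter belongs
        fixed.append(ch)
    candidate = "".join(fixed)
    # Accept the repair only if it satisfies the regex from the post.
    return candidate if POSTAL_CODE.match(candidate) else None


print(repair_postal_code("VAK 1S7"))
```

This only repairs characters that land in the wrong class for their position. A digit misread as another digit still passes the regex, which is why guidance at recognition time would remain useful.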

Implementing such guidance could significantly enhance the OCR's accuracy in niche and highly structured data contexts. Looking forward to your thoughts and any potential timelines for this feature.

Thanks for considering this improvement!

Best,

Reno

2 comments

  • Jay Goodison (Official comment)

    Hello Reno Sun

    Thank you for the suggestion. 

    We are adding the ability to explicitly set the locale/language; currently the action only auto-detects it.

    That said, I don't expect this will help, as the OCR action is focussed on generic OCR rather than on any specific data-extraction use case. To cover your requirement, we'd really need to create a custom extraction model focussed specifically on your need. Our professional services team can assist if you would like to explore this further.

    Also, once the OCR layer has been produced, you can utilise our existing actions to extract data from the form zonally or using regex. Have you reviewed the following?

    PDF - Extract Text – Encodian Customer Help

    PDF - Extract Text from Regions – Encodian Customer Help

    Utility - Search Text (Regex) – Encodian Customer Help

  • Reno Sun

    Hi Jay,

    Thanks. It sounds like higher accuracy and improvable extraction models will always require professional services or a custom model (ML library). I will keep in mind that Encodian provides such services, but I will also keep researching cost-effective solutions for my organization.

    Cheers,

    Reno
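For readers weighing the regex-based extraction suggested in the official comment, here is a rough sketch of what a strict pattern does with already-OCR'd text. It uses plain Python `re` as a stand-in and does not call any Encodian action; the `find_postal_codes` helper, the optional-space variant of the pattern, and the sample text are assumptions for illustration.

```python
import re
from typing import List

# Unanchored variant of the post's pattern, with an optional space,
# so it can scan a whole page of OCR text (an assumption, not the post's exact regex).
POSTAL_CODE_IN_TEXT = re.compile(r"\b[A-Za-z]\d[A-Za-z] ?\d[A-Za-z]\d\b")


def find_postal_codes(ocr_text: str) -> List[str]:
    """Return every substring that looks like a Canadian postal code, uppercased."""
    return [m.group(0).upper() for m in POSTAL_CODE_IN_TEXT.finditer(ocr_text)]


# Illustrative sample text, not real OCR output.
sample = "Primary: V9K 1S7. Typed without space: v9k1s7. Misread: VAK 1S7."
print(find_postal_codes(sample))
```

The misread "VAK 1S7" is skipped silently, because a strict regex can only confirm values the OCR already got right. That is the gap the original request is about.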
