AI Solution Catalogue

Solution number

A-0173

Solution name

Smart Entity Extraction platform for automated form and document processing

Solution description

The Generic Rule-based Entity Extraction Network Platform (GREEN) is a platform for extracting information from forms and documents. Traditional methods align each image with a predefined template, so any variation between the image and the template can lead to incorrect cropping and recognition errors. Predefined fields also limit recognition accuracy when characters are written outside their bounds.

GREEN has two objectives: to extract information from documents flexibly, even when no template is available, and to remain extensible across different document information extraction scenarios. It starts by recognizing all characters in the image and grouping them into words. Language model-based contextual understanding is then applied to classify the words and determine whether they form sentences. Each grouped text block is assigned a 2D position and a type, such as "name", "date", "number", "HKID" or "address". Spatial linkage information is then used to establish relationships between text blocks. Users can define rules based on the relative positions of the recognized text blocks to identify specific areas.
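The idea of rules over relative positions can be illustrated with a minimal sketch. It is not GREEN's actual API: the `TextBlock` class, the `right_of` and `extract_value` helpers, the "label" type, the pixel tolerance and the sample data are all hypothetical, chosen only to show how a typed text block with a 2D position might be selected relative to another block.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TextBlock:
    """Hypothetical text block: recognized text, assigned type, 2D position."""
    text: str
    kind: str   # e.g. "label", "name", "date" (the "label" type is an assumption)
    x: float    # left edge of the block
    y: float    # vertical centre of the block


def right_of(anchor: TextBlock, other: TextBlock, y_tol: float = 10.0) -> bool:
    """True if `other` lies to the right of `anchor` on roughly the same line."""
    return other.x > anchor.x and abs(other.y - anchor.y) <= y_tol


def extract_value(blocks: List[TextBlock], label: str, kind: str) -> Optional[str]:
    """Rule: return the nearest block of type `kind` to the right of `label`."""
    anchors = [b for b in blocks if b.kind == "label" and b.text == label]
    if not anchors:
        return None
    anchor = anchors[0]
    candidates = [b for b in blocks if b.kind == kind and right_of(anchor, b)]
    if not candidates:
        return None
    # Pick the candidate with the smallest horizontal distance from the label.
    return min(candidates, key=lambda b: b.x - anchor.x).text


# Made-up sample blocks, as if produced by the recognition and typing steps.
blocks = [
    TextBlock("Name", "label", 20, 50),
    TextBlock("Chan Tai Man", "name", 120, 52),
    TextBlock("Date of birth", "label", 20, 90),
    TextBlock("01-01-1990", "date", 160, 91),
]

name = extract_value(blocks, "Name", "name")
birth_date = extract_value(blocks, "Date of birth", "date")
```

Because the rule refers only to block types and relative positions, it does not depend on a fixed template: the same rule would still apply if the whole form were shifted or the value were written slightly outside a printed box, as long as it stays within the tolerance.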

Application areas

Research and information retrieval; document writing

Use case

With GREEN, organizations can streamline the extraction process, improve accuracy, and adapt to changing document formats. Businesses can extract information from a variety of documents more efficiently, which raises productivity and reduces manual effort.

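Supports on-premises server deployment

Yes

Supports standalone operation on a laptop

Yes

Requires a graphics processing unit (GPU) to run

No

Payment model

One-time fee, renewed annually

Free trial

No

Company / organization name

Hong Kong Applied Science and Technology Research Institute (ASTRI)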

Email address

sharonling@astri.org

Telephone number

+85234062718

Website

https://www.astri.org/

Address

5/F, Photonics Centre, 2 Science Park East Avenue, Hong Kong Science Park, Shatin, Hong Kong

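Solution brief

Click to download

Solution introduction video

Any government department that would like additional information about this AI solution should contact Smart LAB.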