PatentAuction.com lists patented inventions available for sale or licensing. Inventors can list their patented or patent-pending inventions for sale. New inventions are added daily.

Method to identify the document structure (sections, references)

[Category : - SOFTWARES]

An inventive approach for identifying the structure of unstructured documents, whether born-digital or scanned, regardless of the language they are written in (the illustrations show example documents in Chinese and Hebrew). The method, called ITS (Identify The Structure), uses a heuristic approach that outperforms AI-based methods in both speed and accuracy.

A continuation-in-part extends the patent to cover extracting data from documents and organizing that data into hierarchies that follow the document structure.
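To make the idea of organizing extracted data by document structure concrete, here is a minimal Python sketch. It is not the patented ITS method: it assumes, purely for illustration, that each extracted value is already tagged with a dotted section number such as "2.1", and the function name `build_hierarchy` and the node layout are hypothetical.

```python
def build_hierarchy(items):
    """Nest (section_number, value) pairs into a tree.

    Assumption for illustration only: section numbers are dotted strings
    such as "2.1". The listing does not say how ITS represents structure.
    """
    root = {"value": None, "children": {}}
    for number, value in items:
        node = root
        # Walk down one level per number component, creating nodes as needed.
        for part in number.split("."):
            node = node["children"].setdefault(
                part, {"value": None, "children": {}}
            )
        node["value"] = value
    return root


extracted = [
    ("1", "Scope"),
    ("2", "Terms"),
    ("2.1", "Payment due in 30 days"),
]
tree = build_hierarchy(extracted)
```

With this layout, the value extracted from section 2.1 sits under its parent section, at `tree["children"]["2"]["children"]["1"]["value"]`.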

The method offers several practical use cases, including:
1. Transforming unstructured documents into structured, navigable formats and converting internal references into clickable links.
2. Extracting data from large document collections with Deep Learning techniques, using structure-based preprocessing to improve performance and output quality.
3. Enabling real-time translation of lengthy documents by presenting them section by section rather than translating the entire document at once.
4. Improving the efficiency of LLM-based assistants (e.g., GPT assistants) that interact with lengthy documents by structuring the text, parallelizing computations, and adding annotations.
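
The first use case (navigable sections and clickable internal references) can be illustrated with a short Python sketch. It is a simplified stand-in, not the ITS heuristic: it assumes headings are lines that begin with a dotted section number and that references look like "Section 2.1". The patterns, function names, and the `sec-` anchor prefix are all hypothetical.

```python
import re

# Hypothetical heading pattern: a dotted section number, then a title,
# e.g. "2.1 Payment". An assumption for illustration, not the ITS heuristic.
HEADING = re.compile(r"^(\d+(?:\.\d+)*)\.?\s+(\S.*)$")
# Hypothetical internal-reference pattern, e.g. "Section 2.1".
REFERENCE = re.compile(r"\bSection (\d+(?:\.\d+)*)")


def split_sections(text):
    """Return a list of (number, title, body_lines) tuples."""
    sections = []
    for line in text.splitlines():
        match = HEADING.match(line.strip())
        if match:
            sections.append((match.group(1), match.group(2), []))
        elif sections:
            # Any non-heading line belongs to the most recent section.
            sections[-1][2].append(line)
    return sections


def link_references(line, known_numbers):
    """Wrap references to known sections in HTML anchors."""
    def replace(match):
        number = match.group(1)
        if number not in known_numbers:
            return match.group(0)  # leave unresolved references untouched
        return f'<a href="#sec-{number}">{match.group(0)}</a>'
    return REFERENCE.sub(replace, line)


document = (
    "1 Scope\n"
    "This agreement applies as set out in Section 2.1.\n"
    "2 Terms\n"
    "2.1 Payment\n"
    "Payment is due in 30 days.\n"
)
sections = split_sections(document)
known = {number for number, _, _ in sections}
linked = link_references(sections[0][2][0], known)
```

Once a document is split this way, each section can also be handed to a translator or an LLM on its own, which is the idea behind use cases 3 and 4. A pattern this naive will misread any body line that starts with a number as a heading, so a real system needs much stronger heuristics.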

For more information and practical demonstrations of the applications, please visit Link

Copyright PatentAuction.com 2004-2017
Page created at 2025-04-02 11:11:03, Patent Auction Time.