Location>code7788 >text

OCR PDF parsing supporting front-end tools open source details!

Popularity:712 ℃/2024-10-10 11:41:52

In the face of daily life and work in the common OCR recognition, PDF parsing, translation, proofreading and other scenes, supporting visualization tools can greatly enhance our experience and work efficiency.

Through the visualization interface, we can intuitively see the results of text recognition, parsing and translation, and conveniently evaluate the effectiveness of the product.

Today to share with you a great open source project - TextIn ParseX-Frontend, to help us easily build an excellent front-end visualization interface."Link"

 

/16YSJx

 

 

Project Profile

ParseX-Frontend is a set of visualization tools developed by TextIn, which is aimed at OCR or PDF parsing results review and proofreading, effect evaluation scenarios, but also applies to a series of translation software and other tools that need to be visualized comparison. The project is developed in ES6 , based on the React framework , can clearly and comprehensively show the results of parsing , with rich visualization and interactive features .

TextIn provides an excellent model for front-end development in related fields.

The project is currently open source on Github!

 

Performance Features

  • Free and stable: The front-end component code is all open source, and ParseX, as a core TextIn product, maintains stable front-end updates and has announced plans to import editing features.
  • Powerful rendering capabilities: preview rendering of mainstream image formats and pdf files, providing zoom and rotate functions; markdown results rendering, support for all levels of title, picture, formula rendering display.
  • Supports element extraction and location traceability: All kinds of parsing elements are extracted and displayed, support for viewing tables, formulas, pictures, and original JSON results; parsing elements document location traceability, the original text box labeled with the location of each element, you can click the box to jump to the parsing results, you can also click the parsing results to jump to the original text box.
  • Catalog Restore: Reduced display of each level of the catalog tree, support for clicking to jump to the appropriate chapter.
  • Flexible parameter configuration: Interface call option parameters , support configuration of different combinations of parameters , to obtain the corresponding parsing results .
  • Convenient copy and export function: Supports copying and exporting markdown files; after copying parsed tables and pictures, you can paste them directly into Excel tables.

Installation and use

Installation of dependencies

Environmental requirements:

  1. node version >= 18.20.4
  2. It is highly recommended to use the yarn package manager, which is included in the project repository and specifies the dependency versions.

Pulling items:

git clone /intsig-textin/

Install using npm or yarn:

yarn install
# or
npm install

Initiation of projects

yarn start
# or
npm run start

Browser to access http://localhost:10007

script command

Project structure

A brief description of the project file structure to help contributors and users understand how the project is organized.

├── src/ # source code
│ ├── assets/ # Static resources
│ ├── components/ # Global common components
│ ├── layouts/ # page frame components
│ ├── modules/ # store
│ ├── pages/ # page components
│ ├── service/ # Interface service
│ ├── utils/ # utility functions
│ └── # Entry files
├── public/ # Static Resources │ ├── config/ # Configuration
├── config/ # Configuration
│ ├── routes # routes
│ └── config.* # Other umi configuration
├── .            # ESLint configuration
├── # TypeScript configuration
├── # Project configuration
└── # Project description file

Effective demonstration

 

 

 

 

ParseX-Frontend, as a visualization tool, has rich interactive features. Original positioning traceability, one-click copy text, tables, pictures, flexible parameter configuration, not only can help users quickly compare and evaluate the effect of the product, improve work efficiency, but also reduce the technical threshold, non-developer users can also easily get started.

 

For more detailed features, interested parties can check out the project address:

Project Address:/intsig-textin/parsex-frontend