
OCR (PDF Tooling) #294

@mmcky


We have had great success so far with

  1. https://github.com/datalab-to/marker

to extract data from PDFs into Markdown components.

However, it would be interesting to compare it against a couple of newly released tools:

  1. https://github.com/Yuliang-Liu/MonkeyOCR
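One possible way to run such a comparison, sketched under assumptions: each tool has already been run separately on the same PDF and its Markdown output is available as a string. The function name `compare_markdown` and the `marker` / `monkeyocr` diff labels are hypothetical and only for illustration; the sketch uses Python's standard `difflib` and does not call either tool's API.

```python
import difflib


def compare_markdown(a: str, b: str) -> dict:
    """Compare two Markdown extractions of the same PDF, line by line.

    `a` and `b` are assumed to be Markdown text already produced by two
    different tools (hypothetically marker and MonkeyOCR).
    """
    # Drop blank lines and surrounding whitespace so that purely cosmetic
    # spacing differences do not dominate the comparison.
    a_lines = [line.strip() for line in a.splitlines() if line.strip()]
    b_lines = [line.strip() for line in b.splitlines() if line.strip()]

    # Line-level similarity ratio in [0, 1]; 1.0 means identical line sequences.
    matcher = difflib.SequenceMatcher(None, a_lines, b_lines, autojunk=False)

    # Unified diff for manual inspection of where the outputs disagree.
    diff = list(
        difflib.unified_diff(
            a_lines, b_lines, fromfile="marker", tofile="monkeyocr", lineterm=""
        )
    )
    return {"similarity": matcher.ratio(), "diff": diff}
```

A line-level ratio is only a rough signal: it shows how much the two outputs agree with each other, not which one is closer to the source PDF, so the diff would still need manual review on a few representative documents.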

Labels: enhancement
