PDF转MarkDown(nougat-base)
Nougat model trained on PDF-to-markdown. It was introduced in the paper Nougat: Neural Optical Understanding for Academic Documents by Blecher et al. and first released in this repository.
Nougat is a Donut model trained to transcribe scientific PDFs into an easy-to-use markdown format. The model consists of a Swin Transformer as vision encoder, and an mBART model as text decoder.
The model is trained to autoregressively predict the markdown given only the pixels of the PDF image as input.
使用
https://openi.pcl.ac.cn/cubeai-model-zoo/hffacebooknougat-base
模型来源
https://hf-mirror.com/facebook/nougat-base
评论