 |
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
Xudong Xie*,
Liang Yin*,
Hao Yan*,
Yang Liu*,
Jing Ding,
Minghui Liao,
Yuliang Liu,
Wei Chen†,
Xiang Bai†
Arxiv, 2024
[arXiv]
[Code]
[Project Page]
In this paper, we propose PDF-Wukong, A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling.
|