Too slow loading #353

foxjaw · 2024-06-10T14:00:04Z

Takes like a minute to even open a document.

andiwand · 2024-06-11T06:27:57Z

Hi @foxjaw, I fear the loading screen is not telling us much. Do you have a file to share?

I suspect that it is a PDF which tend to take quite some time to open if they are large. Is this something that was better in the past in your experience?

foxjaw · 2024-06-11T06:34:04Z

Yes. All PDFs greater than 1 mB load slowly. It's like a nightmare. Meanwhile I use MuPDF separately to open PDFs it's very fast. Why is open document taking a long time to load big documents?

andiwand · 2024-06-11T07:46:02Z

I think this is an issue with pdf2htmlEX. Potentially we have some regression there with build flags / jpeg vs png? cc @TomTasche @ViliusSutkus89

foxjaw · 2024-06-11T07:50:44Z

Oh okay. So Open Document does not have a native pdf viewing library and instead relies on html conversion which is why it's inefficient ?

andiwand · 2024-06-11T07:56:47Z

I think this does not necessarily mean that it is inefficient. pdf2htmlEX will render the PDF to images and put those into HTML. This is very much like what the browser does but one more step of indirection. I can remember that there was a problem in the past which caused the rendering being quite slow. A simple configuration change resolved that.

Apart from that we could also render the pages in parallel and display what is rendered already instead of waiting for the whole document to finish. But I fear we are lacking the personpower to achieve this in the near future.

foxjaw · 2024-06-11T08:00:02Z

I don't think browsers do this. If they were converted into images it would be impossible to select the text & ctrl+f the document, which I can with browsers.
See I'm technically weaker. But at least this is how I see.

ViliusSutkus89 · 2024-06-13T00:03:51Z

Hello y'all

So I don't think it's a regression, it was always slow and we have a few of reasons why it's slow.

First conversion is extra slow because we have to extract asset files from pdf2htmlEX, Poppler and FontForge. Ideally we should use them without extracting, but this requires some work ( opendocument-app/pdf2htmlEX-Android#9 , opendocument-app/pdf2htmlEX-Android#10 ). Currently these libraries expect assets to be found as regular files on a disk.

Second reason is the thing @andiwand mentioned - we convert the whole document and only then render it. Other viewers do conversion and rendering at the same time. Page by page conversion might be the lowest hanging fruit here. But the thing is, once opendocument-app/pdf2htmlEX-Android#93 is implemented, we can interface pdf2htmlEX from odr.core through C++ instead of odr.droid through Java. This means that whatever improvement we code up now, would probably be needed to be reimplemented.

Also, when pdf2htmlEX does conversion to HTML, it's not just images. Normally pdf texts end up as selectable html text elements

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Too slow loading #353

Too slow loading #353

foxjaw commented Jun 10, 2024

andiwand commented Jun 11, 2024

foxjaw commented Jun 11, 2024

andiwand commented Jun 11, 2024

foxjaw commented Jun 11, 2024

andiwand commented Jun 11, 2024

foxjaw commented Jun 11, 2024

ViliusSutkus89 commented Jun 13, 2024

Too slow loading #353

Too slow loading #353

Comments

foxjaw commented Jun 10, 2024

andiwand commented Jun 11, 2024

foxjaw commented Jun 11, 2024

andiwand commented Jun 11, 2024

foxjaw commented Jun 11, 2024

andiwand commented Jun 11, 2024

foxjaw commented Jun 11, 2024

ViliusSutkus89 commented Jun 13, 2024