[Z7 Beta] Random (?) sentences in the outline view

Hi all,

when working with this PDF (https://www.degruyter.com/document/doi/10.1515/9781400856626.159/html), I've noticed that Zotero shows an outline as available, which was already a surprise because Firefox doesn't show an outline for this document.

Apparently there is some automatic outline detection happening (which would be a great feature) that in this case is not very helpful; it just shows some apparently random sentences in the outline view:


  • Yes, this feature is still experimental, but it will improve over time. Thanks for reporting.
  • So far I've noticed quite a few cases in which the outline extraction produces somewhat unhelpful results. In this PDF, for example (https://www.degruyter.com/document/doi/10.1524/9783050060187/html), Zotero just recognizes the PDFs table of contents as chapter headings:



    Is it helpful to you if I report all the problems I'm seeing with this feature?
  • Is it helpful to you if I report all the problems I'm seeing with this feature?
    Definitely helpful!
  • Okay, I'll keep them coming in this case:

    In this book (http://link.springer.com/10.1007/978-3-322-80378-8), the extraction just extracts the book's title:


    In this text (https://www.nomos-elibrary.de/index.php?doi=10.5771/0023-5652-2015-182-78) it just extracts the first sentence and the title:


    In another scanned and OCRed book, it just recognizes part of the heading of one chapter title:

  • This paper: https://doi.org/10.2151/sola.15A-012

    The detected outline:

  • edited June 11, 2024
    Another OA example: https://doi.org/10.1063/5.0086745



    In this case, I had removed the first page of the PDF file before generating the outline to obtain this results.

    If I keep the first page, here is the result:


    Zotero 7.0.0-beta.85+c0c00a00e (64-bit)
    Windows 10
  • Another OA example showing its attraction to punctuation in equations: https://doi.org/10.1103/PhysRevFluids.9.053301



    Note that it is working nicely in some cases, so it is really useful to have this feature, even if only partially working.

    Zotero 7.0.0-beta.85+c0c00a00e (64-bit)
    Windows 10
  • I have found 3 books where I see the problem also.
    I have sent them to support@zotero.org.





  • Another interesting recent OA article: https://doi.org/10.1021/acsami.3c17037
    It is extracting some useful bookmarks, but the structure is not recognized:



    Zotero 7.0.0-beta.87+f59a4da7f (64-bit)
    Windows 10
  • In an OCRed PDF, Zotero detects some random sentences as Outline:



    Maybe it is because I quite often work with OCRed texts, but so far the outline detection feature has rarely been successful for me.
Sign In or Register to comment.