diff --git a/README.md b/README.md
index 391a246..f39e74e 100644
--- a/README.md
+++ b/README.md
@@ -42,6 +42,7 @@ You can get quick insights on a PDF file with these commands:
 - `overview` outputs text data about the structure and the metadata.
 - `disasm` outputs a dump of the file structure on the terminal.
 - `text` outputs extracted text spatially, as if it was a kind of scan.
+- `fonts` outputs list of fonts used.
 - `browse` outputs static html data that lets you browse the internal structure of the PDF file: the PDF source is pretty-printed and augmented with hyperlinks.
 
 ## API overview
diff --git a/docs/cli.html b/docs/cli.html
index 0a98db5..c176fb6 100644
--- a/docs/cli.html
+++ b/docs/cli.html
@@ -27,9 +27,16 @@

overview

disasm

The output shows a terse and greppable view of the file internal structure. Please refer to the Disassembler article for details.

-

text

The output shows a full extract of the text content, with a spatial awareness: the algorithm tries to respect the original layout, as if characters of all sizes were approximately rendered on a fixed-size grid.

+

fonts

+

The output shows a list of fonts used in the file, with the following tabular data:

+

browse

The generated HTML looks like the raw PDF file with the following additions: