Module: pdfjsLib

Classes

PDFDataRangeTransport
PDFDocumentProxy

Methods

(inner) getDocument(src) → {PDFDocumentLoadingTask}

This is the main entry point for loading a PDF and interacting with it. NOTE: If a URL is used to fetch the PDF data a standard XMLHttpRequest(XHR) is used, which means it must follow the same origin rules that any XHR does e.g. No cross domain requests without CORS.
Parameters:
Name Type Description
src string | TypedArray | DocumentInitParameters | PDFDataRangeTransport Can be a url to where a PDF is located, a typed array (Uint8Array) already populated with data or parameter object.
Source:
Returns:
Type
PDFDocumentLoadingTask

Type Definitions

DocumentInitParameters

Document initialization / loading parameters object.
Type:
  • Object
Properties:
Name Type Attributes Description
url string <optional>
The URL of the PDF.
data TypedArray | Array | string <optional>
Binary PDF data. Use typed arrays (Uint8Array) to improve the memory usage. If PDF data is BASE64-encoded, use atob() to convert it to a binary string first.
httpHeaders Object <optional>
Basic authentication headers.
withCredentials boolean <optional>
Indicates whether or not cross-site Access-Control requests should be made using credentials such as cookies or authorization headers. The default is false.
password string <optional>
For decrypting password-protected PDFs.
initialData TypedArray <optional>
A typed array with the first portion or all of the pdf data. Used by the extension since some data is already loaded before the switch to range requests.
length number <optional>
The PDF file length. It's used for progress reports and range requests operations.
range PDFDataRangeTransport <optional>
rangeChunkSize number <optional>
Specify maximum number of bytes fetched per range request. The default value is 2^16 = 65536.
worker PDFWorker <optional>
The worker that will be used for the loading and parsing of the PDF data.
verbosity number <optional>
Controls the logging level; the constants from {VerbosityLevel} should be used.
docBaseUrl string <optional>
The base URL of the document, used when attempting to recover valid absolute URLs for annotations, and outline items, that (incorrectly) only specify relative URLs.
cMapUrl string <optional>
The URL where the predefined Adobe CMaps are located. Include trailing slash.
cMapPacked boolean <optional>
Specifies if the Adobe CMaps are binary packed.
CMapReaderFactory Object <optional>
The factory that will be used when reading built-in CMap files. Providing a custom factory is useful for environments without `XMLHttpRequest` support, such as e.g. Node.js. The default value is {DOMCMapReaderFactory}.
stopAtErrors boolean <optional>
Reject certain promises, e.g. `getOperatorList`, `getTextContent`, and `RenderTask`, when the associated PDF data cannot be successfully parsed, instead of attempting to recover whatever possible of the data. The default value is `false`.
maxImageSize number <optional>
The maximum allowed image size in total pixels, i.e. width * height. Images above this value will not be rendered. Use -1 for no limit, which is also the default value.
isEvalSupported boolean <optional>
Determines if we can eval strings as JS. Primarily used to improve performance of font rendering, and when parsing PDF functions. The default value is `true`.
disableFontFace boolean <optional>
By default fonts are converted to OpenType fonts and loaded via font face rules. If disabled, fonts will be rendered using a built-in font renderer that constructs the glyphs with primitive path commands. The default value is `false`.
fontExtraProperties boolean <optional>
Include additional properties, which are unused during rendering of PDF documents, when exporting the parsed font data from the worker-thread. This may be useful for debugging purposes (and backwards compatibility), but note that it will lead to increased memory usage. The default value is `false`.
disableRange boolean <optional>
Disable range request loading of PDF files. When enabled, and if the server supports partial content requests, then the PDF will be fetched in chunks. The default value is `false`.
disableStream boolean <optional>
Disable streaming of PDF file data. By default PDF.js attempts to load PDFs in chunks. The default value is `false`.
disableAutoFetch boolean <optional>
Disable pre-fetching of PDF file data. When range requests are enabled PDF.js will automatically keep fetching more data even if it isn't needed to display the current page. The default value is `false`. NOTE: It is also necessary to disable streaming, see above, in order for disabling of pre-fetching to work correctly.
pdfBug boolean <optional>
Enables special hooks for debugging PDF.js (see `web/debugger.js`). The default value is `false`.
Source:

GetAnnotationsParameters

Page annotation parameters.
Type:
  • Object
Properties:
Name Type Description
intent string Determines the annotations that will be fetched, can be either 'display' (viewable annotations) or 'print' (printable annotations). If the parameter is omitted, all annotations are fetched.
Source:

getTextContentParameters

Page getTextContent parameters.
Type:
  • Object
Properties:
Name Type Description
normalizeWhitespace boolean replaces all occurrences of whitespace with standard spaces (0x20). The default value is `false`.
disableCombineTextItems boolean do not attempt to combine same line TextItem's. The default value is `false`.
Source:

GetViewportParameters

Page getViewport parameters.
Type:
  • Object
Properties:
Name Type Attributes Description
scale number The desired scale of the viewport.
rotation number <optional>
The desired rotation, in degrees, of the viewport. If omitted it defaults to the page rotation.
offsetX number <optional>
The horizontal, i.e. x-axis, offset. The default value is `0`.
offsetY number <optional>
The vertical, i.e. y-axis, offset. The default value is `0`.
dontFlip boolean <optional>
If true, the y-axis will not be flipped. The default value is `false`.
Source:

PDFDocumentStats

Type:
  • Object
Properties:
Name Type Description
streamTypes Object Used stream types in the document (an item is set to true if specific stream ID was used in the document).
fontTypes Object Used font types in the document (an item is set to true if specific font ID was used in the document).
Source:

PDFOperatorList

PDF page operator list.
Type:
  • Object
Properties:
Name Type Description
fnArray Array Array containing the operator functions.
argsArray Array Array containing the arguments of the functions.
Source:

PDFWorkerParameters

Type:
  • Object
Properties:
Name Type Attributes Description
name string <optional>
The name of the worker.
port Object <optional>
The `workerPort`.
verbosity number <optional>
Controls the logging level; the constants from {VerbosityLevel} should be used.
Source:

RenderParameters

Page render parameters.
Type:
  • Object
Properties:
Name Type Attributes Description
canvasContext Object A 2D context of a DOM Canvas object.
viewport PageViewport Rendering viewport obtained by calling the `PDFPageProxy.getViewport` method.
intent string <optional>
Rendering intent, can be 'display' or 'print' (default value is 'display').
enableWebGL boolean <optional>
Enables WebGL accelerated rendering for some operations. The default value is `false`.
renderInteractiveForms boolean <optional>
Whether or not interactive form elements are rendered in the display layer. If so, we do not render them on canvas as well.
transform Array <optional>
Additional transform, applied just before viewport transform.
imageLayer Object <optional>
An object that has beginLayout, endLayout and appendImage functions.
canvasFactory Object <optional>
The factory that will be used when creating canvases. The default value is {DOMCanvasFactory}.
background Object <optional>
Background to use for the canvas. Can use any valid canvas.fillStyle: A DOMString parsed as CSS value, a CanvasGradient object (a linear or radial gradient) or a CanvasPattern object (a repetitive image). The default value is 'rgb(255,255,255)'.
Source:

TextContent

Page text content.
Type:
  • Object
Properties:
Name Type Description
items array array of TextItem
styles Object TextStyle objects, indexed by font name.
Source:

TextItem

Page text content part.
Type:
  • Object
Properties:
Name Type Description
str string text content.
dir string text direction: 'ttb', 'ltr' or 'rtl'.
transform array transformation matrix.
width number width in device space.
height number height in device space.
fontName string font name used by pdf.js for converted font.
Source:

TextStyle

Text style.
Type:
  • Object
Properties:
Name Type Description
ascent number font ascent.
descent number font descent.
vertical boolean text is in vertical mode.
fontFamily string possible font family
Source: