Class WebPDFLoader

A document loader for loading data from PDFs.

Hierarchy

BaseDocumentLoader
- WebPDFLoader

Index

Constructors

constructor

new WebPDFLoader(blob, __namedParameters?): WebPDFLoader
Parameters
- blob: Blob
- __namedParameters: {
      parsedItemSeparator: undefined | string;
      pdfjs: undefined | (() => Promise<{
          getDocument: ((src) => PDFDocumentLoadingTask);
          version: string;
      }>);
      splitPages: undefined | boolean;
  } = {}
  - parsedItemSeparator: undefined | string
  - pdfjs: undefined | (() => Promise<{
    getDocument: ((src) => PDFDocumentLoadingTask);
    version: string;
    }>)
  - splitPages: undefined | boolean
Returns WebPDFLoader
Overrides BaseDocumentLoader.constructor
- Defined in docs/api_refs/langchain/src/document_loaders/web/pdf.ts:17

Properties

`Protected` blob

blob: Blob

`Protected` parsedItemSeparator

parsedItemSeparator: string

`Protected` splitPages

splitPages: boolean = true

Methods

load

load(): Promise<Document<Record<string, any>>[]>
Loads the contents of the PDF as documents.

Returns Promise<Document<Record<string, any>>[]>
An array of Documents representing the retrieved data.
Overrides BaseDocumentLoader.load
- Defined in docs/api_refs/langchain/src/document_loaders/web/pdf.ts:36

loadAndSplit

loadAndSplit(splitter?): Promise<Document<Record<string, any>>[]>
Loads the documents and splits them using a specified text splitter.
Parameters
- splitter: TextSplitter = ...
Returns Promise<Document<Record<string, any>>[]>
A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.
Inherited from BaseDocumentLoader.loadAndSplit
- Defined in docs/api_refs/langchain/src/document_loaders/base.ts:32

Class WebPDFLoader

Hierarchy

Index

Constructors

Properties

Methods

Constructors

constructor

Parameters

blob: Blob

__namedParameters: {
    parsedItemSeparator: undefined | string;
    pdfjs: undefined | (() => Promise<{
        getDocument: ((src) => PDFDocumentLoadingTask);
        version: string;
    }>);
    splitPages: undefined | boolean;
} = {}

parsedItemSeparator: undefined | string

pdfjs: undefined | (() => Promise<{
getDocument: ((src) => PDFDocumentLoadingTask);
version: string;
}>)

splitPages: undefined | boolean

Returns WebPDFLoader

Properties

`Protected` blob

`Protected` parsedItemSeparator

`Protected` splitPages

Methods

load

Returns Promise<Document<Record<string, any>>[]>

loadAndSplit

Parameters

splitter: TextSplitter = ...

Returns Promise<Document<Record<string, any>>[]>

Settings

Member Visibility

Theme

On This Page

Class WebPDFLoader

Hierarchy

Index

Constructors

Properties

Methods

Constructors

constructor

Parameters

blob: Blob

__namedParameters: { parsedItemSeparator: undefined | string; pdfjs: undefined | (() => Promise<{ getDocument: ((src) => PDFDocumentLoadingTask); version: string; }>); splitPages: undefined | boolean; } = {}

parsedItemSeparator: undefined | string

pdfjs: undefined | (() => Promise<{ getDocument: ((src) => PDFDocumentLoadingTask); version: string; }>)

splitPages: undefined | boolean

Returns WebPDFLoader

Properties

Protected blob

Protected parsedItemSeparator

Protected splitPages

Methods

load

Returns Promise<Document<Record<string, any>>[]>

loadAndSplit

Parameters

splitter: TextSplitter = ...

Returns Promise<Document<Record<string, any>>[]>

Settings

Member Visibility

Theme

On This Page

__namedParameters: {
parsedItemSeparator: undefined | string;
pdfjs: undefined | (() => Promise<{
getDocument: ((src) => PDFDocumentLoadingTask);
version: string;
}>);
splitPages: undefined | boolean;
} = {}

pdfjs: undefined | (() => Promise<{
getDocument: ((src) => PDFDocumentLoadingTask);
version: string;
}>)

`Protected` blob

`Protected` parsedItemSeparator

`Protected` splitPages