A document loader specifically designed for loading documents from the College Confidential website. It extends the CheerioWebBaseLoader.

Hierarchy

Constructors

Properties

caller: AsyncCaller
timeout: number
webPath: string
selector?: SelectorType
textDecoder?: TextDecoder

Methods

  • Overrides the base load() method to extract the text content from the loaded document using a specific selector for the College Confidential website. It creates a Document instance with the extracted text and metadata, and returns an array containing the Document instance.

    Returns Promise<Document<Record<string, any>>[]>

    An array containing a Document instance with the extracted text and metadata from the loaded College Confidential web document.

  • Loads the documents and splits them using a specified text splitter.

    Parameters

    Returns Promise<Document<Record<string, any>>[]>

    A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.

  • Fetches the web document from the webPath and loads it using Cheerio. It returns a CheerioAPI instance.

    Returns Promise<CheerioAPI>

    A Promise that resolves to a CheerioAPI instance.

  • A static method that dynamically imports the Cheerio library and returns the load function. If the import fails, it throws an error.

    Returns Promise<{
        load: ((content, options?, isDocument?) => CheerioAPI);
    }>

    A Promise that resolves to an object containing the load function from the Cheerio library.

Generated using TypeDoc