Cédric KACZMAREK
first commit
70b87af

A newer version of the Gradio SDK is available: 5.27.0

Upgrade

News Article Loader

This loader makes use of the newspaper3k library to parse web page urls which have news articles in them.

Usage

pip install newspaper3k

Pass in an array of individual page URLs:

from llama_index.readers.web.news import NewsArticleReader

reader = NewsArticleReader(use_nlp=False)
documents = reader.load_data(
    [
        "https://www.cnbc.com/2023/08/03/amazon-amzn-q2-earnings-report-2023.html",
        "https://www.theverge.com/2023/8/3/23818388/brave-search-image-video-results-privacy-index",
    ]
)