A sample of over 1,000 records from a BBC news dataset. The dataset was extracted using the Bright Data API.
id: Unique identifier for the news article
url: The web address where the article is published
author: The name of the journalist or contributor who wrote the article
headline: The main title of the article
topics: Array of topics related to the article
publication_date: The date when the article was published
content: The full text of the article
videos: Any embedded videos related to the article
images: Any images included in the article
related_articles: Links to other articles that are relevant to the topic
And many more fields.
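The `topics` and `publication_date` fields lend themselves to simple trend analysis. The following is a minimal sketch, not an official recipe: it assumes each record is a Python dict, that `publication_date` is an ISO 8601 string, and that `topics` is a list of strings. The dataset description does not confirm those value formats, and the function name `topic_counts_by_month` and the sample records are made up for illustration.

```python
from collections import Counter
from datetime import datetime


def topic_counts_by_month(records):
    """Count how often each topic appears per calendar month.

    Assumption: `publication_date` is an ISO 8601 string and `topics`
    is a list of strings. Records missing either field are skipped.
    """
    counts = Counter()
    for record in records:
        date_text = record.get("publication_date")
        topics = record.get("topics")
        if not date_text or not topics:
            continue
        # Older Python versions reject a trailing "Z", so normalise it first.
        published = datetime.fromisoformat(date_text.replace("Z", "+00:00"))
        month = published.strftime("%Y-%m")
        for topic in topics:
            counts[(month, topic)] += 1
    return counts


# Hypothetical records, invented purely to exercise the function.
sample_records = [
    {"publication_date": "2024-03-05T10:00:00Z", "topics": ["Politics", "Economy"]},
    {"publication_date": "2024-03-20", "topics": ["Politics"]},
    {"publication_date": "2024-04-01T08:30:00+00:00", "topics": ["Economy"]},
    {"headline": "Record without a date or topics"},
]

for (month, topic), n in sorted(topic_counts_by_month(sample_records).items()):
    print(month, topic, n)
```

If the real date format differs, only the parsing line should need to change.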
This sample is a subset of the "BBC news" dataset, which includes more than 75K records.
Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.
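For the NDJSON / JSON Lines option, a file can be read one record at a time with the Python standard library alone. This is a hedged sketch: the helper name `iter_records` and the file name in the usage note are hypothetical, and it assumes one JSON object per line, UTF-8 encoding, and a `.gz` suffix on compressed files.

```python
import gzip
import json
from pathlib import Path
from typing import Iterator


def iter_records(path: str) -> Iterator[dict]:
    """Yield one dict per non-empty line of a JSON Lines / NDJSON file.

    Assumption: files whose name ends in .gz are gzip-compressed;
    anything else is read as plain UTF-8 text.
    """
    opener = gzip.open if Path(path).suffix == ".gz" else open
    with opener(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # tolerate blank lines between records
                yield json.loads(line)
```

Usage might look like `for record in iter_records("bbc_sample.jsonl.gz"): print(record.get("headline"))`, where the file name is a placeholder. Reading lazily avoids loading the whole file into memory, which matters more for the full dataset than for the sample.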
Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.
Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.
Data enrichment beyond the extracted data points: Available on request.
Track media trends and analyze how news coverage evolves over time using BBC datasets, with a focus on topic frequency and framing.
Develop algorithms that use BBC datasets to detect fake news and assess the integrity of information.
Integrate BBC datasets into algorithmic trading models and economic forecasting tools. Feeding real-time news data into trading algorithms is intended to let these systems respond quickly to market movements triggered by breaking news events, economic reports, or political developments.
The Bright Initiative offers access to Bright Data's Web Scraper APIs and ready-to-use datasets to leading academic faculties and researchers, as well as NGOs and NPOs promoting various environmental and social causes. You can submit an application here.