Global News Index and Extracted Features Repository

Banner image skyline

Global News Index and Extracted Features Repository

The Cline Center’s text analytics research draws on over 170 million historical news reports from around the world including:

  • Over 110 million unduplicated news stories collected using the Cline Center’s web crawl system from thousands of global news websites between 2006 to the present, with new stories added daily;
  • API access to the Lexis Nexis News Database
  • Nearly 6 million news stories from the New York Times from 1945 to 2005;
  • Over 2 million news stories from the Wall Street Journal from 1945 to 2005;
  • Open-source information encompassing over 4 million news stories from BBC Monitoring’s Summary of World Broadcasts (SWB) from 1979 to 2015, and over 3 million news stories from the U.S. Government’s Foreign Broadcast Information Service (FBIS) from 1995 to 2003. News data from these sources consist of stories from every country in the world that have been translated into English by fluent speakers who are culturally resonant with the countries in which news items originally appeared;
  • Another 6.2 million scanned and digitized microfilm and microfiche records of SWB and FBIS content captures from 1945 to the 1990s;
  • The only digitized record in the world of story summaries for four of the five main American newsreels that formed the hub of a worldwide newsreel system serving a global audience with visual news items nearly seven decades before the advent of CNN. The Cline Center also holds story summaries for one of the leading British newsreel companies along with a unique set of early television news summaries. Together, these various sources include around 130,000 stories broadcast between 1915 and 1985.

 

When citing the Global News Index and Extracted Features Repository, please use the following guidelines:

1) To cite the GNI codebook (or any other documentation associated with the Global News Index and Archer) please use the following citation:

Cline Center for Advanced Social Research. 2023. Global News Index and Extracted Features Repository [codebook]. Champaign, IL: University of Illinois. doi:10.13012/B2IDB-5649852_V5

2) To cite data from the Global News Index (accessed via Archer or otherwise) please use the following citation (filling in the correct date of access):

Cline Center for Advanced Social Research. 2023. Global News Index and Extracted Features Repository [database]. Champaign, IL: University of Illinois. Accessed Month, DD, YYYY. doi:10.13012/B2IDB-5649852_V5