Section 1

Global News Archive

The Cline Center’s text analytics research draws on an archive of over 85 million historical news reports from around the world, including:

  • Over 60 million unduplicated news stories (or over 100 million news stories if duplicates are included) collected using the Cline Center’s web crawl system from thousands of global news websites from 2006 to the present, with new stories added daily.
  • Nearly 6 million news stories from the New York Times from 1945 to 2005.
  • Over 2 million news stories from the Wall Street Journal from 1945 to 2005.
  • Open-source information encompassing over 4 million news stories from BBC Monitoring’s Summary of World Broadcasts (SWB) from 1979 to 2012, and over 3 million news stories from the U.S. Government’s Foreign Broadcast Information Service (FBIS) from 1994 to 2003. News data from these sources consist of stories from every country in the world that have been translated into English by fluent speakers who are culturally resonant with the countries in which news items originally appeared.
  • Another 6.2 million scanned and digitized microfilm and microfiche records of SWB and FBIS content captured from 1945 to the 1990s.
  • The only digitized record in the world of story summaries for four of the five main American newsreels that formed the hub of a worldwide newsreel system serving a global audience with visual news items nearly seven decades before the advent of CNN. The Cline Center also holds story summaries for one of the leading British newsreel companies along with a unique set of early television news summaries. Together, these various sources include around 130,000 stories broadcast between 1915 and 1985.