This Website hosts a database tracking all words that news organizations have used since 2019. You can use the keyword tracker MAXX to visually graph keywords over time. Keyword tracker MAXX is flexible. You can fill in the keyword and select the organization. Select all the same organizations and compare different words, or select different organizations and track the same word. Mix them both up if you're feeling crazy.
My hope is to find the unexpected, and follow trends.
All HTML was scraped from archive.org. Their URLs are structured perfectly for programatically querying which is awesome.
I've collected various datasets based on different scraping methods. BoneBroth is a custom method that attempts to only track words found within the body of the webpage. However, it is not perfect. I've also put together a list of keywords to track for a DeepSearch. This method uses regex to find any mention of the word in the entire HTML document. This also has it's flaws and limitations.