Older flowchart of the process for loading Common Crawl data into MongoDB as the websites and webpages collections for querying