The first working hadoop spark script for processing common crawl data. This one successfully got all the commoncrawl warc INDEX data for the specified period where content_languages contained mri (as any of the document's 3 primary languages) and put them out into a csv file