source: other-projects/maori-lang-detection/mongodb-data/piechart_data2.txt@ 34006

Last change on this file since 34006 was 34006, checked in by ak19, 4 years ago

Committing more data I've collected for generating pie charts and the pie-charts for the first dataset, which is how the seed URLs for crawling were obtained.

File size: 844 bytes
Line 
1https://www.rapidtables.com/tools/pie-chart.html
2
3Title: 38724 out of >11.4 billion URLs in 12-month CommonCrawl data had content_language=MRI
4data names: discarded_10290 greylisted_2751 pruned_4 crawlSeeds_25679
5data values: 10290 2751 4 25679
6slice text: (Percentage)
7
8
9------
10https://www.meta-chart.com/pie#/data
11
12Number of slices -> 4
13Series Unit: URLs
14
15Slice 1: discarded (red) 10290
16Slice 2: greyListed (grey) 2751
17Slice 3: further pruned away (yellow) 4
18Slice 4: final crawl seeds (green) 25679
19
20https://www.meta-chart.com/pie#/labels
21Graph title: Processing the 38724 out of >11.4 billion URLs in the 12-month CommonCrawl data which had content_language=MRI
22Slice Display data label display setting: Name, Value and Percent
23
24https://www.meta-chart.com/pie#/display
25Export as SVG and PNG
26Leave Sort setting at botton to "ORIG (default)"
Note: See TracBrowser for help on using the repository browser.