Changeset 33825
- Timestamp:
- 2020-01-13T21:47:33+13:00 (4 years ago)
- Location:
- other-projects/maori-lang-detection
- Files:
-
- 1 added
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
other-projects/maori-lang-detection/hdfs-cc-work/GS_README.TXT
r33824 r33825 20 20 --- 21 21 22 APPENDIX: Legend of mongodb-data folder's contents 22 23 APPENDIX: Reading data from hbase tables and backing up hbase 23 24 … … 983 984 984 985 -------------------------------------------------------- 985 APPENDIX: Legend of mongodb-data folder's contents 986 APPENDIX: Legend of mongodb-data folder's contents 986 987 -------------------------------------------------------- 987 988 1. allCrawledSites: all sites from CommonCrawl where the content-language=MRI, which we then crawled with Nutch with depth=10. Some obvious auto-translated websites were skipped.
Note:
See TracChangeset
for help on using the changeset viewer.