Changeset 33896


Ignore:
Timestamp:
2020-02-03T23:29:59+13:00 (4 years ago)
Author:
ak19
Message:

Clarification in comments

File:
1 edited

Legend:

Unmodified
Added
Removed
  • other-projects/maori-lang-detection/MoreReading/mongodb.txt

    r33849 r33896  
    985985
    986986
    987 AFTER BUGFIX FOR miInURLPath being set at the correct now:
     987AFTER BUGFIX FOR miInURLPath being set at the correct stage now:
    988988db.getCollection('Websites').find(
    989989{$and: [
     
    10181018]);
    10191019
    1020 Can inspect websites' pages for whether it's relevant/auto-translated as follows:
     1020Can inspect websites' pages for whether it's relevant vs auto-translated as follows:
    10211021    db.getCollection('Webpages').find({URL:/svenkirsten.com/, mriSentenceCount: {$gt: 0}})
    10221022
     
    13261326X http://www.precious-testimonies.com, http://precious-testimonies.com/JesusDidItTranslations/JesusDidItMaoriTranslation.htm may be autotranslated as the Dutch page looks more like Danish or some Scandinavian language and the French page is missing accented characters.
    13271327
    1328 CHECK - PROBABLY:
     1328CHECK, PROBABLY HAS MRI - PROCESSED:
    13291329!! https://maorinews.com,
    13301330!! http://maaori.com,
     
    13461346X http://www.waimate.com, [English, NZ place]
    13471347
    1348 MAYBE, INSPECT:
     1348MAYBE HAS MRI, INSPECT - PROCESSED:
    13491349? https://www.natekore2018.com, [lots of English, but COMMUNITY, CULTURE]
    13501350+ http://tatai09.blogspot.com,
     
    14081408X https://www.dbnames.net, [Name database, lots misdetected]
    14091409
    1410 STILL TO DO LIST:
     1410STILL TO DO LIST - PROCESSED:
    14111411
    14121412X https://www.myadsclassified.com, [misdetected 3 short English sentences as MRI]
Note: See TracChangeset for help on using the changeset viewer.