Changeset 34833
- Timestamp: 2021-02-13T22:48:46+13:00
- Location: main/trunk/model-sites-dev/eurovision-lod/collect/eurovision
- Files: 2 edited
main/trunk/model-sites-dev/eurovision-lod/collect/eurovision/etc/collectionConfig.xml
--- r34822
+++ r34833
@@ -145,6 +145,8 @@
 </search>
 
+<!--
 <search type="jenaTDB" orthogonal="true"/>
-
+-->
+
 
 <infodb type="jdbm"/>
@@ -155,6 +157,9 @@
 <plugin name="JSONSPARQLResultPlugin">
 <option name="metadata_merge_on_concat_fields" value="Country,Year" />
+<option name="OIDtype" value="assigned" />
+<option name="OIDmetadata" value="Identifier" />
 </plugin>
 <plugin name="MetadataXMLPlugin"/>
+<plugin name="MetadataJSONPlugin"/>
 <plugin name="ArchivesInfPlugin"/>
 <plugin name="DirectoryPlugin"/>
@@ -431,4 +436,8 @@
 <gsf:metadata name="Country"/>
 <gsf:metadata name="Year"/>
+
+<gsf:metadata name="Abstract_en"/>
+<gsf:metadata name="TitleAbstract_en"/>
+<gsf:metadata name="CreatorAbstract_en"/>
 
 
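Reviewer note: the two options added to JSONSPARQLResultPlugin (OIDtype="assigned", OIDmetadata="Identifier") suggest that each document's identifier is now taken from a named metadata field rather than derived from the file itself. The Python sketch below only illustrates that general idea. It is not Greenstone code: the helper `assign_oid`, the hash fallback for records without the field, and the sample records are all assumptions made up for this example.

```python
import hashlib


def assign_oid(metadata, oid_metadata="Identifier"):
    """Return a document ID taken from metadata[oid_metadata].

    Hypothetical helper. When the field is missing or empty it falls back
    to a hash of the remaining metadata; that fallback is an assumption of
    this sketch, not documented plugin behaviour.
    """
    value = metadata.get(oid_metadata, "").strip()
    if value:
        return value
    canonical = repr(sorted(metadata.items())).encode("utf-8")
    return "HASH" + hashlib.sha1(canonical).hexdigest()[:12]


# Made-up records shaped like rows of the query result (Country, Year).
records = [
    {"Identifier": "Sweden1974", "Country": "Sweden", "Year": "1974"},
    {"Country": "Ireland", "Year": "1994"},  # no Identifier field
]
oids = [assign_oid(record) for record in records]
```

With an assigned identifier, rebuilding the collection from the same query result should keep document URLs stable, which is presumably why the option was switched on here.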
main/trunk/model-sites-dev/eurovision-lod/collect/eurovision/transform/pages/about.xsl
--- r34824
+++ r34833
@@ -26,6 +26,8 @@
 
 <p>
-The aim of this site is to demonstrate the use the Greenstone3 Digital Library software platform to channel
-and transform Linked Open Data Resources into an appealing web site for fans of the contest.
+The aim of this site is to demonstrate the use the
+Greenstone3 Digital Library software platform to channel
+and transform Linked Open Data Resources into an appealing
+web site for fans of the contest.
 </p>
 
@@ -72,14 +74,21 @@
 GROUP BY ?country ORDER BY asc(?country)
 </xsl:attribute>
-<xsl:text> </xsl:text>
+<xsl:text> Loading ...</xsl:text>
 </div>
 
-
-<h2>Technical Development</h2>
-<p>
-In terms of how this collection was developed using the Greenstone DL architecture, the starting point is
-the formulation of a SPARQL query to retrieve from DBpedia entries about all the entrants in the contest
-over the years:
-<pre style="background-color: #fff; color: #000; padding: 12px;">
+
+<div id="technicaldev-turnstyle" style="margin-top: 12px;">
+<div class="turnstyle-header" style="background-image: none; background-color: hsl(195, 47%, 35%);">
+Technical Development
+</div>
+
+<div style="display: none; padding-left: 6px; padding-top: 6px; margin-left: 2px; margin-right: 2px; border-left: white solid 1px; border-right: white solid 1px; border-bottom: white solid 1px;">
+<p>
+In terms of how this collection was developed using the
+Greenstone DL architecture, the starting point is the
+formulation of a SPARQL query to retrieve from DBpedia
+entries about all the entrants in the contest over the
+years:
+<pre style="background-color: #fff; color: #000; padding: 12px; margin-right: 6px;">
 SELECT ?countries_in_esc_by_year ?country_in_year ?year as ?Year ?country as ?Country ?entrant ?entrant_label as ?Creator ?song ?song_label as ?Title ?was_derived_from as ?WikipediaURL
 WHERE {
@@ -105,89 +114,113 @@
 }
 ORDER BY DESC(?countries_in_esc_by_year)
-</pre>
-
-</p>
-
-<p>
-
-The resulting SPARQL query result set (JSON format selected for output) is then ingested into a Greenstone DL
-collection, and used in a variety of ways.
-For now an (admittedly cryptic) list of technical steps that were developed and/or deployed to
-provide the functionality encountered in interacting with this site.
-
-<ul>
-<li>New SPARQL plugin for <i>download_from.pl</i> developed, used in GLI to enter the above query</li>
-<li>New SPARQL <i>Document Processing</i> plugin developed</li>
-<li>Greenstone3 Apache Jena Triple Store Extension activated</li>
-<li>SGVizler used to display Google Visualizations such as the pie-chart above.</li>
-<li>Metadata in document view enhanced through Greenstone Format Statements micro-data</li>
-<li>Custom <i>interface</i> developed</li>
-</ul>
-</p>
-
-<p>
-Bullet points above to be expanded upon!
-</p>
-
-<p>
-Viewing the <a download="collectionConfig.xml" href="sites/{$site_name}/collect/{$collName}/etc/collectionConfig.xml">collection configuration file</a> provides
-a good insight into how all of these technical aspectcs are brought together.
-</p>
+</pre>
+
+</p>
+
+<p>
+The resulting SPARQL query result set (JSON format
+selected for output) is then ingested into a Greenstone
+DL collection, and used in a variety of ways. For now
+an (admittedly cryptic) list of technical steps that
+were developed and/or deployed to provide the
+functionality encountered in interacting with this site.
+
+<ul>
+<li>New SPARQL plugin for <i>download_from.pl</i> developed, used in GLI to enter the above query</li>
+<li>New SPARQL <i>Document Processing</i> plugin developed</li>
+<li>Greenstone3 Apache Jena Triple Store Extension activated</li>
+<li>SGVizler used to display Google Visualizations such as the pie-chart above.</li>
+<li>Metadata in document view enhanced through Greenstone Format Statements micro-data</li>
+<li>Custom <i>interface</i> developed</li>
+</ul>
+</p>
+
+<p>
+Bullet points above to be expanded upon!
+</p>
+
+<p>
+Viewing the
+<a download="collectionConfig.xml"
+href="sites/{$site_name}/collect/{$collName}/etc/collectionConfig.xml">collection
+configuration file</a> provides a good insight into how
+all of these technical aspectcs are brought together.
+</p>
+
+<p>
+Full disclosure as to how the collection all ticks is
+provided through our Subversion repository. Topping up
+our
+<a href="https//trac.greenstone.org/browser/main/trunk/greenstone3">Greenstone3
+code base</a> we have:
+
+<ul>
+<li>The site: <a href="https://trac.greenstone.org/browser/main/trunk/model-sites-dev/eurovision-lod">eurovision-lod</a></li>
+<li>The interface: <a href="https://trac.greenstone.org/browser/main/trunk/model-interfaces-dev/eurovision-lod">eurovision-lod</a></li>
+<li>The triplestore extension: <a href="https://trac.greenstone.org/browser/gs2-extensions/apache-jena/trunk/src">apache-jena</a></li>
+</ul>
+
+
+</p>
+
+<h2>Eurovision LOD SPARQL Endpoints</h2>
+
+<p>
+The source data can be access vis the DBpedia SPARQL endpoint. The ingested,
+data (with correction) is available through the collection's local
+SPARQL endpoint:
+<ul>
+<li>
+<a href="https://dbpedia.org/sparql/">DBpedia SPARQL endpoint</a>
+</li>
+<li>
+<a href="{$library_name}/collection/{$collName}/page/sparql">local SPARQL endpoint</a>
+</li>
+</ul>
+</p>
+
+<h2>Voting Data</h2>
+<p>
+The Voting data used in this collection is sourced from the Kaggle, which in turn
+is derived from work available through Data Graver:
+<ul>
+<li><a href="https://www.kaggle.com/datagraver/eurovision-song-contest-scores-19752019">Kaggle Eurovision Voting dataset 1975-2019</a></li>
+<li><a href="https://data.world/datagraver/eurovision-song-contest-scores-1975-2019">Data Graver</a></li>
+<li><a href="https://docs.google.com/spreadsheets/d/1veXpiF54hQGP4OVuf1xjowumIe8HUOhI/edit#gid=528591420">Google Spreadsheet (internal use only)</a></li>
+
+</ul>
+</p>
+
+<h2>Eurovision LOD Errata</h2>
+<p>
+Songs titles and Entrants (artists) that do not resolve to URIs:
+<ul>
+<li>
+<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-songs.html">Problem Songs (titles are string literals)</a>
+</li>
+<li>
+<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-entrants.html">Problem Entrants (artists are string literals)</a>
+</li>
+<li>
+<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-songs-and-entrants.html">Problem Songs and Entrants (both titles and artists are string literals)</a>
+</li>
+<li>
+<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-return-path-songs-and-entrants.html">Problem 'return paths' for Songs and Entrants (the reverse check that a song entry specifies the dbp:artist as a URI fails)</a>
+</li>
+</ul>
+</p>
+
+</div>
 
-<p>
-Full disclosure as to how the collection all ticks is provided through our
-Subversion repository. Topping up our <a href="https//trac.greenstone.org/browser/main/trunk/greenstone3">Greenstone3 code base</a>
-we have:
-
-<ul>
-<li>The site: <a href="https://trac.greenstone.org/browser/main/trunk/model-sites-dev/eurovision-lod">eurovision-lod</a></li>
-<li>The interface: <a href="https://trac.greenstone.org/browser/main/trunk/model-interfaces-dev/eurovision-lod">eurovision-lod</a></li>
-<li>The triplestore extension: <a href="https://trac.greenstone.org/browser/gs2-extensions/apache-jena/trunk/src">apache-jena</a></li>
-</ul>
-
-
-</p>
-
-<h2>Eurovision LOD SPARQL Endpoint</h2>
-<p>
-This Digital Library collection can be queried through this
-<a href="{$library_name}/collection/{$collName}/page/sparql">SPARQL endpoint</a>
-</p>
-
-<p>
-This DBpedia endpoint is available
-<a href="https://dbpedia.org/sparql/">here</a>
-</p>
+</div>
+</div>
 
-<h2>Voting Data</h2>
-<p>
-<ul>
-<li>https://data.world/datagraver/eurovision-song-contest-scores-1975-2019</li>
-<li>https://docs.google.com/spreadsheets/d/1veXpiF54hQGP4OVuf1xjowumIe8HUOhI/edit#gid=528591420</li>
-<li>(https://www.kaggle.com/datagraver/eurovision-song-contest-scores-19752019)</li>
-</ul>
-</p>
-
-<h2>Eurovision LOD Errata</h2>
-<p>
-Songs titles and Entrants (artists) that do not resolve to URIs:
-<ul>
-<li>
-<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-songs.html">Problem Songs (titles are string literals)</a>
-</li>
-<li>
-<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-entrants.html">Problem Entrants (artists are string literals)</a>
-</li>
-<li>
-<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-songs-and-entrants.html">Problem Songs and Entrants (both titles and artists are string literals)</a>
-</li>
-<li>
-<a href="sites/{$site_name}/collect/{$collName}/prepare/problem-return-path-songs-and-entrants.html">Problem 'return paths' for Songs and Entrants (the reverse check that a song entry specifies the dbp:artist as a URI fails)</a>
-</li>
-</ul>
-</p>
-
-</div>
-
+<script>
+<xsl:text disable-output-escaping="yes">
+$(function(){
+transformToTurnstyleBlock("technicaldev");
+});
+</xsl:text>
+</script>
 
 </xsl:template>
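Reviewer note: the about page says the SPARQL result set is retrieved from DBpedia with JSON selected as the output format. As a minimal sketch of what such a request can look like under the standard SPARQL protocol (a `query` parameter plus an `Accept` header), the Python below builds the request without sending it. The helper name `build_sparql_request` and the one-line sample query are assumptions for illustration; the collection's real query is the one quoted in the diff, parts of which are elided there, so it is not reproduced here. Only the endpoint URL comes from the page.

```python
from urllib.parse import parse_qs, urlencode, urlsplit
from urllib.request import Request

# Endpoint URL as linked from the about page.
DBPEDIA_ENDPOINT = "https://dbpedia.org/sparql/"


def build_sparql_request(endpoint, query):
    """Build (but do not send) a GET request asking for JSON results.

    Hypothetical helper: the query travels URL-encoded in the `query`
    parameter and the JSON result format is requested via `Accept`.
    """
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


# Placeholder query, not the collection's query.
sample_query = "SELECT ?s WHERE { ?s ?p ?o } LIMIT 1"
request = build_sparql_request(DBPEDIA_ENDPOINT, sample_query)

# Round-trip check: decode the query back out of the request URL.
sent_query = parse_qs(urlsplit(request.full_url).query)["query"][0]
```

Passing `request` to `urllib.request.urlopen` would perform the call; the same helper could be pointed at the collection's local SPARQL endpoint, assuming it follows the same protocol.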