- Timestamp: 2019-03-13T20:19:47+13:00
- Files: 1 edited
documentation/trunk/tutorials/xml-source/tutorial_en.xml
r32896 r32897
1607 1607   </NumberedItem>
1608 1608   <NumberedItem>
1609      - <Text id="ew-27">In the <AutoText key="glidict::GUI.Enrich"/> panel, look at the metadata that has been extracted for <Path>word05.doc</Path> and <Path>word06.doc</Path>. Now open the documents in Word and look at what properties have been set (<Menu>File → Properties</Menu> for Word 2003. In Word 2007/2010, click the Word Icon on the top left, then choose <Menu>Prepare → Properties</Menu>. In Word 2013, <Menu>File → Info</Menu>; the Properties section is on the right.) .They have Title, Author, Subject, and Keywords properties. <AutoText text="WordPlugin"/> can be configured to look for these properties and extract them.</Text>
     1609 + <Text id="ew-27">In the <AutoText key="glidict::GUI.Enrich"/> panel, look at the metadata that has been extracted for <Path>word05.doc</Path> and <Path>word06.doc</Path>. Now open the documents in Word and look at what properties have been set (<Menu>File → Properties</Menu> for Word 2003. In Word 2007/2010, click the Word Icon on the top left, then choose <Menu>Prepare → Properties</Menu>. In Word 2013, <Menu>File → Info</Menu>; the Properties section is on the right.) They have Title, Author, Subject, and Keywords properties. <AutoText text="WordPlugin"/> can be configured to look for these properties and extract them.</Text>
1610 1610   </NumberedItem>
1611 1611   <NumberedItem>
…    …
1844 1844   </NumberedItem>
1845 1845   <NumberedItem>
1846      - <Text id="0393a">Many HTML documents contain metadata in <Format><meta></Format> tags in the <Format><head></Format> of the page. Open up the <Path>englishhistory.net → tudor → monarchs → boleyn.html</Path> file by navigating to it in the tree on the left hand side, and double clicking it. This will open it in a web browser. View the HTML source of the page (<Menu>View → Source</Menu> in Internet Explorer, <Menu>Tools → Web Developer → Page Source</Menu> in Mozilla ). You will notice that this page has <AutoText text="page_topic, content" type="italics"/> and <AutoText text="author" type="italics"/> metadata.</Text>
     1846 + <Text id="0393a">Many HTML documents contain metadata in <Format><meta></Format> tags in the <Format><head></Format> of the page. Open up the <Path>englishhistory.net → tudor → monarchs → boleyn.html</Path> file by navigating to it in the tree on the left hand side, and double clicking it. This will open it in a web browser. View the HTML source of the page (<Menu>View → Source</Menu> in Internet Explorer, <Menu>Tools → Web Developer → Page Source</Menu> in Mozilla, and press Ctrl+U in Microsoft Edge). You will notice that this page has <AutoText text="page_topic, content" type="italics"/> and <AutoText text="author" type="italics"/> metadata.</Text>
1847 1847   </NumberedItem>
1848 1848   <NumberedItem>
…    …
1914 1914   </Heading>
1915 1915   <Comment>
1916      - <Text id="0457a">Next we'll add an interactive hierarchical phrase browsing classifier to this collection. </Text>
     1916 + <Text id="0457a">Next we'll add an interactive hierarchical phrase browsing classifier to this collection. Java applet support has been phased out in various browsers and browser versions. As a result the following will not work on <Link url="https://stackoverflow.com/questions/31816839/how-do-i-enable-java-in-microsoft-edge-web-browser">Microsoft Edge</Link> browsers, among others.</Text>
1917 1917   </Comment>
1918 1918   <NumberedItem>
…    …
1972 1972   </Comment>
1973 1973   <NumberedItem>
1974      - <Text id="0463">Switch to the <AutoText key="glidict::GUI.Create"/> panel ,select <AutoText text="Import Options"/> on the left and view the options that are then displayed to the right. Select <AutoText text="maxdocs"/> and set its numeric counter to <AutoText text="3"/>. (When in GLI's <AutoText key="glidict::Preferences.Mode.Expert"/> Mode, the <AutoText text="maxdocs"/> option for the import process are located under the <AutoText text="Import Options"/> of the <AutoText key="glidict::GUI.Create"/> panel.) Now <b>build</b>.</Text>
     1974 + <Text id="0463">Switch to the <AutoText key="glidict::GUI.Create"/> panel. Expand teh top panel to be able to see the options for collection building. Scroll to view them all, then select <AutoText text="Import Options"/> on the left and view the options that are then displayed to the right. Select <AutoText text="maxdocs"/> and set its numeric counter to <AutoText text="3"/>. (When in GLI's <AutoText key="glidict::Preferences.Mode.Expert"/> Mode, the <AutoText text="maxdocs"/> option for the import process are located under the <AutoText text="Import Options"/> of the <AutoText key="glidict::GUI.Create"/> panel.) Now <b>build</b>.</Text>
1975 1975   </NumberedItem>
1976 1976   <NumberedItem>