source: other-projects/metadata-encoding/README.txt@ 38630

Last change on this file since 38630 was 38628, checked in by jc550, 5 months ago

fix typo

File size: 2.2 KB
Line 
1Python based code for analysing the quality of metadata provided by
2academic publishing sources such as CrossRef.
3
4Quick, rough guide:
5
6#----
71. Download the CrossRef Dataset:
8#----
9
10The CrossRef dataset is available via AcademicTorrents:
11
12 https://academictorrents.com/details/d9e554f4f0c3047d9f49e448a7004f7aa1701b69
13
14At the time of writing, the provided dataset page doesn't link forward to
15newer releases. To keep an eye for new releases, Google Search "Public Data File from Crossref"
16
17 https://www.google.com/search?channel=fs&client=ubuntu&q=+Public+Data+File+from+Crossref+
18
19Transmission is a Torrent client availabe on Ubuntu Linux machine in
20CMS, which can be used to download the JSON files. If looking
21looking for a non-admin way to run a Torrent client then it
22is possible to download Vuze for Linux from
23
24 https://www.vuze.com/download.php
25
26(downloads as VuzeInstaller.tar.bz2)
27
28From that download, you can untar it, and then run ./vuze
29
30#----
312. Setup a MongoDB Server
32#----
33
34Get going with a mongodb server, for example:
35
36 svn co https://svn.greenstone.org/gs3-extensions/mongodb/trunk mongodb
37
38Then follow the instruction in mongodb/README.txt
39
40--
41Additional Notes:
42--
43
44Studio 3T is a GUI client for MongoDB. It can be downloaded from:
45
46 https://studio3t.com/
47
48It can be run with a free open source license (formly Robo 3T), but
49since the move to Studio 3T the developers start you off in the trial
50Pro/Ultimate version, so there are a few hoops to jump through to get
51to the open source version.
52
53You need to sign-up for an account as part of the installation process
54(can use Google Sign-in, which simplifies things), and then back in
55the GUI you can change its configuration settings straightaway to be
56the Free 3T version.
57
58In the GUI, setup/open a connection for:
59 mongodb:http://localhost:27017
60
61
62#----
633. Working with the Python code
64#----
65
66Create your own Python virtual environment, for example:
67
68 python3 -mvenv my-python3
69 source my-python3/bin/activate
70
71 pip install wheel
72
73
74To get going with the Python code itself
75
76 cd py
77 pip install -r requirements.txt
78
79
80
81
82
83
84Developed by Joel Crombie (jc550) as a Summer Research Project
85(ALPSS373-23C)
86
87
88
89
90
91--------------------------------------------------
Note: See TracBrowser for help on using the repository browser.