Opened 7 years ago
Last modified 7 years ago
#921 new defect
Encoding issue while import html with associated files
Reported by: | Georgiy Litvinov | Owned by: | nobody |
---|---|---|---|
Priority: | moderate | Milestone: | |
Component: | Collection Building | Severity: | major |
Keywords: | Cc: |
Description
If importing html files in folder named in russian, importing associated files failed. Build logs: Warning: archiveinf_files_to_field() /home/iphlib/testing/web/sites/localsite/collect/philosna/import/ÐаÑÑенди Ð. СоÑÐ¸Ð½ÐµÐ½Ð¸Ñ Ñ.1 (ФилоÑоÑÑкое наÑледие Ñ.21) 1966/Гассенди П. Сочинения т.1 (Философское наследие т.21) - 1966-img001.jpg does not appear to be on the file system
Further investigations showed that path is concatenated from html filepath and html intenal link (both looks well separately). Looks like they are in different encoding.
Looks like failing to set metadata with metadataCSVPlugin while file is in russain (not latin symbols) named folder related to the same problem.