Opened 7 years ago

Last modified 7 years ago

#921 new defect

Encoding issue while import html with associated files

Reported by: Georgiy Litvinov Owned by: nobody
Priority: moderate Milestone:
Component: Collection Building Severity: major
Keywords: Cc:


If importing html files in folder named in russian, importing associated files failed. Build logs: Warning: archiveinf_files_to_field() /home/iphlib/testing/web/sites/localsite/collect/philosna/import/Гассенди П. Сочинения т.1 (Философское наследие т.21) 1966/Гассенди П. Сочинения т.1 (Философское наследие т.21) - 1966-img001.jpg does not appear to be on the file system

Further investigations showed that path is concatenated from html filepath and html intenal link (both looks well separately). Looks like they are in different encoding.

Change History (1)

comment:1 by Georgiy Litvinov, 7 years ago

Looks like failing to set metadata with metadataCSVPlugin while file is in russain (not latin symbols) named folder related to the same problem.

Note: See TracTickets for help on using tickets.