Ticket #921 (new defect)

Opened 2 years ago

Last modified 2 years ago

Encoding issue when importing HTML with associated files

Reported by: litvinovg
Owned by: nobody
Priority: moderate
Milestone:
Component: Collection Building
Severity: major
Keywords:
Cc:

Description

When HTML files are imported from a folder with a Russian name, importing their associated files fails. The build log shows:

Warning: archiveinf_files_to_field() /home/iphlib/testing/web/sites/localsite/collect/philosna/import/Гассенди П. Сочинения т.1 (Философское наследие т.21) 1966/Гассенди П. Сочинения т.1 (Философское наследие т.21) - 1966-img001.jpg does not appear to be on the file system

Further investigation showed that the path is built by concatenating the HTML file path and the internal link found in the HTML; each looks correct on its own. It looks like the two are in different encodings.
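
To illustrate the suspected failure mode, here is a minimal Python sketch (not the collection-building code itself). It assumes, purely for illustration, that the folder name on disk is UTF-8 and the link inside the HTML is cp1251; the real encodings involved have not been confirmed. The helper names join_as_bytes and is_valid_utf8 and the shortened file names are hypothetical.

```python
def join_as_bytes(folder: str, folder_enc: str, link: str, link_enc: str) -> bytes:
    """Join a folder name and a link at the byte level, each in its own encoding."""
    return folder.encode(folder_enc) + b"/" + link.encode(link_enc)


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Shortened, hypothetical names standing in for the real folder and image.
folder = "Гассенди"
link = "Гассенди-img001.jpg"

# What the file system would hold if everything were UTF-8 (assumption).
on_disk = join_as_bytes(folder, "utf-8", link, "utf-8")

# What a byte-level join produces if the link is cp1251 (assumption).
mixed = join_as_bytes(folder, "utf-8", link, "cp1251")

# Each half is fine on its own, but the joined bytes no longer match
# the on-disk name, so a lookup with this path would find nothing.
print(mixed == on_disk)
print(is_valid_utf8(on_disk), is_valid_utf8(mixed))
```

Under these assumptions, both parts display correctly when decoded with their own encoding, which matches the observation that each looks fine separately; only the concatenated result is wrong.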

Change History

Changed 2 years ago by litvinovg

Failing to set metadata with metadataCSVPlugin when the file is in a folder with a Russian (non-Latin) name looks related to the same problem.
