- Type: Bug
- Status: Awaiting Documentation
- Priority: Minor
- Resolution: Unresolved
- Affects Version/s: 6.1, 6.2, 6.3
- Fix Version/s: 7.0
- Component/s: None
- Labels: None
- Attachments:
- Comments: 5
- Documentation Status: Complete or Committed
When filter-media is run on a .doc (Word) bitstream, an error occurs and filter-media crashes.
The issue seems to be caused by a dependency on different versions of org.apache.poi. In DSpace 6 the version of org.apache.poi is 3.17: https://github.com/DSpace/DSpace/blob/dspace-6.3/pom.xml#L1204 (dspace-6_x branch: https://github.com/DSpace/DSpace/blob/dspace-6_x/pom.xml#L1204)
The dspace-tm-extractors dependency requires an older version of poi (I think version 3.6):
https://github.com/DSpace/DSpace/blob/dspace-6.3/pom.xml#L1074
Stacktrace:
Exception: org.apache.poi.util.LittleEndian.getUnsignedByte([BI)I
java.lang.NoSuchMethodError: org.apache.poi.util.LittleEndian.getUnsignedByte([BI)I
    at org.textmining.extraction.word.model.FormattedDiskPage.<init>(FormattedDiskPage.java:66)
    at org.textmining.extraction.word.model.CHPFormattedDiskPage.<init>(CHPFormattedDiskPage.java:62)
    at org.textmining.extraction.word.model.CHPBinTable.<init>(CHPBinTable.java:70)
    at org.textmining.extraction.word.Word97TextExtractor.getText(Word97TextExtractor.java:122)
    at org.textmining.extraction.word.Word97TextExtractor.getText(Word97TextExtractor.java:63)
    at org.dspace.app.mediafilter.WordFilter.getDestinationStream(WordFilter.java:83)
    at org.dspace.app.mediafilter.MediaFilterServiceImpl.processBitstream(MediaFilterServiceImpl.java:361)
    at org.dspace.app.mediafilter.MediaFilterServiceImpl.filterBitstream(MediaFilterServiceImpl.java:218)
    at org.dspace.app.mediafilter.MediaFilterServiceImpl.filterItem(MediaFilterServiceImpl.java:181)
    at org.dspace.app.mediafilter.MediaFilterServiceImpl.applyFiltersItem(MediaFilterServiceImpl.java:159)
    at org.dspace.app.mediafilter.MediaFilterCLITool.main(MediaFilterCLITool.java:315)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:497)
    at org.dspace.app.launcher.ScriptLauncher.runOneCommand(ScriptLauncher.java:229)
    at org.dspace.app.launcher.ScriptLauncher.main(ScriptLauncher.java:81)
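For anyone who wants to confirm the mismatch on their own installation: the descriptor `([BI)I` in the error denotes a method that takes `(byte[], int)` and returns `int`. The sketch below is only an illustration, not part of DSpace. The class name `PoiMethodCheck` and the helper `describe` are hypothetical. It uses reflection to report whether the class on the current classpath still offers a public method with that exact shape. It needs the same POI jar that filter-media loads to be on the classpath for the result to be meaningful.

```java
import java.lang.reflect.Method;

// Hypothetical diagnostic helper, not part of DSpace or Apache POI.
public class PoiMethodCheck {

    /**
     * Reports whether className has a public method methodName(byte[], int)
     * and whether it returns int, which is what the descriptor ([BI)I requires.
     */
    static String describe(String className, String methodName) {
        try {
            Class<?> cls = Class.forName(className);
            Method m = cls.getMethod(methodName, byte[].class, int.class);
            if (m.getReturnType() == int.class) {
                return "present";
            }
            // Same name and parameters but another return type still fails
            // at link time, because the JVM matches the full descriptor.
            return "present with return type " + m.getReturnType().getName();
        } catch (ClassNotFoundException e) {
            return "class not on classpath";
        } catch (NoSuchMethodException e) {
            return "method missing";
        }
    }

    public static void main(String[] args) {
        // Class and method names are taken from the stack trace above.
        System.out.println(
            describe("org.apache.poi.util.LittleEndian", "getUnsignedByte"));
    }
}
```

Any result other than "present" would be consistent with the NoSuchMethodError above: the text-mining extractor was compiled against a method that the POI version on the classpath does not provide in that form.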
- Is related to: DS-1140 Update MSWord Media Filter to use Apache POI (like PPT Filter) and also support .docx (Closed)