Conversion from PDF to DOCX with Aspose.PDF results in a distorted file

Dear Aspose Team,

We use Aspose.PDF to convert incoming PDF files to DOCX files before we process them in our system. The process is simple: we load the PDF into a Document and then call the Save method with the output path and saveOptions.

Please find attached the source PDF file and the distorted DOCX output:

PPT_PDFmemoQ確認用ダミーデータ.pdf (303.0 KB)

temp.docx (211.9 KB)

We observed that if we set DocSaveOptions.RecognitionMode.EnhancedFlow in the save options, the result is the distorted file attached above. If we use DocSaveOptions.RecognitionMode.Flow instead, the result is fine: the conversion is not distorted and closely resembles the original PDF. We switched to DocSaveOptions.RecognitionMode.EnhancedFlow 6 months ago because we found it more precise, and it resolved a lot of issues in our system. Now, however, we have found a case where DocSaveOptions.RecognitionMode.EnhancedFlow gives the worse result.

We could reproduce the issue with version 23.11.1 and with the latest version of Aspose.PDF.

Is there a setting that we are missing, or is this a real issue?

We look forward to your findings.

Kind regards,
Varga Matild
