From 7f2e8f6b5f795162cd54977702c7e92081ec3cac Mon Sep 17 00:00:00 2001
From: fuksja <49232467+fuksja@users.noreply.github.com>
Date: Wed, 13 Jul 2022 10:38:56 +0200
Subject: [PATCH 1/2] update> new functionality

---
 README.md | 25 ++++++++++++++++++++-----
 1 file changed, 20 insertions(+), 5 deletions(-)

diff --git a/README.md b/README.md
index 678233f..8b4b7bc 100644
--- a/README.md
+++ b/README.md
@@ -20,35 +20,50 @@ Stakeholder requirements: there is a need for a tool for fast and automated conv
 
 General description of fuctionality: user goes to the upload page, uploads a pdf file, conversion takes place and user receives doc file as output.
 
+EDIT: added new functionality: conversion to .pptx format. User uploads file and chooses whether to convert to .doc or .pptx
+
 ### Assumptions:
 - the project will only be using open source software and will be open software licensed
 - no conversion of encrypted files for now
 - all pages converted as default
 - custom max file size limitation
+- no special security features
+- simple conversion from pdf to .pptx as images put in slides, no strings OCRed
 
 ### Limitations:
 - english language version for now
 - no security features, user profiles, login option, session control, simple file input and output for now
-- limitations derived from conversion method and library [pdf2doc](https://pypi.org/project/pdf2docx/):
+- for conversion to doc format:
+  limitations derived from conversion method and library [pdf2doc](https://pypi.org/project/pdf2docx/):
   - text based files
   - language from left to right
   - no rotation possible
   - no 1:1 layout conversion achievable
+- for conversion to .pptx format:
+  limitations derived from conversion method and library [pdf2pptx](https://pypi.org/project/pdf2pptx/):
+  - each original file page rendered as a PNG image and input into a Powerpoint slide
+  - slides not editable, no OCR - but may be presented as slides
 
 ## Getting started
 - chosen language/method: Python3 and flask
-- chosen method of file conversion: pdf2docx 0.5.3 library: https://pypi.org/project/pdf2docx/
+- chosen method of file conversion:
+  - pdf2docx 0.5.3 library: https://pypi.org/project/pdf2docx/
+  - pdf2pptx 1.0.5 library: https://pypi.org/project/pdf2pptx/
 
 ## Time frame
-Project completed June 2022. May be continued and improved upon in the future.
+First part of the project completed in June 2022. Second part, with addition of .pptx feature completed in July. Project will be updated in the future.
 
 ## Documentation
 This github repository serves as projects documentation.
 
 ## License and copyright notice
-This project uses GPLv3 license. Part of this project is derived from other software, created by other programmers, community or made in different way also under the GNU General Public License v3.0:
-[Source of pdf2docx library used for file conversion](https://github.com/dothinking/pdf2docx)
+This project uses GPLv3 license and MIT license. Part of this project is derived from other software, created by other programmers, community or made in different way also under the GNU General Public License v3.0:
+
+[Source of pdf2docx library used for file conversion to .doc](https://github.com/dothinking/pdf2docx)
 [License](https://github.com/dothinking/pdf2docx/blob/master/LICENSE)
+
+[Source of pdf2pptx library used for file conversion to .pptx](https://github.com/kevinmcguinness/pdf2pptx)
+[License](https://github.com/kevinmcguinness/pdf2pptx/blob/master/LICENSE)

From 9db207ca25266b7e911842de67db927b057b0462 Mon Sep 17 00:00:00 2001
From: fuksja <49232467+fuksja@users.noreply.github.com>
Date: Wed, 13 Jul 2022 10:40:15 +0200
Subject: [PATCH 2/2] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 8b4b7bc..c7dc967 100644
--- a/README.md
+++ b/README.md
@@ -20,7 +20,7 @@ Stakeholder requirements: there is a need for a tool for fast and automated conv
 
 General description of fuctionality: user goes to the upload page, uploads a pdf file, conversion takes place and user receives doc file as output.
 
-EDIT: added new functionality: conversion to .pptx format. User uploads file and chooses whether to convert to .doc or .pptx
+EDIT: added new functionality: conversion to .pptx format. User uploads file and chooses whether to convert to .doc or .pptx and converted file pops out.
 
 ### Assumptions:
 - the project will only be using open source software and will be open software licensed