PDF Analysis Tool

This is a Python script that analyzes PDF files using two command-line tools: sha256sum and pdfx. The script can be used to extract metadata and other information from multiple PDF files at once and store the results in a single text file.

Requirements:

Python 3.x
Command-line tools sha256sum and pdfx (typically installed on Linux or macOS systems)

Usage Open a terminal window. Navigate to the directory containing the run.py script. To execute the script in the current working directory, enter the following command:

python run.py

To execute the script in a specific directory, provide the directory path as a command-line argument: bash

python run.py /path/to/directory

The script will analyze all PDF files in the specified directory and write the results to a text file called 'output.txt'. The output for each file will include the name of the file, the SHA-256 hash calculated by 'sha256sum', and the metadata and other information extracted by 'pdfx'.

Output

The output is written to a text file called output.txt in the same directory as the script. Each file's output is separated by a horizontal line and includes the following information:

File name
SHA-256 hash calculated by sha256sum
Metadata and other information extracted by pdfx

That's it! With these instructions, you should be able to use the PDF Analysis Tool to quickly analyze multiple PDF files and extract useful information.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF Analysis Tool

Output

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PDF Analysis Tool

Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages