PyCollateX

PyCollateX is a fork of CollateX-pythonport with:

improvements for Python3 and Unicode regex
less dependencies (at the cost of export features)

CollateX is a software to

read multiple (>= 2) versions of a text, splitting each version into parts (tokens) to be compared,
identify similarities of and differences between the versions (including moved/transposed segments) by aligning tokens, and
output the alignment results in a variety of formats for further processing, for instance to support the production of a critical apparatus or the stemmatic analysis of a text's genesis.

Features

Partially non-progressive multiple-sequence alignment
Multiple output formats: alignment table, variant graph
Near matching (optional)

Simple example

from pycollatex import *

collation = Collation()
collation.add_plain_witness("A", "The quick brown fox jumps over the dog.")
collation.add_plain_witness("B", "The brown fox jumps over the lazy dog.")

collation_graph = collate(collation, segmentation=False)
alignment_table = output_collation_graph(collation, collation_graph)
print(alignment_table)

outputs:

+---+-----+-------+--------------------------+------+------+
| A | The | quick | brown fox jumps over the | -    | dog. |
| B | The | -     | brown fox jumps over the | lazy | dog. |
+---+-----+-------+--------------------------+------+------+

Name		Name	Last commit message	Last commit date
Latest commit History 3,244 Commits
ClusterShell		ClusterShell
pycollatex		pycollatex
tests		tests
use_cases		use_cases
.editorconfig		.editorconfig
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
openclaw_documentation_README.md		openclaw_documentation_README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyCollateX

Features

Simple example

Original CollateX Contributors

Authors

Contributors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PyCollateX

Features

Simple example

Original CollateX Contributors

Authors

Contributors

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages