This set of scripts and classes is an example of a subsystem responsible for migrating data
from a legacy MSSQL database to the new system.
Most of the code has been stripped out for obvious reasons, but what remains still shows the overall structure and approach.
legacy_migration.py - main entry point; launches the specified sources as parallel jobs, in chunks
src/services/data_migration/data - collection of classes responsible for collecting data from the legacy system
src/services/data_migration/handler - collection of classes responsible for processing the source data and importing it into the new system
src/service/data_migration/schemas - data schemas
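The layout above (a collector that reads from the legacy system, a handler that imports into the new one, and an entry point that fans the work out in chunks) can be sketched roughly as follows. This is a minimal illustration, not the actual implementation: `iter_chunks`, `migrate_source`, `fetch` and `handle` are hypothetical names, and the use of `ThreadPoolExecutor` is an assumption based on the "chunk size to process in one thread" wording of the CLI help.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterator, List, Tuple


def iter_chunks(total: int, chunk_size: int) -> Iterator[Tuple[int, int]]:
    """Yield (offset, limit) pairs that together cover `total` rows."""
    for offset in range(0, total, chunk_size):
        yield offset, min(chunk_size, total - offset)


def migrate_source(
    fetch: Callable[[int, int], List],   # hypothetical collector: reads one chunk from the legacy DB
    handle: Callable[[List], int],       # hypothetical handler: imports rows, returns the imported count
    total: int,
    chunk_size: int,
    concurrency: int,
) -> int:
    """Process one source in chunks, one chunk per worker thread at a time."""

    def job(bounds: Tuple[int, int]) -> int:
        offset, limit = bounds
        return handle(fetch(offset, limit))

    # At most `concurrency` chunks are in flight at any moment.
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        return sum(pool.map(job, iter_chunks(total, chunk_size)))
```

In this sketch the collector and the handler are plain callables, so each side can be swapped or tested in isolation; the real classes in `data` and `handler` may be wired together differently.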
$ python legacy_migration.py --help
usage: legacy_migration.py [-h] [--concurrency CONCURRENCY] [--chunk-size CHUNK_SIZE] [--sources SOURCES] [--log-level LOG_LEVEL]

Migrate legacy data

options:
  -h, --help            show this help message and exit
  --concurrency CONCURRENCY
                        Concurrency level
  --chunk-size CHUNK_SIZE
                        Chunk size to process in one thread
  --sources SOURCES     Comma separated list of sources to migrate. all to migrate all of them
  --log-level LOG_LEVEL
                        Logger level
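
For reference, help output of this shape could be produced by an `argparse` parser along the following lines. This is a sketch under assumptions: the option names and help strings come from the output above, but the argument types, the default values, the `build_parser` and `parse_sources` helpers, and the example source names are hypothetical and may not match the real script.

```python
import argparse
from typing import List, Sequence


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(
        prog="legacy_migration.py",
        description="Migrate legacy data",
    )
    # Types and defaults below are assumptions; the help output does not state them.
    parser.add_argument("--concurrency", type=int, default=4, help="Concurrency level")
    parser.add_argument("--chunk-size", type=int, default=1000,
                        help="Chunk size to process in one thread")
    parser.add_argument("--sources", default="all",
                        help="Comma separated list of sources to migrate. "
                             "all to migrate all of them")
    parser.add_argument("--log-level", default="INFO", help="Logger level")
    return parser


def parse_sources(raw: str, available: Sequence[str]) -> List[str]:
    """Expand the --sources value into a list of source names."""
    if raw.strip().lower() == "all":
        return list(available)
    return [name.strip() for name in raw.split(",") if name.strip()]
```

With such a parser, an invocation like `python legacy_migration.py --concurrency 8 --chunk-size 500 --sources users,orders` would migrate only the two named sources (the names `users` and `orders` are placeholders).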