Ghostferry is a library that enables you to selectively copy data from one mysql instance to another with minimal amount of downtime.
It is inspired by Github's gh-ost, although instead of copying data from and to the same database, Ghostferry copies data from one database to another and have the ability to only partially copy data.
There is an example application called ghostferry-copydb included (under the
copydb directory) that demonstrates this library by copying an entire
database from one machine to another.
An overview of Ghostferry's high-level design is expressed in the TLA+
specification, under the tlaplus directory. It maybe good to consult with
that as it has a concise definition. However, the specification might not be
entirely correct as proofs remain elusive.
On a high-level, Ghostferry is broken into several components, enabling it to copy data. This is documented at https://shopify.github.io/ghostferry/master/technicaloverview.html
For a detailed tutorial, see the documentation.
Install:
- Have Docker installed
- Clone the repo
docker-compose up -d mysql-1 mysql-2
Test copydb locally:
mysql --protocol=tcp -u root -P 29291 < sql/n1create.sqlto create databases on the source servermysql --protocol=tcp -u root -P 29291 < sql/n1users.sqlto create ghostferry user on source servermysql --protocol=tcp -u root -P 29292 < sql/n2users.sqlto create ghostferry user on destination serverdocker run --name ghostferry -p 8000:8000 -v $(pwd)/log:/log -v $(pwd)/config:/config pasientskyhosting/ps-ghostferry ghostferry-copydb -verbose -dryrun config/examplerun.jsonto perform a dry rundocker run --name ghostferry -p 8000:8000 -d -v $(pwd)/log:/log -v $(pwd)/config:/config pasientskyhosting/ps-ghostferry ghostferry-copydb -verbose config/examplerun.jsonto start the run- You can access the web UI on port 8000
- Log and state dump can be found in the
logdirectory - For a more detailed tutorial, see the documentation.