Orchestrator-agent. Redesign and support for new seed methods #1

MaxFedotov · 2020-04-12T14:31:52Z

Hi @shlomi-noach,
We spoke some time ago about adding support for different seed methods for orchestrator and orchestrator-agent, and here it is.
This PR adds the following:

Redesign base struct and methods for agent
New logging
New command execution with pipes support
New API commands
New configuration file parameters and format
5 supported seed methods: mysqldump, mydumper, xtrabackup, lvm and clone plugin
Packaging using goreleaser
Integration tests for all new seed methods (see /tests/integration/README.MD)

We define a seed as an operation consisting of a set of predefined stages in which two orchestrator agents participate: the source agent (on the source side of the seed) and the target agent (on the target side of the seed). The goal of a seed is to transfer all data from the source agent to the target agent and then add the target agent as a slave of the source agent.

A seed can be executed using different seed methods, which are the specific tools or mechanisms used to transfer data from the source agent to the target agent.

To simplify developing new seed methods for orchestrator-agent, each seed method should implement the following interface:

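type Plugin interface {
	// Prepare sets up the given side (source or target) before the seed.
	Prepare(side Side)
	// Backup transfers data from source to target; the side that runs it depends on the seed method.
	Backup(seedHost string, mysqlPort int)
	// Restore loads the backup on the target side.
	Restore()
	// GetMetadata returns backup metadata (position/GTIDs) that orchestrator needs for ConnectSlave.
	GetMetadata() (*SeedMetadata, error)
	// Cleanup removes seed leftovers on the given side.
	Cleanup(side Side)
	// isAvailable reports whether the seed method can be used on the current host.
	isAvailable() bool
	// getSupportedEngines lists the storage engines the seed method supports.
	getSupportedEngines() []mysql.Engine
	// backupToDatadir reports whether the method backs up directly into the MySQL data directory.
	backupToDatadir() bool
}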
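
To illustrate what implementing this interface might look like, here is a minimal sketch of a do-nothing seed method. It is not code from this PR: `Side`, `Engine` (standing in for `mysql.Engine`), the `SeedMetadata` fields and the `dummySeed` type are all hypothetical, defined only so the example compiles on its own.

```go
package main

import "fmt"

// Side is a hypothetical stand-in for the PR's Side type.
type Side int

const (
	Source Side = iota
	Target
)

// Engine is a hypothetical stand-in for mysql.Engine.
type Engine string

// SeedMetadata fields are assumptions; the PR only says it carries position/GTIDs.
type SeedMetadata struct {
	LogFile      string
	LogPos       int64
	GtidExecuted string
}

// Plugin mirrors the interface from the description, with Engine replacing mysql.Engine.
type Plugin interface {
	Prepare(side Side)
	Backup(seedHost string, mysqlPort int)
	Restore()
	GetMetadata() (*SeedMetadata, error)
	Cleanup(side Side)
	isAvailable() bool
	getSupportedEngines() []Engine
	backupToDatadir() bool
}

// dummySeed is a hypothetical seed method that only records which stages were called.
type dummySeed struct {
	calls []string
}

func (d *dummySeed) Prepare(side Side) { d.calls = append(d.calls, "Prepare") }
func (d *dummySeed) Backup(seedHost string, mysqlPort int) {
	d.calls = append(d.calls, "Backup")
}
func (d *dummySeed) Restore()          { d.calls = append(d.calls, "Restore") }
func (d *dummySeed) Cleanup(side Side) { d.calls = append(d.calls, "Cleanup") }
func (d *dummySeed) GetMetadata() (*SeedMetadata, error) {
	// Made-up values; a real method would read them from the backup.
	return &SeedMetadata{LogFile: "mysql-bin.000001", LogPos: 4}, nil
}
func (d *dummySeed) isAvailable() bool             { return true }
func (d *dummySeed) getSupportedEngines() []Engine { return []Engine{"InnoDB"} }
func (d *dummySeed) backupToDatadir() bool         { return false }

// Compile-time check that dummySeed satisfies Plugin.
var _ Plugin = (*dummySeed)(nil)

func main() {
	d := &dummySeed{}
	var p Plugin = d
	p.Prepare(Target)
	p.Backup("source-host", 3306)
	p.Restore()
	p.Cleanup(Target)
	fmt.Println(d.calls)
}
```

Because the helper methods are unexported, an implementation like this has to live in the same package as the interface, which is why everything sits in one file here.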

By design, each seed operation consists of 5 stages, which orchestrator executes one after another. Each stage corresponds to a function that the seed method must support and to an orchestrator-agent API method that orchestrator uses to start it:

Prepare. At this stage, preparations for the seed are made on both sides, target and source. These can include creating or cleaning up directories and starting socat for data transfer.
Backup. At this stage the backup is executed, transferring data from source to target. The side on which it runs (target or source) depends on the seed method: mydumper, mysqldump and clone plugin run on the target side, while xtrabackup and lvm run on the source side. Each registered seed method should provide information about the side on which its backup process is executed.
Restore. At this stage the restore is executed, always on the target side. The operations depend on the seed method: for mysqldump it is just loading the .sql backup file into the database, while for xtrabackup it is a more complex process of uncompressing and preparing the backup data.
ConnectSlave. This stage is executed entirely by orchestrator. During this stage the target agent is connected as a slave of the source agent. To do this, orchestrator needs to know the position or GTIDs from the backup, so it asks the target agent for backup metadata by calling the GetMetadata method.
Cleanup. This stage is executed on both sides, target and source. It performs cleanup activities such as cleaning backup directories and unmounting snapshots.
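
The stage order and the per-method backup side described above can be sketched as a small planning function. This is only an illustration, not the orchestrator code: `backupSide`, `seedPlan` and the "stage@side" step strings are hypothetical, and the relative order of target and source within Prepare and Cleanup is an assumption.

```go
package main

import "fmt"

// Side names which agent runs a stage.
type Side string

const (
	Source Side = "source"
	Target Side = "target"
)

// backupSide records, per seed method, where the Backup stage runs,
// following the PR description. The map itself is a hypothetical helper.
var backupSide = map[string]Side{
	"mysqldump":    Target,
	"mydumper":     Target,
	"clone plugin": Target,
	"xtrabackup":   Source,
	"lvm":          Source,
}

// seedPlan returns the ordered steps for a seed as "stage@side" strings.
func seedPlan(method string) ([]string, error) {
	side, ok := backupSide[method]
	if !ok {
		return nil, fmt.Errorf("unknown seed method %q", method)
	}
	return []string{
		"Prepare@target", // order of target/source within a stage is assumed
		"Prepare@source",
		"Backup@" + string(side),
		"Restore@target",
		"GetMetadata@target", // used by orchestrator for the ConnectSlave stage
		"Cleanup@target",
		"Cleanup@source",
	}, nil
}

func main() {
	for _, method := range []string{"mysqldump", "xtrabackup"} {
		plan, err := seedPlan(method)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println(method, plan)
	}
}
```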

The other functions in this interface are helper methods:

isAvailable() - tests whether the seed method is available on the current host (for example, for xtrabackup it calls the xtrabackup --version command to check that the xtrabackup binary is available)
getSupportedEngines() - returns an array of engines supported by the seed method. We need this because different seed methods support different engines; for example, we can't use xtrabackup on MySQL 5.7 if there are tables with the MyRocks engine, as they are not supported
backupToDatadir() - returns true if the seed method is capable of backing up data directly to the MySQL data directory (like xtrabackup, clone plugin and lvm, which do not create intermediate backup files with data)
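
As a rough sketch of how the first two helpers could work, the example below checks availability by running a binary with a version flag and checks engine compatibility by comparing two lists. The function names `binaryAvailable` and `supportsAll`, the `Engine` type and the engine lists are hypothetical and only illustrative; they are not taken from the PR.

```go
package main

import (
	"fmt"
	"os/exec"
)

// Engine is a hypothetical stand-in for mysql.Engine.
type Engine string

const (
	InnoDB  Engine = "InnoDB"
	MyISAM  Engine = "MyISAM"
	MyRocks Engine = "ROCKSDB"
)

// binaryAvailable reports whether running `name args...` exits successfully,
// e.g. binaryAvailable("xtrabackup", "--version").
func binaryAvailable(name string, args ...string) bool {
	return exec.Command(name, args...).Run() == nil
}

// supportsAll reports whether every engine in use on the host
// appears in the seed method's list of supported engines.
func supportsAll(supported, inUse []Engine) bool {
	set := make(map[Engine]bool, len(supported))
	for _, e := range supported {
		set[e] = true
	}
	for _, e := range inUse {
		if !set[e] {
			return false
		}
	}
	return true
}

func main() {
	// Illustrative engine list, not the actual list used by the xtrabackup seed method.
	xtrabackupEngines := []Engine{InnoDB, MyISAM}

	fmt.Println("xtrabackup available:", binaryAvailable("xtrabackup", "--version"))
	fmt.Println("usable with MyRocks tables:", supportsAll(xtrabackupEngines, []Engine{InnoDB, MyRocks}))
}
```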

Stages are exposed through the orchestrator-agent API, and orchestrator calls them in order.
As each stage can take a long time to execute, stages are executed asynchronously in goroutines. To make it possible to track the progress of each stage, the seed method sends information about stage progress to orchestrator-agent via a channel. This information is also exposed through the API, so orchestrator can call it to track stage progress.
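
The asynchronous execution and progress reporting can be sketched roughly as follows. This is a simplified illustration under assumed names (`StageState`, `Agent`, `StartStage`, `Status`), not the agent's actual types or API: a stage runs in a goroutine, sends updates over a channel, and the agent keeps the latest state so that an API handler could return it.

```go
package main

import (
	"fmt"
	"sync"
)

// StageState is a hypothetical progress record for one seed stage.
type StageState struct {
	Stage   string
	Status  string // e.g. "Running", "Completed"
	Details string
}

// Agent keeps the most recent state so an API handler could expose it.
type Agent struct {
	mu     sync.RWMutex
	latest StageState
}

// StartStage runs work in a goroutine and records every progress update.
// The returned channel is closed once the stage has finished and all
// updates have been stored.
func (a *Agent) StartStage(stage string, work func(progress chan<- StageState)) <-chan struct{} {
	a.mu.Lock()
	a.latest = StageState{Stage: stage, Status: "Running"}
	a.mu.Unlock()

	progress := make(chan StageState)
	done := make(chan struct{})

	// Producer: the seed method doing the actual work.
	go func() {
		defer close(progress)
		work(progress)
	}()

	// Consumer: the agent storing the latest reported state.
	go func() {
		defer close(done)
		for st := range progress {
			a.mu.Lock()
			a.latest = st
			a.mu.Unlock()
		}
	}()
	return done
}

// Status returns the latest known state; an API endpoint would serve this.
func (a *Agent) Status() StageState {
	a.mu.RLock()
	defer a.mu.RUnlock()
	return a.latest
}

func main() {
	a := &Agent{}
	done := a.StartStage("Backup", func(progress chan<- StageState) {
		progress <- StageState{Stage: "Backup", Status: "Running", Details: "dumping data"}
		progress <- StageState{Stage: "Backup", Status: "Completed", Details: "backup finished"}
	})
	<-done // orchestrator would poll Status() instead of blocking
	fmt.Printf("%+v\n", a.Status())
}
```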

That is the basic overview of the process from the orchestrator-agent side. The philosophy is the same as before: the agent just exposes API calls to process a seed, and all logic is handled on the orchestrator side.

If you want to test it, there are integration tests available for all seed methods, for different versions of MySQL (5.7 and 8) and different binlog settings (GTID and positional). They are located in the /tests/integration/ folder, and the README.MD there explains how to run them.

We have started testing the new agent and orchestrator functionality in our staging environments, and I expect we will find issues related to this PR, so this functionality should be classified as beta for now.

But I will be very grateful for your feedback and comments :)
Thanks,
Max

aderumier · 2021-02-19T09:00:16Z

@MaxFedotov

Hi,

Thanks for your work!

I'm interested in testing it. Do you have any builds available for both the agent and orchestrator?

Maksim Fedotov added 30 commits December 27, 2019 20:17

orchestrator-agent redesign. Build everything upon Agent struct and methods

5d81e72

beautify logging

c25be86

add Mysql.Datadir, Mysql.LogFile to config

9cca0e1

dbagent package

86b1732

dbagent package. Add comments for methods\structs

c52f5b2

dbagent package. Functions to get database info

bd74d6d

move lvm functions to lvm. Replace old api calls with single get-agent call

600ebe2

AgentInfo use pointers instead of structures for LogicalVolume and Mount

8424f61

lvm and xtrabackup seed methods init

7bf1b01

seed methods initialization

fddda71

move MySQLPort to AgentParams. Use POST for SubmitAgent

4137819

add new cmd wrappers with pipe and output redirection support

7c4f297

seedstages, api for seeding + mysqldump seed method

355a122

remove BackupOldDatadir config option

351de78

add supported engines to seedMethods. Refactor engines for databases

d3267dd

add backupToDatadir to seed methods

ca820d9

agent. support structure changes on orchestrator side

d740548

agent. rename stages

3ef1e47

remove replication-user and replicaiton-password from config

e9c57af

remove old config. Change SeedStageStatus to be map[int]map[seed.Stage]*seed.SeedStageState

0ea37eb

dbagent return full mysql version

3e06e31

add cancelled seedStage

199b115

additional information about current agent active seed. Do not run seedstage if it is already running

89d36a3

remove mysql version info from agent, as orchestrator already knows it

fe45d6b

add post seed cmd api"

9196f55

add support for custom-commands

70da89f

update ssl_test

68b290b

switch all packages to use logrus

e581420

update vendor + add go mod

75f47b0

remove old json config file

a9de778

Maksim Fedotov added 28 commits February 28, 2020 19:22

add integration and functional tests

f5ebb49

integration tests + bugfix

0af4596

get rid of SeedStageStatus. Move current seed status to ActiveSeed

14f284e

add addtional-opts config for mysqldump. Move ssl_test to separate package. Move post-seed-command to custom-commands section in config

a4751c1

mysqldump support gtids + tests for gtids

78c0bc3

add stderr in case of cmd errors

6f14c97

add mydumper seed method

61327ff

add xtrabackup seed method

4742966

add cloneplugin seed method

3cf762f

lvm seed method

588be62

clonePlugin additional config opts

0c751d9

beautify tests + bugfix

974e716

README for tests

f993bca

update vagrant box version for mysql 5.7

8e30bb5

delete unused files

3047d8d

integration tests + bugfix

dfc2987

return mysql error log as string

7f14b26

rename seed-user and seed-password to user and password

5eca1e5

add system info and agent commands

8357c71

use custom script to build rpm packages as goreleaser do not add sha1\md5 checksums for rpm

b3db070

goreleaser disable release creation

1eb0a7d

add http url prefix if not set in orchestrator url

8751796

additional configuration params. support for sudo-user and mysql start\stop\status commands. additonal logging

40f8d60

add orchestrator-agent.lock file when using xtrabackup\lvm seed

e9db8ba

use hostname from hostname -f command instead of os.hostname

00deb4c

add README

f733128

add supported seed methods for README

623ff08

fix bug with multiple gtids in GetMetadata

10c49f5

MaxFedotov mentioned this pull request Apr 12, 2020

Orchestrator. New seeding algorithm, support for different seed methods and agent redesign openark/orchestrator#1120

Open
