I am responsible for about 80 physical servers. There are 3 roles amongst these: the primary role, a storage role, and 2 hosts that are CnC servers for all 78 hosts, which are spread over 3 locations. The primary and storage roles both have an install image available for building/rebuilding them, even though all software (bar some custom scripts for administration) is open source and available to the world. The CnC role is performed by a custom internal (closed source) set of applications, for which there are no .deb packages, source code, or system install image. If one goes down, I have no way to rebuild the host.

What is available to snapshot the system to allow a makeshift recovery? So what DR options are available? I know a snapshot is less than ideal, but in this environment I don't have a great deal of options. Also note that the servers all run hardware RAID1 arrays. One of my (terrible) ideas is, with some planned downtime, to remove a drive from the array, sync the disks, then return the original disk to the array and store the freshly synced disk as a kind of snapshot.

"Snapshot" usually means a specific LVM command. If you are not using LVM, then a backup is the answer. Running backups is usually not an issue unless files are open and being written; this usually only happens with databases. That means you back up everything using normal backup methods, then take extra steps to back up a consistent DB. There are lots of ways to ensure a consistent DB backup:
* shut down the DB while the backup is going on
* dump the DB to CSV or some other format that can be backed up easily
* use a DB backup tool specific to the DB being run

Of course, things can be as simple or complex as you choose. For example, I do not back up entire OSes here.
* any extra programs installed outside the package manager

I know many places will not load programs unless a package has been created - PERIOD. If it doesn't already exist, they make the package themselves. From what I understand, it isn't too difficult to do this. I've used a tool called epkg myself, but since switching to APT, haven't needed that.

RAID1 is great for HA, but doesn't help at all for backups. Backups are the most important thing that any admin can accomplish. The only thing more important is the restore.

This might be an opportunity to swap out custom C&C for something like Rex, Puppet, Chef, Salt, or a similar tool.

If you really want DR, everything should be 500+ miles apart and easily able to switch over with a few DNS changes.
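
As a rough illustration of the "dump the DB to CSV" option, here is a minimal Python sketch. It assumes the database is SQLite, which is purely an assumption for the example since the thread never says which DB is in use; the function name `dump_tables_to_csv` and the one-file-per-table layout are also made up here. The point is only that all tables are read inside one transaction, so the CSV files describe the same moment in time and can then be picked up by the normal file backup.

```python
import csv
import sqlite3
from pathlib import Path


def dump_tables_to_csv(db_path, out_dir):
    """Write every user table of an SQLite database to its own CSV file.

    Hypothetical helper for illustration. All reads happen inside a
    single transaction, so the files reflect one consistent state even
    if another process tries to write while the dump is running.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    # isolation_level=None puts the connection in autocommit mode,
    # which lets us issue BEGIN/COMMIT ourselves.
    conn = sqlite3.connect(str(db_path), isolation_level=None)
    written = []
    try:
        conn.execute("BEGIN")
        names = [row[0] for row in conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        )]
        # Skip SQLite's internal bookkeeping tables.
        tables = [n for n in names if not n.startswith("sqlite_")]

        for table in tables:
            quoted = '"' + table.replace('"', '""') + '"'
            cursor = conn.execute(f"SELECT * FROM {quoted}")
            target = out / f"{table}.csv"
            with target.open("w", newline="", encoding="utf-8") as fh:
                writer = csv.writer(fh)
                # Header row taken from the cursor's column names.
                writer.writerow([col[0] for col in cursor.description])
                writer.writerows(cursor)
            written.append(target)

        conn.execute("COMMIT")
    finally:
        conn.close()
    return written
```

For a server-based DB, the vendor's own dump tool is usually the better choice, as the list above already says. Whatever is used, the dump only counts as a backup once a restore from it has been tested.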