glusterfind - A tool to find Modified files/dirs
A tool which helps to get full/incremental list of files/dirs from GlusterFS Volume using Changelog/Find. In Gluster volumes, detecting the modified files is challenging. Readdir on a directory leads to multiple network calls since files in a directory are distributed across nodes.
This tool should be run in one of the node, which will get Volume info and gets the list of nodes and brick paths. For each brick, it spawns the process and runs crawler command in respective node. Crawler will be run in brick FS(xfs, ext4 etc) and not in Gluster Mount. Crawler generates output file with the list of files modified after last run or after the session creation.
Create a glusterfind session to remember the time when last sync or processing complete. For example, your backup application runs every day and gets incremental results on each run. The tool maintains session in
$GLUSTERD_WORKDIR/glusterfind/, for each session it creates and directory and creates a sub directory with Volume name. (Default working directory is /var/lib/glusterd, in some systems this location may change. To find Working dir location run
grep working-directory /etc/glusterfs/glusterd.vol or
grep working-directory /usr/local/etc/glusterfs/glusterd.vol if source install)
For example, if the session name is "backup" and volume name is "datavol", then the tool creates
$GLUSTERD_WORKDIR/glusterfind/backup/datavol. Now onwards we refer this directory as
create => pre => post => [delete]
Once the session is created, we can run the tool with two steps Pre and Post. To collect the list of modified files after the create time or last run time, we need to call pre command. Pre command finds the modified files and generates output file. Consumer can check the exit code of pre command and start processing those files. As a post processing step run the post command to update the session time as per latest run.
For example, backup utility runs Pre command and gets the list of files/directories changed. Sync those files to backup target and inform to glusterfind by calling Post command.
At the end of Pre command,
$SESSION_DIR/status.pre status file will get created. Pre status file stores the time when current crawl is started, and get all the files/dirs modified till that time. Once Post is called,
$SESSION_DIR/status.pre will be renamed to
$SESSION_DIR/status. content of this file will be used as start time for the next crawl.
During Pre, we can force the tool to do full find instead of incremental find. Tool uses
find command in brick backend to get list of files/dirs.
glusterfind create, in that node it generates ssh key($GLUSTERD_WORKDIR/glusterfind.secret.pem) and distributes to all Peers via Glusterd. Once ssh key is distributed in Trusted pool, tool can run ssh commands and copy files from other Volume nodes.
glusterfind pre is run, it internally runs
gluster volume info to get list of nodes and respective brick paths. For each brick, it calls respective node agents via ssh to find the modified files/dirs which are local them. Once each node agents generates output file, glusterfind collects all the files via scp and merges it into given output file.
glusterfind post is run, it renames
$SESSION_DIR/status.pre file to
Changelog Mode and GFID to Path conversion
Incremental find uses Changelogs to get the list of GFIDs modified/created. Any application expects file path instead of GFID. Their is no standard/easy way to convert from GFID to Path.
If we set build-pgfid option in Volume GlusterFS starts recording each files parent directory GFID as xattr in file on any ENTRY fop.
To convert from GFID to path, we can mount Volume with aux-gfid-mount option, and get Path information by a getfattr query.
getfattr -n glusterfs.ancestry.path -e text /mnt/datavol/.gfid/<GFID>
This approach is slow, for a requested file gets parent GFID via xattr and reads that directory to gets the file which is having same inode number as of GFID file. To improve the performance, glusterfind uses build-pgfid option, but instead of using getfattr on mount it gets the details from brick backend. glusterfind collects all parent GFIDs at once and starts crawling each directory. Instead of processing one GFID to Path conversion, it gets inode numbers of all input GFIDs and filter while reading parent directory.
Above method is fast compared to
find -samefile since it crawls only required directories to find files with same inode number as GFID file. But pgfid information only available when a lookup is made or any ENTRY fop to a file after enabling build-pgfid. The files created before build-pgfid enable will not get converted to path from GFID with this approach.
Tool collects the list of GFIDs failed to convert with above method and does a full crawl to convert it to path. Find command is used to crawl entire namespace. Instead of calling find command for every GFID, glusterfind uses an efficient way to convert all GFID to path with single call to
Create the session
glusterfind create SESSION_NAME VOLNAME [--force] glusterfind create --help
Where, SESSION_NAME is any name without space to identify when run second time. When a node is added to Volume then the tool expects ssh keys to be copied to new node(s) also. Run Create command with
--force to distribute keys again.
glusterfind create --help glusterfind create backup datavol glusterfind create antivirus_scanner datavol glusterfind create backup datavol --force
glusterfind pre SESSION_NAME VOLUME_NAME OUTFILE glusterfind pre --help
We need not specify Volume name since session already has the details. List of files will be populated in OUTFILE.
To trigger the full find, call the pre command with
--full argument. Multiple crawlers are available for incremental find, we can choose crawl type with
glusterfind pre backup datavol /root/backup.txt glusterfind pre backup datavol /root/backup.txt --full # Changelog based crawler, works only for incremental glusterfind pre backup datavol /root/backup.txt --crawler=changelog # Find based crawler, works for both full and incremental glusterfind pre backup datavol /root/backup.txt --crawler=brickfind
Output file contains list of files/dirs relative to the Volume mount, if we need to prefix with any path to have absolute path then,
glusterfind pre backup datavol /root/backup.txt --file-prefix=/mnt/datavol/
To get the list of sessions and respective session time,
glusterfind list [--session SESSION_NAME] [--volume VOLUME_NAME]
glusterfind list glusterfind list --session backup
SESSION VOLUME SESSION TIME --------------------------------------------------------------------------- backup datavol 2015-03-04 17:35:34
glusterfind post SESSION_NAME VOLUME_NAME
glusterfind post backup datavol
glusterfind delete SESSION_NAME VOLUME_NAME
glusterfind delete backup datavol
Adding more Crawlers
Adding more crawlers is very simple, Add an entry in
$GLUSTERD_WORKDIR/glusterfind.conf. glusterfind can choose your crawler using
[crawlers] changelog=/usr/libexec/glusterfs/glusterfind/changelog.py brickfind=/usr/libexec/glusterfs/glusterfind/brickfind.py
For example, if you have a multithreaded brick crawler, say
parallelbrickcrawl add it to the conf file.
[crawlers] changelog=/usr/libexec/glusterfs/glusterfind/changelog.py brickfind=/usr/libexec/glusterfs/glusterfind/brickfind.py parallelbrickcrawl=/root/parallelbrickcrawl
Custom crawler can be executable script/binary which accepts volume name, brick path, output_file and start time(and optional debug flag)
/root/parallelbrickcrawl SESSION_NAME VOLUME BRICK_PATH OUTFILE START_TIME [--debug]
START_TIME is in unix epoch format,
START_TIME will be zero for full find.
- Deleted files will not get listed, since we can't convert GFID to Path if file/dir is deleted.
- Only new name will get listed if Renamed.
- All hardlinks will get listed.