btrfs: Flush before reflinking any extent to prevent NOCOW write falling back to...

author Qu Wenruo <wqu@suse.com>

Wed, 8 May 2019 10:49:58 +0000 (18:49 +0800)

committer David Sterba <dsterba@suse.com>

Mon, 1 Jul 2019 11:35:00 +0000 (13:35 +0200)
author Qu Wenruo <wqu@suse.com>
Wed, 8 May 2019 10:49:58 +0000 (18:49 +0800)
committer David Sterba <dsterba@suse.com>
Mon, 1 Jul 2019 11:35:00 +0000 (13:35 +0200)
diff --git a/fs/btrfs/ioctl.c b/fs/btrfs/ioctl.c

index 2a1be0d..5b4beeb 100644 (file)
--- a/fs/btrfs/ioctl.c
+++ b/fs/btrfs/ioctl.c
@@ -3999,6 +3999,27 @@ static int btrfs_remap_file_range_prep(struct file *file_in, loff_t pos_in,
         if (!same_inode)
                 inode_dio_wait(inode_out);
  
+       /*
+        * Workaround to make sure NOCOW buffered write reach disk as NOCOW.
+        *
+        * Btrfs' back references do not have a block level granularity, they
+        * work at the whole extent level.
+        * NOCOW buffered write without data space reserved may not be able
+        * to fall back to CoW due to lack of data space, thus could cause
+        * data loss.
+        *
+        * Here we take a shortcut by flushing the whole inode, so that all
+        * nocow write should reach disk as nocow before we increase the
+        * reference of the extent. We could do better by only flushing NOCOW
+        * data, but that needs extra accounting.
+        *
+        * Also we don't need to check ASYNC_EXTENT, as async extent will be
+        * CoWed anyway, not affecting nocow part.
+        */
+       ret = filemap_flush(inode_in->i_mapping);
+       if (ret < 0)
+               return ret;
+
         ret = btrfs_wait_ordered_range(inode_in, ALIGN_DOWN(pos_in, bs),
                                        wb_len);
         if (ret < 0)
author	Qu Wenruo <wqu@suse.com>
	Wed, 8 May 2019 10:49:58 +0000 (18:49 +0800)
committer	David Sterba <dsterba@suse.com>
	Mon, 1 Jul 2019 11:35:00 +0000 (13:35 +0200)