linux - linux

	Commit message (Collapse)	Author	Age	Files	Lines
*	block: add QUEUE_FLAG_NOWAIT	Mike Snitzer	2020-09-25	2	-4/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add QUEUE_FLAG_NOWAIT to allow a block device to advertise support for REQ_NOWAIT. Bio-based devices may set QUEUE_FLAG_NOWAIT where applicable. Update QUEUE_FLAG_MQ_DEFAULT to include QUEUE_FLAG_NOWAIT. Also update submit_bio_checks() to verify it is set for REQ_NOWAIT bios. Reported-by: Konstantin Khlebnikov <khlebnikov@yandex-team.ru> Suggested-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Mike Snitzer <snitzer@redhat.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	vsprintf: use bd_partno in bdev_name	Christoph Hellwig	2020-09-25	1	-2/+2
\| \| \| \| \| \| \|	No need to go through the hd_struct to find the partition number. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: use bd_partno in bdevname	Christoph Hellwig	2020-09-25	1	-1/+1
\| \| \| \| \| \| \|	No need to go through the hd_struct to find the partition number. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	target/iblock: fix holder printing in iblock_show_configfs_dev_params	Christoph Hellwig	2020-09-25	1	-3/+2
\| \| \| \| \| \| \| \| \|	bd_contains is never NULL for an open block device. In addition ibd_bd is always set to a block device that was exclusively opened by the target code, so the holder is guranteed to be ib_dev as well. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	drbd: don't set ->bd_contains	Christoph Hellwig	2020-09-25	1	-2/+0
\| \| \| \| \| \| \| \|	The ->bd_contains field is set by __blkdev_get and drivers have no business manipulating it. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	drbd: don't detour through bd_contains for the gendisk	Christoph Hellwig	2020-09-25	2	-2/+2
\| \| \| \| \| \| \|	bd_disk is set on all block devices, including those for partitions. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	md: don't detour through bd_contains for the gendisk	Christoph Hellwig	2020-09-25	2	-2/+2
\| \| \| \| \| \| \| \|	bd_disk is set on all block devices, including those for partitions. Signed-off-by: Christoph Hellwig <hch@lst.de> Acked-by: Song Liu <song@kernel.org> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	md: compare bd_disk instead of bd_contains	Christoph Hellwig	2020-09-25	1	-4/+3
\| \| \| \| \| \| \| \| \|	To check for partitions of the same disk bd_contains works as well, but bd_disk is way more obvious. Signed-off-by: Christoph Hellwig <hch@lst.de> Acked-by: Song Liu <song@kernel.org> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: add a bdev_is_partition helper	Christoph Hellwig	2020-09-25	10	-17/+22
\| \| \| \| \| \| \| \| \|	Add a littler helper to make the somewhat arcane bd_contains checks a little more obvious. Signed-off-by: Christoph Hellwig <hch@lst.de> Acked-by: Ulf Hansson <ulf.hansson@linaro.org> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	Documentation/hdio: fix up obscure bd_contains references	Christoph Hellwig	2020-09-25	1	-12/+12
\| \| \| \| \| \| \| \|	bd_contains is an implementation detail and should not be mentioned in a userspace API documentation. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bdi: replace BDI_CAP_NO_{WRITEBACK,ACCT_DIRTY} with a single flag	Christoph Hellwig	2020-09-24	10	-58/+29
\| \| \| \| \| \| \| \| \| \| \| \|	Replace the two negative flags that are always used together with a single positive flag that indicates the writeback capability instead of two related non-capabilities. Also remove the pointless wrappers to just check the flag. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bdi: invert BDI_CAP_NO_ACCT_WB	Christoph Hellwig	2020-09-24	4	-13/+8
\| \| \| \| \| \| \| \| \| \| \|	Replace BDI_CAP_NO_ACCT_WB with a positive BDI_CAP_WRITEBACK_ACCT to make the checks more obvious. Also remove the pointless bdi_cap_account_writeback wrapper that just obsfucates the check. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bdi: replace BDI_CAP_STABLE_WRITES with a queue and a sb flag	Christoph Hellwig	2020-09-24	18	-36/+33
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The BDI_CAP_STABLE_WRITES is one of the few bits of information in the backing_dev_info shared between the block drivers and the writeback code. To help untangling the dependency replace it with a queue flag and a superblock flag derived from it. This also helps with the case of e.g. a file system requiring stable writes due to its own checksumming, but not forcing it on other users of the block device like the swap code. One downside is that we an't support the stable_pages_required bdi attribute in sysfs anymore. It is replaced with a queue attribute which also is writable for easier testing. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	mm: use SWP_SYNCHRONOUS_IO more intelligently	Christoph Hellwig	2020-09-24	1	-8/+10
\| \| \| \| \| \| \| \| \| \|	There is no point in trying to call bdev_read_page if SWP_SYNCHRONOUS_IO is not set, as the device won't support it. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bdi: remove BDI_CAP_SYNCHRONOUS_IO	Christoph Hellwig	2020-09-24	6	-20/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	BDI_CAP_SYNCHRONOUS_IO is only checked in the swap code, and used to decided if ->rw_page can be used on a block device. Just check up for the method instead. The only complication is that zram needs a second set of block_device_operations as it can switch between modes that actually support ->rw_page and those who don't. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bdi: remove BDI_CAP_CGROUP_WRITEBACK	Christoph Hellwig	2020-09-24	3	-7/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Just checking SB_I_CGROUPWB for cgroup writeback support is enough. Either the file system allocates its own bdi (e.g. btrfs), in which case it is known to support cgroup writeback, or the bdi comes from the block layer, which always supports cgroup writeback. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: lift setting the readahead size into the block layer	Christoph Hellwig	2020-09-24	11	-68/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Drivers shouldn't really mess with the readahead size, as that is a VM concept. Instead set it based on the optimal I/O size by lifting the algorithm from the md driver when registering the disk. Also set bdi->io_pages there as well by applying the same scheme based on max_sectors. To ensure the limits work well for stacking drivers a new helper is added to update the readahead limits from the block limits, which is also called from disk_stack_limits. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Mike Snitzer <snitzer@redhat.com> Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com> Acked-by: Coly Li <colyli@suse.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	md: update the optimal I/O size on reshape	Christoph Hellwig	2020-09-24	2	-10/+22
\| \| \| \| \| \| \| \| \| \| \| \| \|	The raid5 and raid10 drivers currently update the read-ahead size, but not the optimal I/O size on reshape. To prepare for deriving the read-ahead size from the optimal I/O size make sure it is updated as well. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com> Acked-by: Song Liu <song@kernel.org> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bdi: initialize ->ra_pages and ->io_pages in bdi_init	Christoph Hellwig	2020-09-24	10	-15/+13
\| \| \| \| \| \| \| \| \| \| \| \| \|	Set up a readahead size by default, as very few users have a good reason to change it. This means code, ecryptfs, and orangefs now set up the values while they were previously missing it, while ubifs, mtd and vboxsf manually set it to 0 to avoid readahead. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Acked-by: David Sterba <dsterba@suse.com> [btrfs] Acked-by: Richard Weinberger <richard@nod.at> [ubifs, mtd] Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	aoe: set an optimal I/O size	Christoph Hellwig	2020-09-24	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	aoe forces a larger readahead size, but any reason to do larger I/O is not limited to readahead. Also set the optimal I/O size, and remove the local constants in favor of just using SZ_2G. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bcache: inherit the optimal I/O size	Christoph Hellwig	2020-09-24	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	Inherit the optimal I/O size setting just like the readahead window, as any reason to do larger I/O does not apply to just readahead. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com> Acked-by: Coly Li <colyli@suse.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	drbd: remove dead code in device_to_statistics	Christoph Hellwig	2020-09-24	1	-6/+0
\| \| \| \| \| \| \| \| \| \| \|	Ever since the switch to blk-mq, a lower device not used for VM writeback will not be marked congested, so the check will never trigger. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	fs: remove the unused SB_I_MULTIROOT flag	Christoph Hellwig	2020-09-24	2	-3/+2
\| \| \| \| \| \| \| \| \| \|	The last user of SB_I_MULTIROOT is disappeared with commit f2aedb713c28 ("NFS: Add fs_context support.") Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jan Kara <jack@suse.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: mark blkdev_get static	Christoph Hellwig	2020-09-23	2	-3/+1
\| \| \| \| \| \| \|	There are no users outside the core block code left now. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	PM: mm: cleanup swsusp_swap_check	Christoph Hellwig	2020-09-23	1	-6/+4
\| \| \| \| \| \| \| \|	Use blkdev_get_by_dev instead of bdget + blkdev_get. Signed-off-by: Christoph Hellwig <hch@lst.de> Acked-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	mm: split swap_type_of	Christoph Hellwig	2020-09-23	4	-40/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	swap_type_of is used for two entirely different purposes: (1) check what swap type a given device/offset corresponds to (2) find the first available swap device that can be written to Mixing both in a single function creates an unreadable mess. Create two separate functions instead, and switch both to pass a dev_t instead of a struct block_device to further simplify the code. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	PM: rewrite is_hibernate_resume_dev to not require an inode	Christoph Hellwig	2020-09-23	3	-9/+9
\| \| \| \| \| \| \| \| \|	Just check the dev_t to help simplifying the code. Signed-off-by: Christoph Hellwig <hch@lst.de> Acked-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Acked-by: Pavel Machek <pavel@ucw.cz> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	mm: cleanup claim_swapfile	Christoph Hellwig	2020-09-23	1	-3/+3
\| \| \| \| \| \| \|	Use blkdev_get_by_dev instead of bdgrab + blkdev_get. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	ocfs2: cleanup o2hb_region_dev_store	Christoph Hellwig	2020-09-23	1	-18/+10
\| \| \| \| \| \| \| \| \|	Use blkdev_get_by_dev instead of igrab (aka open coded bdgrab) + blkdev_get. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Joseph Qi <joseph.qi@linux.alibaba.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	dasd: cleanup dasd_scan_partitions	Christoph Hellwig	2020-09-23	1	-11/+4
\| \| \| \| \| \| \| \|	Use blkdev_get_by_dev instead of bdget_disk + blkdev_get. Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Stefan Haberland <sth@linux.ibm.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	raw: don't keep unopened block device around	Christoph Hellwig	2020-09-23	1	-32/+19
\| \| \| \| \| \| \| \|	Turn binding into a normal dev_t as the struct block device doesn't buy us anything and use blkdev_open_by_dev to actually open it. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	zram: cleanup backing_dev_store	Christoph Hellwig	2020-09-23	1	-3/+4
\| \| \| \| \| \| \|	Use blkdev_get_by_dev instead of bdgrab + blkdev_get. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	pktcdvd: use blkdev_get_by_dev instead of open coding it	Christoph Hellwig	2020-09-23	1	-14/+11
\| \| \| \| \| \| \|	Replace bdget + blkdev_get by blkdev_get_by_dev. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	pktcdvd: remove the if 0'ed pkt_start_recovery function	Christoph Hellwig	2020-09-23	1	-65/+2
\| \| \| \| \| \| \|	Remove code which has been dead since the initial commit. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: cleanup blkdev_bszset	Christoph Hellwig	2020-09-23	1	-7/+6
\| \| \| \| \| \| \|	Use blkdev_get_by_dev instead of bdgrab + blkdev_get. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: cleanup partition scanning in register_disk	Christoph Hellwig	2020-09-23	1	-19/+14
\| \| \| \| \| \| \| \| \|	Use blkdev_get_by_dev instead of open coding it using bdget_disk + blkdev_get, and split the code to read the partition table into a separate helper to make it a little more obvious. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: move the NEED_PART_SCAN flag to struct gendisk	Christoph Hellwig	2020-09-23	6	-14/+13
\| \| \| \| \| \| \| \|	We can only scan for partitions on the whole disk, so move the flag from struct block_device to struct gendisk. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: allow 'chunk_sectors' to be non-power-of-2	Mike Snitzer	2020-09-23	2	-9/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It is possible, albeit more unlikely, for a block device to have a non power-of-2 for chunk_sectors (e.g. 10+2 RAID6 with 128K chunk_sectors, which results in a full-stripe size of 1280K. This causes the RAID6's io_opt to be advertised as 1280K, and a stacked device _could_ then be made to use a blocksize, aka chunk_sectors, that matches non power-of-2 io_opt of underlying RAID6 -- resulting in stacked device's chunk_sectors being a non power-of-2). Update blk_queue_chunk_sectors() and blk_max_size_offset() to accommodate drivers that need a non power-of-2 chunk_sectors. Reviewed-by: Ming Lei <ming.lei@redhat.com> Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Mike Snitzer <snitzer@redhat.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: use lcm_not_zero() when stacking chunk_sectors	Mike Snitzer	2020-09-23	1	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Like 'io_opt', blk_stack_limits() should stack 'chunk_sectors' using lcm_not_zero() rather than min_not_zero() -- otherwise the final 'chunk_sectors' could result in sub-optimal alignment of IO to component devices in the IO stack. Also, if 'chunk_sectors' isn't a multiple of 'physical_block_size' then it is a bug in the driver and the device should be flagged as 'misaligned'. Reviewed-by: Ming Lei <ming.lei@redhat.com> Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com> Signed-off-by: Mike Snitzer <snitzer@redhat.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: fix bmd->is_null_mapped initialization	Christoph Hellwig	2020-09-23	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \|	bmd is allocated using kmalloc in bio_alloc_map_data, so make sure is_null_mapped is properly initialized to false for the !null_mapped case. Fixes: f3256075ba49 ("block: remove the BIO_NULL_MAPPED flag") Reported-by: Marc Hartmayer <mhartmay@linux.ibm.com> Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	block: drop double zeroing	Julia Lawall	2020-09-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	sg_init_table zeroes its first argument, so the allocation of that argument doesn't have to. the semantic patch that makes this change is as follows: (http://coccinelle.lip6.fr/) // <smpl> @@ expression x; @@ x = - kzalloc + kmalloc (...) ... sg_init_table(x,...) // </smpl> Signed-off-by: Julia Lawall <Julia.Lawall@inria.fr> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	blk-throttle: Avoid checking bps/iops limitation if bps or iops is unlimited	Baolin Wang	2020-09-15	1	-0/+12
\| \| \| \| \| \| \|	Do not need check the bps or iops limitation if bps or iops is unlimited. Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	blk-throttle: Avoid calculating bps/iops limitation repeatedly	Baolin Wang	2020-09-15	1	-9/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The tg_may_dispatch() will call tg_with_in_bps_limit() and tg_with_in_iops_limit() to check if we can dispatch a bio or not, which will calculate bps/iops limitation multiple times. But tg_may_dispatch() is always called under queue lock, which means the bps/iops limitation will not change in tg_may_dispatch(). So we can calculate the bps/iops limitation only once, and pass them to tg_with_in_bps_limit() and tg_with_in_iops_limit() to avoid calculating bps/iops limitation repeatedly. Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	blk-throttle: Define readable macros instead of static variables	Baolin Wang	2020-09-15	1	-5/+5
\| \| \| \| \| \| \| \| \|	The 'throtl_grp_quantum' and 'throtl_quantum' are both read-only variables, thus better to use readable macros instead of static variables, which can also save some spaces for .bss area. Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	blk-throttle: Use readable READ/WRITE macros	Baolin Wang	2020-09-15	1	-2/+2
\| \| \| \| \| \| \|	Use readable READ/WRITE macros instead of magic numbers. Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	blk-throttle: Fix some comments' typos	Baolin Wang	2020-09-15	1	-7/+7
\| \| \| \| \| \| \|	Fix some comments' typos. Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	iocost: fix infinite loop bug in adjust_inuse_and_calc_cost()	Tejun Heo	2020-09-15	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	adjust_inuse_and_calc_cost() is responsible for reducing the amount of donated weights dynamically in period as the budget runs low. Because we don't want to do full donation calculation in period, we keep latching up inuse by INUSE_ADJ_STEP_PCT of the active weight of the cgroup until the resulting hweight_inuse is satisfactory. Unfortunately, the adj_step calculation was reading the active weight before acquiring ioc->lock. Because the current thread could have lost race to activate the iocg to another thread before entering this function, it may read the active weight as zero before acquiring ioc->lock. When this happens, the adj_step is calculated as zero and the incremental adjustment loop becomes an infinite one. Fix it by fetching the active weight after acquiring ioc->lock. Fixes: b0853ab4a238 ("blk-iocost: revamp in-period donation snapbacks") Signed-off-by: Tejun Heo <tj@kernel.org> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	blk-iocost: fix divide-by-zero in transfer_surpluses()	Tejun Heo	2020-09-12	1	-4/+10
\| \| \| \| \| \| \| \| \| \| \|	Conceptually, root_iocg->hweight_donating must be less than WEIGHT_ONE but all hweight calculations round up and thus it may end up >= WEIGHT_ONE triggering divide-by-zero and other issues. Bound the value to avoid surprises. Fixes: e08d02aa5fc9 ("blk-iocost: implement Andy's method for donation weight updates") Signed-off-by: Tejun Heo <tj@kernel.org> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	bcache: use part_[begin\|end]_io_acct instead of disk_[begin\|end]_io_acct	Song Liu	2020-09-12	1	-4/+6
\| \| \| \| \| \| \| \| \|	This enables proper statistics in /proc/diskstats for bcache partitions. Signed-off-by: Song Liu <songliubraving@fb.com> Reviewed-by: Coly Li <colyli@suse.de> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>
*	md: use part_[begin\|end]_io_acct instead of disk_[begin\|end]_io_acct	Song Liu	2020-09-12	1	-4/+4
\| \| \| \| \| \| \| \|	This enables proper statistics in /proc/diskstats for md partitions. Signed-off-by: Song Liu <songliubraving@fb.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@kernel.dk>